Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for extracting keyword from text

A keyword and text technology, applied in the field of keyword extraction using deep learning algorithms, can solve the problem of low accuracy and achieve high extraction accuracy

Active Publication Date: 2017-03-15
SHANGHAI XIAOI ROBOT TECH CO LTD
View PDF3 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This type of algorithm is a dimensionality reduction algorithm for text keywords, and the accuracy rate is not very high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for extracting keyword from text
  • Method and device for extracting keyword from text
  • Method and device for extracting keyword from text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The technical solution of the present invention is different from the prior art, and utilizes word part-of-speech information, word semantic information and word statistical information at the same time, and uses deep learning technology to extract text keywords. However, the neural network using deep learning technology must first convert the relevant expressions into digital expressions. The present invention provides indexed digital expressions of words, digital expressions of part-of-speech tagging information, and digital expressions of statistical information of words. provides the basis for its use.

[0038] Please see figure 1 , figure 1 A flow chart showing one aspect of the method for extracting keywords from text of the present invention, the method includes:

[0039] 101: Segment the text to obtain the words corresponding to the text;

[0040]102: Carry out part-of-speech tagging on words;

[0041] 103: adding statistical information to words;

[0042] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method for extracting a keyword from a text. The method comprises the following steps: dividing the text into words to obtain the words corresponding to the text; labeling word characteristics of the words; adding statistical information to the words; indexing the words; inputting label information, the statistical information and indexing information of the words into a deep learning extraction model so as to obtain importance weights of the words; according to the importance weights, selecting at least one corresponding word as the keyword of the text. In correspondence to the method, the invention further provides a device for extracting the keyword from the text.

Description

technical field [0001] The present invention relates to a method and device for extracting keywords from a text, in particular to a method and device for extracting keywords using a deep learning algorithm. Background technique [0002] The continuous development of information technology has led to explosive growth of information in many fields. How to quickly and accurately obtain the required information from large-scale text information has become a huge challenge. Keyword extraction is an effective means to solve the above problems. It is one of the core technologies in the field of text mining and plays a very important role. [0003] Existing keyword extraction techniques include pLSA (probabilistic latent semantic analysis), LDA (latent Dirichlet distribution), SVD (singular value decomposition), LSA (latent semantic analysis), TFI-DF (term frequency-inverse document frequency) and other algorithms. [0004] The TFI-DF algorithm believes that the most meaningful wo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/31G06F40/205G06F40/216G06F40/30
Inventor 贾祯白杨朱频频
Owner SHANGHAI XIAOI ROBOT TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products