Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Named entity recognizing method based on LSTM

A technology of named entity recognition and gradient descent algorithm, applied in the information field, can solve the problems of few network layers, low recognition rate of unregistered words, and insufficient recognition accuracy, and achieve the effect of improving accuracy

Inactive Publication Date: 2018-04-06
北京知道未来信息技术有限公司
View PDF5 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] Dictionary-based named entity recognition relies heavily on the dictionary database and cannot identify unregistered words
HMM (Hidden Markov) and CRF (Conditional Random Field) methods based on word frequency statistics can only associate the semantics of the next word with the previous word, and the recognition accuracy is not high enough, especially the recognition rate of unregistered words is low
The method based on the artificial neural network model has the problem of gradient disappearance during training, and in actual applications, the number of network layers is small, and the final named entity recognition results have no obvious advantages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity recognizing method based on LSTM
  • Named entity recognizing method based on LSTM
  • Named entity recognizing method based on LSTM

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to make the above objects, features and advantages of the present invention more obvious and understandable, the present invention will be further described in detail below through specific implementation cases and in conjunction with the accompanying drawings.

[0031] The invention discloses a named entity recognition method based on LSTM, such as recognizing a person's name, a place name, an organization name, a brand name, a company name, etc. from an unstructured text. The core problem to be solved in the present invention comprises two: 1. use LSTM-CRF model to improve the precision of named entity recognition; 2. add the feature of the character vector of word, solve the recognition (Out of Vocabulary, OV ).

[0032] In order to improve the accuracy of named entity recognition, the present invention adds LSTM character features and LSTM character epithet feature layers on top of the traditional CRF model, and its detailed structure is as follows figure...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a named entity recognizing method based on LSTM. The method includes the steps that training linguistic data of named entity recognition is labeled to form labeled linguisticdata; words and characters in the labeled linguistic data are converted into vector quantities; a named entity recognition model based on LSTM is set up by using word and character vector quantities,and parameters of the named entity recognition model are trained; named entity recognition prediction is conducted on the data to be predicted through the trained named entity recognition model. By adopting the vector quantities based on the words and characters, characteristics of the characters and the words can be obtained, and the problem of unregistered words can be avoided; in addition, compared with a traditional pure CRF model algorithm, the named entity recognizing method adopting long short-term memory neural network is capable of absorbing more character and word characteristics tofurther improve entity recognition precision.

Description

technical field [0001] The invention belongs to the field of information technology, and in particular relates to an LSTM-based named entity recognition method. Background technique [0002] Named Entity Recognition (NER for short) refers to identifying entities with specific meanings in text, mainly including names of people, places, institutions, and proper nouns. Practical scenarios for named entity recognition methods include: [0003] Scenario 1: Event detection. Place, time, and person are several basic components of time. When constructing an event summary, relevant persons, places, units, etc. can be highlighted. In the event search system, related people, time, and places can be used as index keywords. The relationship between several components of an event describes the event in more detail at the semantic level. [0004] Scenario 2: Information retrieval. Named entities can be used to enhance and improve the effect of the retrieval system. When the user enter...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27
CPCG06F40/295G06F40/30
Inventor 岳永鹏唐华阳
Owner 北京知道未来信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products