
English word sense disambiguation method based on phrase structure syntax tree

Word sense disambiguation and syntax tree technology, applied to natural language data processing and special data processing applications, that calculates word sense relatedness more accurately, avoids inaccurate screening and weighting of context words, and improves disambiguation accuracy.

Inactive Publication Date: 2016-06-15
QILU UNIV OF TECH

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to overcome the deficiencies of existing word sense disambiguation technology, chiefly in the screening and weighting of sense-related context words and in the calculation of word sense relatedness, by proposing a new English word sense disambiguation method based on the phrase structure syntax tree.


Examples


Embodiment Construction

[0044] The present invention will be further described in detail below in conjunction with specific embodiments.

[0045] Take the sentence "The coaches teaching football are standing on the bus." as an example, and disambiguate the ambiguous word coach in it.

[0046] According to the WordNet 3.0 dictionary, the senses of the ambiguous word coach are those shown in Table 1.

[0047] Table 1. Senses of coach#n

[0048]
Sense number | Gloss
coach#n#1 | coach, manager, handler -- ((sports) someone in charge of training an athlete or a team)
coach#n#2 | coach, private instructor, tutor -- (a person who gives private instruction (as in singing, acting, etc.))
coach#n#3 | passenger car, coach, carriage -- (a railcar where passengers ride)
coach#n#4 | coach, four-in-hand, coach-and-four -- (a carriage pulled by four horses with one driver)
coach#n#5 | bus, autobus, coach, charabanc, double-decker, jitney, motorbus, ...
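
To make the example concrete, the following is a minimal sketch of how context words might be screened and weighted by their distance from the ambiguous word in a phrase structure tree. It is not the patented procedure, whose details are not given in this excerpt. The tree is hand-written (assuming the example sentence reads roughly "The coaches teaching football are standing on the bus"), and the names `CONTENT_TAGS`, `related_words`, and `disambiguate`, the inverse-distance weighting, and every value in the `RELATEDNESS` table are hypothetical choices made only for illustration.

```python
# Hand-written phrase structure tree (an assumption, not parser output).
# Each node is (label, child, ...); a preterminal is (POS tag, word).
TREE = (
    "S",
    ("NP",
     ("NP", ("DT", "The"), ("NNS", "coaches")),
     ("VP", ("VBG", "teaching"), ("NP", ("NN", "football")))),
    ("VP",
     ("VBP", "are"),
     ("VP", ("VBG", "standing"),
      ("PP", ("IN", "on"), ("NP", ("DT", "the"), ("NN", "bus"))))),
)

# Hypothetical filter: only these POS tags count as candidate related words.
CONTENT_TAGS = {"NN", "NNS", "VBG"}


def leaf_paths(node, path=()):
    """Yield (word, tag, path) per preterminal; path = child indices from the root."""
    label, *children = node
    if len(children) == 1 and isinstance(children[0], str):
        yield children[0], label, path
        return
    for i, child in enumerate(children):
        yield from leaf_paths(child, path + (i,))


def tree_distance(p, q):
    """Number of tree edges between two nodes identified by their root paths."""
    common = 0
    for a, b in zip(p, q):
        if a != b:
            break
        common += 1
    return len(p) + len(q) - 2 * common


def related_words(tree, target, max_distance):
    """Map each screened context word to an (assumed) weight of 1 / distance."""
    leaves = list(leaf_paths(tree))
    target_path = next(path for word, _, path in leaves if word == target)
    weights = {}
    for word, tag, path in leaves:
        d = tree_distance(target_path, path)
        if word != target and tag in CONTENT_TAGS and d <= max_distance:
            weights[word] = 1.0 / d
    return weights


# Toy relatedness values, invented for this sketch (not taken from WordNet).
RELATEDNESS = {
    ("coach#n#1", "teaching"): 0.6,
    ("coach#n#1", "football"): 0.8,
    ("coach#n#2", "teaching"): 0.7,
    ("coach#n#3", "bus"): 0.5,
    ("coach#n#5", "bus"): 0.9,
}
SENSES = [f"coach#n#{i}" for i in range(1, 6)]


def disambiguate(weights):
    """Pick the sense with the highest weighted relatedness to the context."""
    def score(sense):
        return sum(w * RELATEDNESS.get((sense, word), 0.0)
                   for word, w in weights.items())
    return max(SENSES, key=score)


weights = related_words(TREE, "coaches", max_distance=8)
print(weights)
print(disambiguate(weights))
```

In this toy setting "teaching" and "football" sit in the same noun phrase as "coaches" and so receive larger weights than "bus", which is the intuition behind using the tree rather than a flat word window. A real system would obtain the tree from a parser and the relatedness values from a lexical resource.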


Abstract

The invention relates to an English word sense disambiguation method based on a phrase structure syntax tree, and belongs to the field of natural language processing. The method comprises the following steps:
1. Phrase structure syntactic analysis is performed on a sentence to generate its phrase structure syntax tree.
2. Sense-related words are screened on the basis of the phrase structure syntax tree.
3. A word sense disambiguation model is constructed, and the correct sense is determined by evaluating how closely each sense of the ambiguous word is related to the sense-related words.
4. The parameters of the word sense disambiguation model from step 3 are optimized with a genetic algorithm on a sense-tagged corpus.
5. Steps 1 and 2 are repeated for each word to be disambiguated, and the correct sense of the ambiguous word is determined with the optimized model obtained in step 4.
Using the phrase structure syntax tree to screen the sense-related words and to assign them disambiguation weights reduces interference from noise words, improves the accuracy of the word sense relatedness calculation, and thereby improves the accuracy of English word sense disambiguation.
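
Step 4 can be illustrated with a small genetic algorithm. The sketch below is only an assumption about what such an optimization might look like: the abstract does not say which parameters the model has, so the two-parameter genome (a distance-decay exponent `alpha` and a distance `cutoff`), the toy corpus, the sense labels `s1` and `s2`, and all numeric settings are hypothetical.

```python
import random

# Toy sense-tagged corpus (invented values). Each instance is a list of
# (tree distance, {sense: relatedness}) context entries plus the gold sense.
CORPUS = [
    ([(4, {"s1": 0.6}), (5, {"s1": 0.8}), (8, {"s2": 0.9})], "s1"),
    ([(2, {"s2": 0.5}), (7, {"s1": 0.9})], "s2"),
    ([(3, {"s1": 0.4}), (9, {"s2": 1.0}), (10, {"s2": 1.0})], "s1"),
]
SENSES = ("s1", "s2")
ALPHA_RANGE = (0.0, 3.0)    # assumed bounds for the decay exponent
CUTOFF_RANGE = (1.0, 10.0)  # assumed bounds for the distance cutoff


def predict(genome, context):
    """Score each sense as sum(relatedness * distance ** -alpha) within the cutoff."""
    alpha, cutoff = genome
    scores = {s: 0.0 for s in SENSES}
    for dist, rel in context:
        if dist <= cutoff:
            for sense, r in rel.items():
                scores[sense] += r * dist ** -alpha
    return max(SENSES, key=lambda s: scores[s])


def fitness(genome):
    """Fraction of corpus instances whose gold sense is predicted."""
    hits = sum(predict(genome, ctx) == gold for ctx, gold in CORPUS)
    return hits / len(CORPUS)


def clamp(value, bounds):
    return min(bounds[1], max(bounds[0], value))


def evolve(generations=30, pop_size=20, seed=0):
    rng = random.Random(seed)

    def random_genome():
        return (rng.uniform(*ALPHA_RANGE), rng.uniform(*CUTOFF_RANGE))

    def tournament(pop):
        # Pick the fittest of three randomly chosen individuals.
        return max(rng.sample(pop, 3), key=fitness)

    pop = [random_genome() for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        elite = pop[:2]  # elitism: the two best genomes survive unchanged
        children = []
        while len(children) < pop_size - len(elite):
            a, b = tournament(pop), tournament(pop)
            # Uniform crossover: each gene comes from one parent at random.
            child = tuple(rng.choice(pair) for pair in zip(a, b))
            if rng.random() < 0.3:  # Gaussian mutation, clamped to the bounds
                child = (clamp(child[0] + rng.gauss(0, 0.3), ALPHA_RANGE),
                         clamp(child[1] + rng.gauss(0, 1.0), CUTOFF_RANGE))
            children.append(child)
        pop = elite + children
    return max(pop, key=fitness)


best = evolve()
print(best, fitness(best))
```

Accuracy on the tagged corpus serves as the fitness function, so the search needs no gradient. That is one plausible reason to choose a genetic algorithm for a model whose output is a discrete sense choice.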

Description

Technical field

[0001] The invention relates to an English word sense disambiguation method, in particular to an English word sense disambiguation method based on a phrase structure syntax tree, and belongs to the technical field of natural language processing.

Background art

[0002] Word sense disambiguation refers to judging the correct meaning of an ambiguous word according to its context. Word meaning is the basic unit that constitutes the meaning of a sentence and the premise of understanding a sentence. Word sense disambiguation is a basic task in the field of natural language processing, and has a wide range of application requirements in machine translation, information retrieval, text classification, question answering systems and other fields.

[0003] The meaning of an ambiguous word is determined by its context. The ability to accurately select contextual word sense related words will directly affect the performance of the word sense disambiguation syste...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27
CPC: G06F40/242, G06F40/253
Inventor: 鹿文鹏, 成金勇, 张维玉
Owner: QILU UNIV OF TECH