Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

A word sense disambiguation method and system based on graph model

A technology of word sense disambiguation and graph model, which is applied in the field of word sense disambiguation method and system based on graph model, and can solve problems such as inapplicability of Chinese

Active Publication Date: 2019-02-19
ZAOZHUANG UNIV
View PDF9 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this technical solution uses the semantic relationship contained in BabelNet, not the semantic knowledge in HowNet; it is suitable for English word sense disambiguation, but it is not applicable to Chinese, and it cannot solve how to combine multiple Chinese and English resources to complement each other. , realize the problem of fully mining the disambiguation knowledge in resources and improving the performance of word sense disambiguation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A word sense disambiguation method and system based on graph model
  • A word sense disambiguation method and system based on graph model
  • A word sense disambiguation method and system based on graph model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0101] as attached figure 1 Shown, the word sense disambiguation method and system based on graph model of the present invention, comprise the steps:

[0102] S1. Extract contextual knowledge: perform part-of-speech tagging on ambiguous sentences, extract content words as context knowledge, and content words refer to nouns, verbs, adjectives, and adverbs;

[0103] Example: Take the treatment of “Around the implementation of the “Guiding Opinions” and in combination with the actual work of traditional Chinese medicine, all localities should intensify their efforts to actively and steadily promote the reform of traditional Chinese medicine medical institutions.” words. The part-of-speech tagging process uses the word segmentation system NLPIR-ICTCLAS of the Chinese Academy of Sciences. After part-of-speech tagging, "around / v" / wkz guidance / v opinion / n" / ude1 implementation / vn implementation / vn of / wky, / ude1 actuality / n of / wd combined with / v traditional Chinese medicine / n work / vn,...

Embodiment 2

[0165] as attached Figure 5 Shown, the word sense disambiguation system based on graph model of the present invention, this system comprises,

[0166] The contextual knowledge extraction unit performs part-of-speech tagging on ambiguous sentences, and extracts content words as context knowledge, and content words refer to nouns, verbs, adjectives, and adverbs;

[0167] The similarity calculation unit is used to perform English-based similarity calculation, word vector-based similarity calculation, and HowNet-based similarity calculation. The similarity calculation unit includes:

[0168] The English similarity calculation unit is used to mark the context knowledge with HowNet word meaning information, and perform word meaning mapping processing to obtain a set of English words; then use the word similarity calculation algorithm based on word vectors and knowledge base to perform similarity on the obtained English words Calculation; considering that HowNet is bilingual, the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a word sense disambiguation method and system based on a graph model, and belongs to the field of natural language processing technology. The technical problem to be solved bythe present invention is how to combine multiple Chinese and English resources, complement each other's advantages, realize full exploitation of disambiguation knowledge in resources, and improve wordsense disambiguation performance.The technical scheme adopted is as follows: 1, a word sense disambiguation method based on graph model, comprising the following steps: S1, extracting contextual knowledge: carrying out part-of-speech tagging on ambiguous sentences, extracting substantive words as contextual knowledge, wherein the substantive words refer to nouns, verbs, adjectives and adverbs; S2, similarity calculation: performing similarity calculation based on English, similarity calculation based on word vector and similarity calculation based on HowNet; 3, constructing a disambiguation graph; S4, performing the correct choice of word meaning. 2, A word sense disambiguation system based on graph model, which comprises a context knowledge extraction unit, a similarity calculation unit,a disambiguation graph construction unit and a word sense correct selection unit.

Description

technical field [0001] The invention relates to the technical field of natural language processing, in particular to a word sense disambiguation method and system based on a graph model. Background technique [0002] Word sense disambiguation refers to determining the specific meaning of ambiguous words according to the specific context in which they are located. It is a basic research in the field of natural language processing. have a direct impact. Whether it is Chinese or English and other Western languages, the phenomenon of polysemy is common. [0003] Traditional graph-based methods for disambiguating Chinese word senses mainly use one or more Chinese knowledge resources, and are plagued by insufficient knowledge resources, resulting in low word sense disambiguation performance. Therefore, how to combine a variety of Chinese and English resources, complement each other's advantages, fully tap the disambiguation knowledge in the resources, and improve the performance...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F16/31
CPCG06F40/205G06F40/211G06F40/30Y02D10/00
Inventor 孟凡擎燕孝飞张强陈文平鹿文鹏
Owner ZAOZHUANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products