Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Training method and device for word sense disambiguation model

A word sense disambiguation and training method technology, applied in the computer field, can solve the problem of inaccurate disambiguation results, achieve accurate word sense disambiguation, accurate word sense disambiguation model, and improve accuracy

Active Publication Date: 2020-06-19
ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
View PDF9 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003]However, when using this method for word sense disambiguation, the disambiguation results are usually not accurate enough

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Training method and device for word sense disambiguation model
  • Training method and device for word sense disambiguation model
  • Training method and device for word sense disambiguation model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The solutions provided in this specification will be described below in conjunction with the accompanying drawings.

[0057] As mentioned in the background art, when word sense disambiguation is performed based on a supervised learning method, the disambiguation result is often not accurate enough. In order to improve the accuracy of disambiguation results, those skilled in the art also try to use the following two methods to disambiguate word senses: a) perform word sense disambiguation based on word meaning definitions in dictionaries. Specifically, if the word meaning s_i of w in the dictionary contains the vocabulary e, then if e also appears in a sentence containing w, then it is considered that the word meaning of w in the sentence should be s_i. However, this method strongly depends on the definition of the dictionary. If there is no e in the sentence, but only the synonyms of e appear, the result cannot be obtained. b) Word sense disambiguation based on topics....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a training method and device for a word sense disambiguation model, and the method comprises the steps: obtaining a word co-occurrence graph and a semantic association graph; and selecting a first word from the training text; obtaining a positive example sample and a negative example sample corresponding to the first word; calculating the similarity betweeneach word in the training text and the word represented by each node in each semantic association graph, and selecting a target association graph based on the similarity;based on the target association graph, determining a semantic vector of the first word, and based on the word co-occurrence graph, determining word vectors of other words; based on the determined semantic vector and word vector,encoding by utilizing an encoder; based on the word co-occurrence graph, determining a word vector of each word in the two samples; carrying out encoding by using an encoder according to the determined word vecto; based on the coding result, calculating a first text distance between the training text and the positive example sample, and calculating a second text distance between the training textand the negative example sample; and training an encoder by taking the condition that the first text distance is smaller than the second text distance as a target.

Description

technical field [0001] One or more embodiments of this specification relate to the field of computer technology, and in particular to a method and device for training a word sense disambiguation model. Background technique [0002] Word sense disambiguation refers to automatically judging the current meaning of a polysemous word based on the context of the word. In traditional techniques, word sense disambiguation is usually based on supervised learning methods. For example, based on the context C, the posterior probability P(s_i|C) of each sense s_i of the word to be disambiguated is obtained through a supervised learning method. The meaning s_k=argmaxP(s_i|C) of the maximum posterior probability is taken as the meaning determined after disambiguation. [0003] However, when word sense disambiguation is performed by this method, the disambiguation results are usually not accurate enough. Therefore, a more accurate word sense disambiguation method needs to be provided. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/30G06F40/284G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/044G06N3/045G06F18/22
Inventor 钱隽夫
Owner ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products