Word vector generation method and device, computer equipment and storage medium

A word vector and computer technology, applied in the field of word vector generation, computer equipment and storage media, can solve the problems of low efficiency and low quality of word vector generation, and achieve the effect of improving generation quality, reducing the number of annotations, and improving training efficiency

Pending Publication Date: 2022-07-08
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Based on this, it is necessary to address the above-mentioned technical problems and provide a method, device, computer equipment and storage medium for generating word vectors to solve the problems of low generation efficiency and low quality of word vectors

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word vector generation method and device, computer equipment and storage medium
  • Word vector generation method and device, computer equipment and storage medium
  • Word vector generation method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In the following description, for the purpose of illustration rather than limitation, specific details such as specific system structures and technologies are set forth in order to provide a thorough understanding of the embodiments of the present invention. However, it will be apparent to those skilled in the art that the present invention may be practiced in other embodiments without these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary detail.

[0046] It is to be understood that the term "comprising" when used in the present specification and appended claims indicates the presence of the described feature, integer, step, operation, element and / or component, but does not exclude one or more other The presence or addition of features, integers, steps, operations, elements, components and / or sets thereof.

[0047]...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an artificial intelligence technology, and provides a word vector generation method and device, computer equipment and a storage medium, and the method comprises the steps: training a word in a corpus through a preset model, and obtaining an initialization vector of the word in the corpus; the candidate words of the to-be-constructed word vector words generated based on the initialization vectors of the words are sorted through the preset sorting model to obtain the positive correlation set and the negative correlation set of the to-be-constructed word vector words, and the candidate words are sorted, so that the number of annotations of the words is reduced, the training efficiency of the model is improved, and the training efficiency of the model is improved. The positive correlation set and negative correlation set samples are obtained through the preset sorting model, the quality of the samples is improved, the high-quality samples are sent into the contrast learning model for training, and the generation quality of the word vector of the word vector word to be constructed is improved.

Description

technical field [0001] The present invention relates to the field of artificial intelligence, and in particular, to a method, device, computer equipment and storage medium for generating word vectors. Background technique [0002] Natural language processing is an important direction in the field of computer science and artificial intelligence. At present, natural language processing tasks include machine translation, sentiment analysis, text summarization, text classification and information extraction. In natural language processing tasks, the first step is to consider how to enable computers to represent natural language. Computers cannot directly represent natural language. Therefore, we need to design a method to mathematicalize natural language so that computers can process it. This is word vector. [0003] In the prior art, word vectors are mainly obtained by an active learning method or a comparative learning method. In the active learning method, a large amount of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/284G06F40/30G06F40/247
CPCG06F40/284G06F40/30G06F40/247
Inventor 司世景王健宗张传尧
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products