Word vector training method and device

A training method and word vector technology, applied in the field of word vector training methods and devices, can solve problems such as large amount of calculation

Active Publication Date: 2017-06-06
BEIHANG UNIV
View PDF2 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Using the existing technology, when a new vocabulary is added to the vocabulary, because the frequency of each word has changed, it is necessary to re-learn all the vocabulary in the new vocabulary to obtain new word vectors for each vocabulary, training words The amount of calculation is large when vector

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word vector training method and device
  • Word vector training method and device
  • Word vector training method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0047] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and the above-mentioned drawings are used to distinguish similar objects, without having to use To describe a specific order or sequence. It should be understood that the data used in this way can be interchanged under appropriate circumstances, so that the embodiments of the present invention described herein can, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a word vector training method and a word vector training device. The word vector training method includes: acquiring a newly occurring word bank, wherein words in the newly occurring word bank and words in an old word bank form a new word bank, and words in the old word bank correspond to old word vectors; performing initialization treatment on the words in the newly occurring word bank so as to achieve the purposes that word vectors of the words in the newly occurring word bank, which belong to the old word bank, are the old word vectors, and word vectors of the words in the newly occurring word bank, which only belong to the newly occurring word bank, are random word vectors; respectively updating the word vectors of the words in the newly occurring word bank according to noise distribution corresponding to the old word bank and noise distribution corresponding to the newly occurring word bank. The word vector training method and the word vector training device reduce calculation amount in training of the word vectors.

Description

Technical field [0001] The present invention relates to machine learning technology, in particular to a word vector training method and device. Background technique [0002] In machine learning technology, in order to make the machine understand the meaning of human language, the word representation tool of the neural network language model converts each vocabulary in the human language into the form of word vector, so that the computer can learn the human language through the word vector The meaning of each word. [0003] In the prior art, the word representation tool obtains the word vector of each vocabulary by learning all the words in the vocabulary. [0004] With the existing technology, when a new vocabulary is added to the vocabulary, because the frequency of each word has changed, it is necessary to relearn all the vocabulary in the new vocabulary to obtain new word vectors for each vocabulary, and to train the words The amount of calculation is large when the vector is us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/2365G06F40/242G06F40/284
Inventor 李建欣刘垚鹏彭浩陈汉腾张日崇
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products