Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Word vector language model based on emergencies

A technology of emergencies and language models, applied in biological neural network models, natural language data processing, semantic analysis, etc., can solve problems such as the weakening of semantic information expression ability

Active Publication Date: 2020-01-14
RENMIN UNIVERSITY OF CHINA +1
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This approach is well motivated, but for some words can lead to over-smoothing of differences between word senses that have changed over time
[0011] 3) Although the non-random initialization method solves the alignment problem, it makes the words have rich semantic information of the previous corpus, and the ability to express semantic information of current emergencies is weakened

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word vector language model based on emergencies
  • Word vector language model based on emergencies
  • Word vector language model based on emergencies

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The technical solutions of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0037] The present invention will be further explained below in combination with specific embodiments.

[0038] The method of the invention involves technologies such as intelligent analysis and language model, and can be used to generate word vectors in text stream data, and makes the generated word vectors more contain emergency information.

[0039] The present invention proposes a word embedding model (Bursty Word Embeddings, BWE) based on emergencies based on the CBOW structure in the Word2Vec model.

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a word vector language model based on emergencies. According to the language model, a traditional Word2Vec model is used for training a context; wherein the training comprises the steps of calculating input hidden layer information in an input layer of the model, adding vector representation of an emergency into the input layer, and obtaining final hidden layer representation through weighted summation to enable the vector representation of the emergency and the final hidden layer representation to jointly influence the final hidden layer representation; the generated hidden layer representation is not only related to the context, but also related to the emergency. The invention provides a new word vector model related to emergencies. The word vector model is used for modeling text stream data accommodating emergencies. According to the method, the word vector model with emergency characteristics can be learned to identify semantic changes, and emergency vector representation is added to improve semantic correlation.

Description

technical field [0001] The invention relates to the technical field of dynamic word vector generation models, in particular to a word vector language model based on emergencies. Background technique [0002] The semantics of words change over time, and many factors affect the semantics of words, including cultural changes, the emergence of new technologies, etc. For example, the word "Amazon" originally referred to the tropical rainforest. When the word "Amazon" is used, people tend to think it refers to e-commerce. [0003] Emergent events refer to events that are suddenly mentioned many times in a short period of time, such as "Notre Dame de Paris fire" and "Taiwan earthquake". In the emergency event detection model, the emergency event is generally expressed as <target word, start time, end time>. In previous work, people only focus on the representation of semantics in bursts, we propose to combine bursts and word vectors to capture and understand the correlation...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F40/30G06F40/289G06K9/62G06N3/04
CPCG06F16/355G06N3/045G06F18/22G06F18/2411
Inventor 赵鑫朱秋昱张明
Owner RENMIN UNIVERSITY OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products