Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

A text implication relation recognition method based on multi-granularity information fusion

A relational recognition and multi-granularity technology, applied in neural learning methods, character and pattern recognition, text database clustering/classification, etc., can solve problems such as multi-time, lack of semantic reasoning, ignoring sentence interaction information, etc.

Active Publication Date: 2019-02-01
SUN YAT SEN UNIV +1
View PDF4 Cites 118 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method based on feature classification usually requires more time and computational cost
With the excellent performance of deep neural networks in various tasks of artificial intelligence, the mainstream research work of text entailment recognition is to use deep neural networks to model text sequences to complete the representation and matching of sentences. There are some shortcomings: First, the word2vec, GloVe or Str2Matrix used in the sentence representation process all rely on the existing corpus, and more and more new words, low-frequency words or compound words do not appear or rarely appear in the training corpus , and these words are not all included in the pre-training word vector, the large proportion of new word vector missing will affect the training effect of the model
The second is that the vector representation of two sentences in the mainstream method calculates the distance relationship matrix or the method based on text similarity does not have the ability of semantic reasoning
The third is that there is little discussion on the interactive information between the premise text and the hypothetical text. The existing technology is to encode or map the two texts separately, then simply aggregate the vectors, and then go through the deep sequence model and predict the classification. This process ignores the interaction information between sentences, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A text implication relation recognition method based on multi-granularity information fusion
  • A text implication relation recognition method based on multi-granularity information fusion
  • A text implication relation recognition method based on multi-granularity information fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0081] Such as figure 1 A text implication recognition method that integrates multi-granularity information is shown, including the process of model establishment, model training and model prediction. The specific method steps are as follows:

[0082] The model building process includes: input the training sample set obtained at the input layer; for the input text pairs P and Q at the character vector layer, respectively establish a convolutional neural network (CNN) model with character granularity as the input unit, and analyze the Each word extracts character features to obtain each new word vector; in the word vector fusion layer, the Highway network layer is established, and the word vector established by the character-level convolutional neural network (CNN) model layer is passed in, and the word vector sequence based on character features is output , and then combine them with the original pre-trained word vector one by one to obtain a word vector that combines two gran...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a text implication relation recognition method which fuses multi-granularity information, and proposes a modeling method which fuses multi-granularity information fusionand interaction between words and words, words and words, words and sentences. The invention firstly uses convolution neural network and Highway network layer in character vector layer to establish word vector model based on character level, and splices with word vector pre-trained by GloVe; Then the sentence modeling layer uses two-way long-short time memory network to model the word vector of fused word granularity, and then interacts and matches the text pairs through the sentence matching layer to fuse the attention mechanism, finally obtains the category through the integration classification layer; After the model is established, the model is trained and tested to obtain the text implication recognition and classification results of the test samples. This hierarchical structure method which combines the multi-granularity information of words, words and sentences combines the advantages of shallow feature location and deep feature learning in the model, so as to further improve the accuracy of text implication relationship recognition.

Description

technical field [0001] The present invention relates to the field of natural language processing, and more specifically, relates to a method for identifying text implication relations by fusing multi-granularity information. Background technique [0002] Textual entailment recognition research refers to judging the implied relationship (implicative, contradictory, or neutral) between two given texts (premise text and hypothetical text). This is an important task in the field of natural language processing. Traditional research methods mainly rely on the support of feature engineering, external semantic resources and tools, combined with machine learning methods to complete the text entailment classification. This method based on feature classification usually requires more time and computational cost. With the excellent performance of deep neural networks in various tasks of artificial intelligence, the mainstream research work of text entailment recognition is to use deep ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F17/27G06N3/04G06N3/08G06K9/62
CPCG06N3/084G06F40/30G06N3/045G06F18/2413G06F18/24147
Inventor 王慧瑶郭泽颖印鉴高静
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products