Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Classification method and device for short texts

A classification method and text classification technology, applied in text database clustering/classification, neural learning methods, unstructured text data retrieval, etc. relationship and other issues to achieve the effect of improving accuracy and high accuracy

Active Publication Date: 2020-01-17
BEIJING UNIV OF POSTS & TELECOMM
View PDF12 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In summary, the failure to capture the semantic relationship in short texts and the lack of training samples will lead to low accuracy when applying existing short text classification methods to classify short texts.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classification method and device for short texts
  • Classification method and device for short texts
  • Classification method and device for short texts

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0060] In order to solve the problems in the prior art, an embodiment of the present invention provides a short text classification method and device.

[0061] see figure 1 , figure 1 A schematic flow diagram of a short text classification method provided by an embodiment of the present invention, which is applied to a client or a server, and the method includes:

[0062] S101. Obtain a short text to be classified.

[0063] Wherein, the content of the short...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a classification method and device for short texts. According to the method, to-be-classified short texts are classified; according to the entities obtained from the to-be-classified short texts and the affiliation relationships between the topics and the to-be-classified short texts, the to-be-classified short texts are classified; TEXT HETEROGENEOGRAPHICSare constructed, the constructed text heterogeneous graph is input into a preset text classification model; a classification result of the to-be-classified short text is obtained. By applying the text heterogeneous graph constructed by the embodiment of the invention, the semantic relationship in the to-be-classified short text can be captured; moreover, the heterogeneous graph convolutional neural network does not need too much annotation data during training, so that the accuracy of the trained text classification model during short text classification is higher, and therefore, the accuracy of short text classification can be improved by applying the method provided by the embodiment of the invention.

Description

technical field [0001] The invention relates to the technical field of natural language processing, in particular to a classification method and device for short texts. Background technique [0002] With the rapid development of online social media and e-commerce, short texts such as online news, searches, comments, tweets, etc. appear more and more commonly on the Internet. Classifying short texts can help users manage texts efficiently. In view of this, short text classifications are widely used in many fields, such as sentiment analysis, news classification, query intent classification, etc. However, in many practical applications, there is very little labeled data, and manual labeling is extremely time-consuming and even requires professional knowledge. Therefore, there is an urgent need to study semi-supervised short text classification with only a relatively small amount of labeled data. [0003] At present, a short text classification method based on deep neural net...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F16/36G06N3/04G06N3/08
CPCG06F16/35G06N3/08G06F16/367G06N3/045
Inventor 石川胡琳梅杨天持
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products