Chat text feature classification method and device and storage medium

A chat text feature classification technology, applied in text database clustering/classification, neural learning methods, unstructured text data retrieval, etc. It addresses the problems that hand-crafted features are hard to describe, that computation cannot be parallelized, and that performance degrades. Its effects are improved inference performance and avoidance of data sparsity and the curse of dimensionality.

Active Publication Date: 2021-01-12
XIAMEN MEIYA PICO INFORMATION
6 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0005] The invention addresses the shortcomings of existing text classification methods on short social chat texts mentioned above: performance degradation, the curse of dimensionality, data sparsity, and hand-crafted features that struggle to describe latent semantics. It also addresses the slow inference and lack of parallelism of text classification methods based on recurrent neural networks.

Embodiment Construction

[0045] In order to make the purpose, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings. Obviously, the described embodiments are only some of the embodiments of the present invention, rather than all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0046] Figure 1 shows an exemplary device architecture 100 to which the chat text feature classification method or the chat text feature classification apparatus of the embodiments of the present application can be applied.

[0047] As shown in Figure 1, the device architecture 100 may include terminal devices 101, 102, 103, a network 104 and a server 105. The network 104 is used as a medium for providing communication lin...

Abstract

The invention discloses a chat text feature classification method and device, and a storage medium. The method comprises the steps of: preprocessing an obtained chat text to acquire a word vector; inputting the word vector into a convolution network layer, calculating and generating local feature vectors of the chat text, and connecting the local feature vectors to form a context semantic feature vector; inputting the context semantic feature vector into a deep convolutional neural network to output a first fixed-length vector; combining the word vector with a position vector representing the position of each word in the chat text to form a joint word vector; passing the joint word vector through a gated linear unit (GLU) network in combination with a multi-kernel depthwise convolutional network layer to obtain a second fixed-length vector; connecting the first fixed-length vector with the second fixed-length vector to obtain a multi-level text semantic vector; inputting the multi-level text semantic vector into a fully connected network layer to calculate an output vector; and, for the output vector, calculating a classification probability value of the chat text by using a softmax function to obtain the feature category to which the chat text belongs.
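The data flow described in the abstract can be illustrated with a small, dependency-free Python sketch. It is not the patented implementation. All function names, the toy dimensions, and the placeholder weights are assumptions made for illustration. Two simplifications are also assumptions: the word and position vectors are combined by element-wise addition, and the first branch (convolution layer plus deep convolutional network) is replaced by plain max-pooling over the word vectors. The GLU gate, the per-channel (depthwise) convolution with several kernel sizes, the concatenation of both branch vectors, the fully connected layer, and the softmax follow the steps named in the abstract.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def glu(vec):
    """Gated linear unit: first half of vec gated by sigmoid of second half."""
    half = len(vec) // 2
    return [a * sigmoid(b) for a, b in zip(vec[:half], vec[half:])]


def depthwise_conv1d(seq, kernels):
    """Convolve each channel with its own kernel (zero padding, odd kernel sizes).

    seq: list of T token vectors with C channels each.
    kernels: list of C kernels, one per channel.
    """
    steps, channels = len(seq), len(seq[0])
    out = [[0.0] * channels for _ in range(steps)]
    for c in range(channels):
        kernel = kernels[c]
        pad = len(kernel) // 2
        for t in range(steps):
            acc = 0.0
            for j, w in enumerate(kernel):
                idx = t + j - pad
                if 0 <= idx < steps:
                    acc += w * seq[idx][c]
            out[t][c] = acc
    return out


def max_pool(seq):
    """Max over time per channel, giving a fixed-length vector."""
    return [max(col) for col in zip(*seq)]


def linear(vec, weights, bias):
    """Fully connected layer: one weight row and one bias per output."""
    return [sum(w * x for w, x in zip(row, vec)) + b
            for row, b in zip(weights, bias)]


def classify(word_vecs, pos_vecs, fc_weights, fc_bias):
    # Branch 1 (assumed stand-in for the conv layer + deep CNN).
    first = max_pool(word_vecs)
    # Branch 2: joint word vectors (assumed: element-wise sum), then GLU.
    joint = [[w + p for w, p in zip(wv, pv)]
             for wv, pv in zip(word_vecs, pos_vecs)]
    gated = [glu(tok) for tok in joint]
    channels = len(gated[0])
    second = []
    for k in (1, 3):  # several kernel sizes ("multi-kernel")
        # Averaging kernels are placeholder weights, not trained values.
        kernels = [[1.0 / k] * k for _ in range(channels)]
        second.extend(max_pool(depthwise_conv1d(gated, kernels)))
    features = first + second  # multi-level text semantic vector
    return softmax(linear(features, fc_weights, fc_bias))


# Toy input: 3 tokens, 4-dimensional vectors, 2 classes (all values made up).
word_vecs = [[0.1, 0.2, 0.3, 0.4], [0.5, -0.1, 0.0, 0.2], [0.3, 0.3, -0.2, 0.1]]
pos_vecs = [[0.0, 0.1, 0.0, 0.1], [0.1, 0.0, 0.1, 0.0], [0.0, 0.0, 0.1, 0.1]]
fc_weights = [[0.1] * 8, [-0.1] * 8]
fc_bias = [0.0, 0.0]
probs = classify(word_vecs, pos_vecs, fc_weights, fc_bias)
print(probs)
```

Unlike a recurrent network, every step here processes all token positions independently or through fixed-width windows, which is what makes this kind of architecture parallelizable. In practice a tensor library would replace the explicit loops.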

Description

Technical field

[0001] The present invention relates to the field of natural language processing, in particular to a chat text feature classification method, device and storage medium.

Background

[0002] The instant messaging tool represented by WeChat has become an important medium for social interaction, self-expression and sharing of opinions for the vast number of Internet users. As of December 2019, the number of active WeChat users exceeded 960 million, and the number of QQ users exceeded 740 million. Hundreds of millions of active users come from different social and cultural backgrounds, and use social instant messaging tools to communicate in life and work, express opinions, express feelings, and socialize all the time. But at the same time, some illegal and criminal activities are also hidden in it. A large amount of false information, advertising information, Internet fraud and other illegal information are released through QQ and WeChat, seriously end...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/35, G06F40/30, G06F40/284, G06F40/205, G06N3/04, G06N3/08, G06Q50/00
CPC: G06F16/355, G06F40/30, G06F40/284, G06F40/205, G06N3/08, G06Q50/00, G06N3/045
Inventor: 赵建强, 黄剑, 杜新胜, 张辉极, 陈诚, 邓叶勋, 蒋卓, 陈思萌
Owner XIAMEN MEIYA PICO INFORMATION