Multi-label text classification method and system
A text classification, multi-label technology, applied in text database clustering/classification, unstructured text data retrieval, special data processing applications, etc., can solve problems such as improving training errors, reducing model performance and classification accuracy, and achieving Provides accuracy and avoids errors
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0049] In order to solve the problem that existing methods ignore the correlation between labels and texts, this embodiment provides a multi-label text classification method based on LSTM-CNN and attention mechanism, considering the correlation between labels and texts, calculating The probability of several labels (label subsets) that are more relevant to the text is sufficient, which significantly improves the prediction efficiency of the model, avoids unnecessary errors, and improves accuracy.
[0050] Please refer to the attached figure 1 , the multi-label text classification method comprises the following steps:
[0051] S101, given a training set containing text sequences and label spaces, using a long short-term memory network
[0052] LSTM extracts the global feature vectors of all words in the text sequence, and uses the convolutional neural network (CNN) to aggregate the global feature vectors of all the words in the text sequence to obtain the semantic vectors of a...
Embodiment 2
[0107] This embodiment provides a multi-label text classification system based on LSTM-CNN and attention mechanism, which is used to implement the multi-label text classification method based on LSTM-CNN and attention mechanism described in the above embodiments.
[0108] Please refer to the attached Figure 4 , the multi-label text classification system includes an encoding module, a decoding module and a classification module, wherein:
[0109] The encoding module is used to obtain a training set including a text sequence and a label space, use a long short-term memory network to extract the global feature vectors of all words in the text sequence, and use a convolutional neural network to aggregate the obtained global feature vectors to obtain the text Semantic vectors for each word in the sequence;
[0110] The decoding module is used to calculate the weight coefficients of each label in the note space and all words in the text sequence respectively, construct th...
Embodiment 3
[0113] A computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, the steps in the multi-label text classification method described above are realized.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com