Hierarchical text classification method and system
A text classification and hierarchical technology, applied in text database clustering/classification, neural learning methods, text database indexing, etc., can solve problems such as error stacking, achieve the effect of improving accuracy and reducing the number of prediction errors
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0051] In this embodiment, a hierarchical text classification method is mainly based on figure 1 In the hierarchical structure shown, a classifier is trained on each non-leaf node, and then the connection between text class labels is used to introduce the concept of "adjusted probability matrix" between class labels. The adjusted probability matrix obtained through training is The text class labels are revised globally, and a global hierarchical text classification model is constructed to obtain more accurate text class labels. figure 2 Shows the flow of the training phase of the hierarchical text classification method of this embodiment, image 3 The flow of the actual classification stage of the hierarchical text classification method of this embodiment is shown, and the above two stages will be described in detail below with reference to the accompanying drawings.
[0052] The training phase of this embodiment mainly includes several steps such as obtaining the training s...
Embodiment 2
[0084] This embodiment provides a hierarchical text classification system, including:
[0085] The text acquisition module is used to obtain the training set text and the text to be classified;
[0086] A text preprocessing module, configured to preprocess each text obtained;
[0087] The text vectorization module is used to vectorize the preprocessed text, and express the words in the text as vector forms;
[0088] Each level classifier training module, the classifier obtained is used to preliminarily predict the text class label probability vector, and the vector element represents the probability that the text is divided into each class label; the construction method of each level classifier is: according to the text class label The tree-type hierarchical structure, numbering the class label nodes in the text class label hierarchical tree; taking the training text vector set and the text subsets corresponding to the categories of each layer as input, using the neural netwo...
PUM
![No PUM](https://static-eureka-patsnap-com.libproxy1.nus.edu.sg/ssr/23.2.0/_nuxt/noPUMSmall.5c5f49c7.png)
Abstract
Description
Claims
Application Information
![application no application](https://static-eureka-patsnap-com.libproxy1.nus.edu.sg/ssr/23.2.0/_nuxt/application.06fe782c.png)
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com