Chinese short text classification method based on characteristic extension
A classification method and short text technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of discrete short text features, inability to obtain classification effects, short length, etc., to improve accuracy and recall rate Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0039] Embodiments of the present invention are now described in conjunction with the accompanying drawings.
[0040] Such as figure 1 As shown, the present invention includes five main steps: establishing a background knowledge base, expanding short texts in the training set, establishing a classification model, expanding short texts to be classified and generating classification results.
[0041] Step (1) Establish the background knowledge base: According to the long text corpus, use the improved Apriori algorithm to mine the binary groups of feature words with co-occurrence relationship and the same category tendency, so as to establish the background knowledge base. The specific steps are:
[0042] Step ① Segment the long texts in the long text corpus, and each long text only retains nouns, time words, location words, location words, verbs, adjectives, distinguishing words, status words and strings, so as to obtain the long text corpus feature word set;
[0043] Step ② C...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com