Online short text data stream classification method based on feature extension
A short text classification method, applicable to text database clustering/classification, text database querying, unstructured text data retrieval, and related fields. It addresses the problems that existing approaches handle continuous data poorly, that text classification techniques are difficult to apply effectively, and that models fail to achieve better performance.
Embodiment Construction
[0075] In this example, as shown in Figure 2, an online short text data stream classification method based on feature extension is carried out as follows:
[0076] Step 1: Build the Word2vec model based on the external corpus, and obtain the word vector set Vec:
[0077] Step 1.1: Using a sliding window mechanism, the given short text data stream $Stream=\{d_1, d_2, \ldots, d_e, \ldots, d_E\}$ is divided by time into $T$ data blocks, denoted $D=\{D_1, D_2, \ldots, D_t, \ldots, D_T\}$, where $d_e$ denotes the $e$-th short text in the short text data stream Stream and $D_t$ denotes the data block at time $t$ in the stream; $e=1,2,\ldots,E$, $t=1,2,\ldots,T$;
[0078] Step 1.2: Obtain from the knowledge base an external text corpus for the short text data stream Stream, denoted $C'=\{d'_1, d'_2, \ldots, d'_m, \ldots, d'_M\}$, $m=1,2,\ldots,M$, where $M$ is the total number of texts in the external corpus $C'$, $d'_m$ denotes the $m$-th text, and has q=1,2,....
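The partitioning in Step 1.1 can be illustrated with a minimal Python sketch. The text does not specify the window size or whether windows overlap, so this sketch assumes fixed-size, non-overlapping blocks taken in arrival order; the function name `split_into_blocks` and the parameter `block_size` are hypothetical and not part of the described method.

```python
from typing import Iterable, Iterator, List


def split_into_blocks(stream: Iterable[str], block_size: int) -> Iterator[List[str]]:
    """Yield consecutive, non-overlapping blocks D_1, D_2, ... from a text stream.

    Assumption: each block holds `block_size` short texts in arrival order;
    the last block may be smaller if the stream length is not a multiple.
    """
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    block: List[str] = []
    for text in stream:
        block.append(text)
        if len(block) == block_size:
            yield block
            block = []
    if block:  # emit the final, possibly partial, block
        yield block


# Toy stream of E = 5 short texts split into blocks of 2 texts each.
stream = ["d1", "d2", "d3", "d4", "d5"]
blocks = list(split_into_blocks(stream, block_size=2))
print(blocks)  # [['d1', 'd2'], ['d3', 'd4'], ['d5']]
```

Because the function is a generator, each block can be processed as it arrives without holding the whole stream in memory, which suits an online setting.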