Topic modeling method based on data enhancement
A technology of topic modeling and topic model, which is applied in digital data processing, natural language data processing, special data processing applications, etc., can solve the problems of increasing time cost, unfavorable short text feature expansion and selection, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0033] In this example, if figure 1 As shown, a topic modeling method based on data enhancement is carried out as follows:
[0034] Step 1. Obtain document collection D={D 1 ,...,D d ,...,D |D|}, where D d Indicates the dth document, 1≤d≤|D|; assume the dth document D d is composed of |S| sentences, then let the dth document D d The set of sentences is S d ={S d,1 ,...,S d,s ,...,S d,|S|}, S d,s Indicates the dth document D d In the sth sentence, 1≤s≤|S|; assuming the dth document D d is composed of N words, then let the dth document D d The set of words for W d,j Indicates the dth document D d The jth word in , 1≤j≤N d ; Then let all the words in the document collection D constitute the word collection W={W 1 ,...,W i ,...,W V}, W i Indicates the i-th word, 1≤i≤V. The document set selected by the present invention is Sina Weibo data. Sina Weibo data is the original files published by Weibo users or content posted by other users. The characters of the pu...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com