Unbalanced big data set-oriented unsupervised text topic related gene extraction method
A large data set and extraction method technology, applied in special data processing applications, unstructured text data retrieval, text database clustering/classification, etc., can solve problems that cannot accurately reflect real information, uneven distribution, and multivariate density estimation Difficulty and other issues
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0069] The present invention provides an unsupervised text topic-related gene extraction method oriented to unbalanced large data sets, which uses factor analysis and density peak algorithm to obtain clusters of high-dimensional sample sets, and labels unlabeled samples accordingly; Local density and information entropy are used to improve the feature selection method based on the CHI statistical matrix to strengthen the feature expression of low-density and small sample clusters; the fast fixed point algorithm based on negative entropy (FastICA) is used to analyze the high-order between multidimensional data Statistical correlation is used to extract independent hidden theme feature genes and complete the removal of high-order redundancy between components. This method does not require the use of large-scale labeled samples for training, and can effectively avoid the pre-definition of sample category relationships and feature structures; it also overcomes the use of over-sampli...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com