A Text Representation Method Using Local Embedded Topic Modeling
A text representation and topic technology, applied in the field of computer science and information retrieval, can solve the problems of not being a mapping, not being able to provide mapping functions, and not being able to transfer known data knowledge, etc., to achieve wide practicability and stable and coherent performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0074] In order to better illustrate the purpose and advantages of the present invention, the implementation of the method of the present invention will be described in further detail below in conjunction with the accompanying drawings and examples.
[0075] In the experiment, two widely used English text classification corpora (20newsgroup, RCV1) were used to test the invention. 20newsgroup consists of 20 associated newsgroups, containing a collection of 20,000 texts. RCV1 is a large-scale multi-class dataset, which is an archive of more than 800,000 human-classified newswire stories obtained by Reuters. We extracted four types of texts: M11 (equity investment market), M12 (bond market), M131 (international banking market) and M132 (foreign exchange market). Table 1 shows some statistics about these datasets. Table 1 shows some statistics about these datasets.
[0076] Table 1 Statistics of the 2 corpora, D is the total number of texts. W is the vocabulary size, is the ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com