Retrieval method and method using same for establishing text semantic extraction module
A model and document technology, applied to establishing text semantic extraction models based on latent semantic analysis. It addresses problems such as limited functionality and high time complexity, and achieves the effect of removing redundancy.
Examples
Embodiment 1
[0056] When m≤n, singular value decomposition is performed on the keyword_document matrix A (m×n) according to step 2402 above. The decomposition yields the keyword vector matrix U, the diagonal matrix Σ, and the document vector matrix $V^T$. To simplify the matrices and highlight the relationship between their dimensions, the elements of each matrix are represented by "*", as follows:
[0057]
[0058] According to the above step 1404, the corresponding generated target matrix is:
[0059]
[0060] Assume that $D_1$ and $D_2$ are two rows arbitrarily selected from the document_keyword matrix D, and that $C_1$ and $C_2$ are the rows of the matrix C corresponding to $D_1$ and $D_2$, respectively. Then:
[0061] $C_1 = D_1 U$ (8)
[0062] $C_2 = D_2 U$ (9)
[0063] Since $D_1$ and $D_2$ are expressed as $\{w_{1,1}, w_{1,2}, \ldots, w_{1,m}\}$ and $\{w_{2,1}, w_{2,2}, \ldots, w_{2,m}\}$ respectively, the inner product of $D_1$ and $D_2$ ...
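The m≤n case can be illustrated with a minimal NumPy sketch. It is not the patent's implementation: the toy matrix A is hypothetical, and the sketch assumes that the document_keyword matrix D is the transpose of A. It builds C = DU as in (8) and (9) and compares the inner product of two rows before and after the mapping, which should agree because the full U returned by the decomposition is an m×m orthogonal matrix.

```python
import numpy as np

# Hypothetical toy data: m = 3 keywords (rows), n = 5 documents (columns), so m <= n.
A = np.array([
    [1.0, 0.0, 2.0, 0.0, 1.0],
    [0.0, 1.0, 1.0, 3.0, 0.0],
    [2.0, 1.0, 0.0, 1.0, 1.0],
])
m, n = A.shape

# Full SVD: U is m x m, s holds the min(m, n) singular values, Vt is n x n.
U, s, Vt = np.linalg.svd(A, full_matrices=True)

# Assumption (not stated in the excerpt): D is the transpose of A, shape n x m.
D = A.T

# Target matrix with one row per document: C_i = D_i U, as in (8) and (9).
C = D @ U

# Compare the inner product of two document rows before and after the mapping.
D1, D2 = D[0], D[1]
C1, C2 = C[0], C[1]
print("D1 . D2 =", D1 @ D2)
print("C1 . C2 =", C1 @ C2)
print("equal within tolerance:", bool(np.isclose(D1 @ D2, C1 @ C2)))
```

Since U satisfies U Uᵀ = I, the product C₁C₂ᵀ = D₁U Uᵀ D₂ᵀ reduces to D₁D₂ᵀ, so document similarities computed from rows of C match those computed from rows of D.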
Embodiment 2
[0098] When m>n, singular value decomposition is likewise performed on the keyword_document matrix A according to step 2402 above, yielding the keyword vector matrix U, the diagonal matrix Σ, and the document vector matrix $V^T$. As before, to simplify the matrices and highlight the relationship between their dimensions, the elements of each matrix are represented by "*", as follows:
[0099]
[0100] When m>n, the present invention uses only the matrix $U_1$ (m×n) to construct the target matrix C, where $U_1$ is the economy-size form of the matrix U. Its column count n is determined by the number of singular values in the matrix Σ; that is, n equals the number of documents in the document set.
[0101] Therefore, when m>n, the target matrix C can be defined as:
[0102] $C = D U_1$ (29)
[0103] The details are as follows:
[0104]
[0105] It can be seen from formula (30) that when m>n, C is an n×n matrix, and ...
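The m>n case can be sketched in NumPy in the same spirit. This is an illustration under stated assumptions rather than the patent's implementation: the toy matrix A is hypothetical, D is assumed to be the transpose of A, and NumPy's reduced decomposition (`full_matrices=False`) stands in for the economy-size matrix U₁.

```python
import numpy as np

# Hypothetical toy data: m = 5 keywords (rows), n = 3 documents (columns), so m > n.
A = np.array([
    [1.0, 0.0, 2.0],
    [0.0, 1.0, 1.0],
    [2.0, 1.0, 0.0],
    [0.0, 3.0, 1.0],
    [1.0, 0.0, 1.0],
])
m, n = A.shape

# Reduced (economy-size) SVD: U1 is m x n, s has n singular values, Vt is n x n.
U1, s, Vt = np.linalg.svd(A, full_matrices=False)

# Assumption (not stated in the excerpt): D is the transpose of A, shape n x m.
D = A.T

# Target matrix as in (29): C = D U1, an n x n matrix.
C = D @ U1

print("U1 shape:", U1.shape)
print("C shape:", C.shape)

# Row inner products of D and C, compared for all document pairs at once.
print("inner products preserved:", bool(np.allclose(C @ C.T, D @ D.T)))
```

Under the assumption D = Aᵀ, every row of D lies in the span of the columns of U₁, so projecting onto U₁ loses nothing and the pairwise inner products of the n documents are unchanged even though each row shrinks from m entries to n.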