Method for automatically indexing and searching words and word attributes in Chinese text
A technology for automatic indexing and word indexing, applied in special data processing applications, instruments, and electrical digital data processing.
Examples
Example 1
[0104] Suppose the retrieval condition is "adverb = 1 adjective", that is, retrieve all instances in which an adverb is immediately followed by an adjective.
[0105] The retrieval condition contains 2 condition items: adverb and adjective.
[0106] The left condition item "adverb" is a word attribute whose feature code is 0000 2000 (hexadecimal). In the word attribute index, the only primary keys whose logical "AND" with this feature code is nonzero are 0000 2010 and 0000 2008, and they have 4 associated words: "one", "no", "more", "always". Looking these words up in the word index gives the starting positions of the corresponding word instances: 17, 49, 19, 30, 58, and 68.
[0107] The right condition item "adjective" is a word attribute, and its feature codes are 0010 0010 and 0010 0008 (hexadecimal). In the word attribute index, the only primary keys whose logical "AND" with this feature code is nonzero are ...
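The lookup described in [0106] can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patent's implementation: the dictionaries `attribute_index` and `word_index`, the function names, the grouping of the four words under the two primary keys, and the assignment of positions to individual words are all hypothetical. Only the feature code, the primary keys, the word list, and the position list are taken from the text.

```python
ADVERB = 0x00002000  # feature code of "adverb", from [0106]

# Word attribute index: primary key (combined feature code) -> associated words.
# The grouping of words under each key is assumed for illustration.
attribute_index = {
    0x00002010: ["one", "no"],
    0x00002008: ["more", "always"],
    0x00200008: ["encourage"],  # a non-adverb key, included to show filtering
}

# Word index: word -> starting positions of its instances.
# Which position belongs to which word is assumed for illustration.
word_index = {
    "one": [17, 49],
    "no": [19],
    "more": [30, 58],
    "always": [68],
    "encourage": [73],
}


def words_with_attribute(mask, attr_index):
    """Collect the words under every primary key that shares a bit with mask."""
    return [
        word
        for key, words in attr_index.items()
        if key & mask != 0
        for word in words
    ]


def start_positions(words, w_index):
    """Look up each word in the word index and concatenate its positions."""
    return [pos for word in words for pos in w_index.get(word, [])]


adverbs = words_with_attribute(ADVERB, attribute_index)
adverb_positions = start_positions(adverbs, word_index)
print(adverbs)           # ['one', 'no', 'more', 'always']
print(adverb_positions)  # [17, 49, 19, 30, 58, 68]
```

The two-step structure mirrors the text: the attribute index narrows the query to a small set of words, and only those words are then looked up in the word index.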
Example 2
[0113] Now let the retrieval condition be "verb ∧ two characters < 4 noun ∧ two characters", that is, retrieve all instances in which a two-character noun is the 1st, 2nd, or 3rd word after a two-character verb.
[0114] The left condition item "verb ∧ two characters" consists of word attributes connected by the "AND" operation, and the sum of the feature codes of "verb" and "two characters" is 0020 0008 (hexadecimal). In the word attribute index, the only primary key whose logical "AND" with this feature code is nonzero is 0020 0008 itself, and it has 5 associated words: "encourage", "continue", "lose", "support", "face to face". Looking these words up in the word index gives the starting positions of the corresponding word instances: 73, 82, 39, 70, 77.
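How a combined feature code might be built and tested can be sketched as below. The individual codes `VERB` and `TWO_CHARS` are assumptions: the text gives only their sum, 0020 0008. The sketch also uses an all-bits test (a key matches only if it carries every bit of the combined code), which is one possible reading of an "AND"-connected condition rather than something the text states; the function name `has_all_bits` is hypothetical.

```python
# Assumed single-attribute feature codes; only their sum appears in the text.
VERB = 0x00200000
TWO_CHARS = 0x00000008

# The bits do not overlap, so bitwise OR gives the same value as the sum.
combined = VERB | TWO_CHARS


def has_all_bits(key, mask):
    """True when key contains every bit that is set in mask."""
    return (key & mask) == mask


# Primary keys that appear in Examples 1 and 2.
keys = [0x00200008, 0x00002008, 0x00002010]
matching = [key for key in keys if has_all_bits(key, combined)]

print(f"{combined:08X}")                 # 00200008
print([f"{key:08X}" for key in matching])  # ['00200008']
```

Under this test the adverb key 0000 2008 is rejected even though it shares the "two characters" bit with the combined code.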
[0115] The right condition item "noun ∧ two characters" consists of word attributes connected by the "AND" operation, and the sum of the feature codes of "noun" and "two characters" is 0040 000...
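Once both condition items have been resolved to position lists, the distance test between them can be sketched as follows. This is a minimal illustration with several assumptions: the verb positions are those listed in [0114], the noun positions are invented because the corresponding paragraph is truncated, distances are taken to be in the same unit as the stored positions, and `proximity_matches` is a hypothetical helper name.

```python
def proximity_matches(left_positions, right_positions, gap_ok):
    """Return sorted (left, right) pairs whose distance right - left passes gap_ok."""
    return sorted(
        (left, right)
        for left in left_positions
        for right in right_positions
        if gap_ok(right - left)
    )


verb_positions = [73, 82, 39, 70, 77]  # from [0114]
noun_positions = [41, 75, 90]          # hypothetical values, not from the source

# Example 2 style condition: the right item within fewer than 4 positions after the left.
within_three = proximity_matches(verb_positions, noun_positions, lambda d: 0 < d < 4)

# Example 1 style condition: the right item exactly 1 position after the left.
adjacent = proximity_matches(verb_positions, noun_positions, lambda d: d == 1)

print(within_three)  # [(39, 41), (73, 75)]
print(adjacent)      # []
```

Passing the distance test as a function keeps the "= 1" condition of Example 1 and the "< 4" condition of Example 2 in the same code path. The nested loop is quadratic; for long position lists a merge over two sorted lists would be the usual replacement.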