Intelligent acquisition method and system for main character words of Chinese novel
A method and system for intelligently acquiring the main character words in Chinese novels, aimed at achieving the best intelligent computing effect
Examples
Embodiment 1
[0029] As shown in Figure 1, this embodiment proposes a method for intelligently acquiring the main character words in Chinese novels. The method includes:
[0030] S100: taking words of preset lengths (in characters) from the full text of the novel and calculating the frequency of each, so as to obtain the top-ranked words as a plurality of first high-frequency words.
[0031] The novel is segmented at the punctuation marks, numerical symbols, and similar characters in the text to form a list of short sentences, and words are then taken from that list.
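The segmentation step in [0031] might be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `split_short_sentences` is hypothetical, and treating every character outside the basic CJK ideograph range (U+4E00 to U+9FFF) as a delimiter is an assumption, since the source only says "punctuation marks, numerical symbols, etc."

```python
import re

# Assumption: any run of characters that are not CJK ideographs
# (punctuation, digits, Latin letters, whitespace) is a delimiter.
_DELIMITERS = re.compile(r"[^\u4e00-\u9fff]+")


def split_short_sentences(text):
    """Split novel text into a list of delimiter-free short sentences."""
    # re.split can yield empty strings at the edges, so drop them.
    return [s for s in _DELIMITERS.split(text) if s]
```

With this rule, a numeral inside a chapter heading also splits the heading, which keeps digits out of every candidate word.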
[0032] Words are taken at lengths of 5, 4, 3 and 2 characters, giving four dictionaries (one per length) that each map a word to its word frequency. The four dictionaries are merged, and high-frequency words are selected from the merged dictionary in descending order of word frequency. Select the top 10 keys of the value (word frequency) in the dict...
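The counting and merging described in [0032] could look roughly like the sketch below. The names `count_ngrams` and `first_high_frequency_words` are hypothetical, and the use of an overlapping sliding window to take words is an assumption, because the source does not say how candidate words are cut from each short sentence.

```python
from collections import Counter


def count_ngrams(short_sentences, n):
    """Count every n-character substring within each short sentence."""
    counts = Counter()
    for s in short_sentences:
        # Assumption: overlapping sliding window of width n.
        for i in range(len(s) - n + 1):
            counts[s[i:i + n]] += 1
    return counts


def first_high_frequency_words(short_sentences, lengths=(5, 4, 3, 2), top_k=10):
    """Merge the per-length dictionaries and return the top_k words."""
    merged = Counter()
    for n in lengths:
        # Keys of different lengths never collide, so this is a plain merge.
        merged.update(count_ngrams(short_sentences, n))
    return [word for word, _ in merged.most_common(top_k)]
```

Because shorter substrings of a frequent name are at least as frequent as the name itself, the raw top 10 will usually contain fragments alongside full names.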
Embodiment 2
[0044] Corresponding to Embodiment 1 above, this embodiment proposes a system for intelligently acquiring the main character words in Chinese novels. The system includes:
[0045] A first high-frequency word acquisition module, used to take words of preset lengths (in characters) from the full text of the novel and calculate the frequency of each, so as to obtain the top-ranked words as a plurality of first high-frequency words;
[0046] A second high-frequency word acquisition module, used to take each of the obtained high-frequency words as an origin, take a fixed length of text before and after the origin, take words of the preset lengths from that text, calculate the frequency of each, and select a plurality of second high-frequency words;
[0047] A stop dictionary filtering module, used for using the stop dictionary to filter and remove us...
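One possible reading of the second high-frequency word acquisition module in [0046] is sketched below. It is an illustration under stated assumptions only: the function name `second_high_frequency_words`, the `window` parameter and its default of 4 characters, and the choice to cut context windows from short sentences rather than from the raw text are all hypothetical, as the source does not specify them.

```python
from collections import Counter


def second_high_frequency_words(short_sentences, origins, window=4,
                                lengths=(5, 4, 3, 2), top_k=10):
    """Count words of the preset lengths in the text around each origin."""
    counts = Counter()
    for s in short_sentences:
        for origin in origins:
            start = s.find(origin)
            while start != -1:
                # Fixed-length context before and after the origin word,
                # clipped to the bounds of the short sentence.
                lo = max(0, start - window)
                hi = min(len(s), start + len(origin) + window)
                context = s[lo:hi]
                for n in lengths:
                    for i in range(len(context) - n + 1):
                        counts[context[i:i + n]] += 1
                start = s.find(origin, start + 1)
    return [word for word, _ in counts.most_common(top_k)]
```

Restricting the count to text near an origin lets a longer form of a name outrank its fragments when that longer form recurs around the same origin.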