Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Intelligent acquisition method and system for main character words of Chinese novel

An acquisition method and novel technology, applied in the field of intelligent acquisition method and system of main character words in Chinese novels, to achieve the best intelligent computing effect

Pending Publication Date: 2022-06-03
王峰 +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Related word frequency statistical calculation methods are often concentrated on social texts, but novels are a kind of fictional text, which are two different types of texts, and their statistical calculation methods are completely different. It is basically invalid to apply social text statistical calculation methods to novels of

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Intelligent acquisition method and system for main character words of Chinese novel

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0029] like figure 1 As shown, this embodiment proposes a method for intelligently acquiring main character words in Chinese novels, and the method includes:

[0030] S100 , taking words from the full text of the novel according to the preset number of words, and calculating the word frequency in turn, so as to obtain a plurality of first high-frequency words in the top ranks.

[0031] Based on the punctuation marks, numerical symbols, etc. in the article, the novel is segmented to form a short sentence list, and then the words are selected from the short sentence list;

[0032] Take 5 words, 4 words, 3 words and 2 words as the word lengths to get words, and get 5 word length, 4 word length, 3 word length and 2 word length of dictionary containing words and word frequency respectively , merge the obtained four dictionaries, and select high-frequency words from the merged dictionaries in order of word frequency. Select the top 10 keys of the value (word frequency) in the dict...

Embodiment 2

[0044] Corresponding to the above-mentioned Embodiment 1, this embodiment proposes an intelligent acquisition system for the main character words of Chinese novels, and the system includes:

[0045] The first high-frequency word acquisition module is used to extract words from the full text of the novel according to the preset number of words and to calculate the word frequency in turn, so as to obtain a plurality of first high-frequency words in the first few positions;

[0046] The second high-frequency word acquisition module is used to take the obtained multiple high-frequency words as the origin, take the words with a fixed length before and after the origin, and then select the words according to the preset number of words and arrange them in turn to calculate the word frequency, and select to obtain a plurality of second high-frequency words. high frequency words;

[0047] The stop dictionary filtering module is used for using the stop dictionary to filter and remove us...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses an intelligent acquisition method and system for main role words of a Chinese novel, main roles of the novel are found by adopting a method of filtering useless words through two-step word frequency statistics and a deactivation dictionary, the main role words of the novel can be effectively found, and the optimal intelligent calculation effect is achieved. And the novel role can be quickly found, so that the Chinese novel can be further intelligently processed.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of natural language processing, and in particular, to a method and system for intelligently acquiring main character words in Chinese novels. Background technique [0002] Currently, there is no method for processing character words in Chinese novels in the prior art. The related word frequency statistical calculation methods often focus on social texts, but novels are a kind of fictional texts, which are two different types of texts, and their statistical calculation methods are completely different. It is basically invalid to transfer the statistical calculation methods of social texts to novels. of. SUMMARY OF THE INVENTION [0003] To this end, embodiments of the present invention provide a method and system for intelligently acquiring main character words in Chinese novels, so as to solve the lack of processing methods for character words in Chinese novels in the prior art. [...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/216G06F40/242G06F40/30
CPCG06F40/216G06F40/242G06F40/30
Inventor 王峰王凯
Owner 王峰
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products