National defense science and technology hot word discovery method and system based on big data

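A big-data-based method for national defense science and technology, applied in the fields of electrical digital data processing, natural language data processing, instruments, etc. It addresses the problem that traditional term extraction methods cannot be directly and effectively applied to national defense science and technology terms.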

Pending Publication Date: 2020-04-28
中国人民解放军军事科学院军事科学信息研究中心

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to overcome the problem that traditional term extraction methods cannot be directly and effectively applied to the recognition of national defense science and technology terms. To meet the need to quickly grasp key hotspot knowledge in the field of national defense science and technology, the invention proposes a big-data-based method for discovering national defense science and technology hot words that combines improved term extraction, hot word sorting, entity classification, and other techniques.

Examples

Detailed Description of the Embodiments

[0049] The technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0050] As shown in Figure 1, the present invention proposes a method for discovering Chinese national defense science and technology hot words based on big data. The national defense science and technology dynamic news database used in the present invention is built by tracking and accumulating curated information sources related to national defense science and technology. The national defense science and technology vocabulary is a collection of important terms closely related to the field that has been accumulated over a long time. Learning its characteristics helps the machine identify national defense science and technology terms effectively.

[0051] Step 1. Construct the training corpus through the accumulated defense science and technology vocabulary, and observe and summarize the pattern characteristics of the defense science and techn...
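The text of Step 1 is cut off here, so how the training corpus is actually labeled is not known. As a rough illustration only, one common way to turn an accumulated vocabulary into sequence-labeling data for a CRF is longest-match dictionary tagging with B/I/O labels. The function name `bio_tag`, the label names, and the sample tokens below are hypothetical and do not come from the patent.

```python
def bio_tag(tokens, vocabulary):
    """Label tokens with B-TERM / I-TERM / O by longest-match vocabulary lookup.

    tokens: list of tokens (words or single characters).
    vocabulary: iterable of terms, each given as a sequence of tokens.
    This labeling scheme is an assumption, not the patent's stated procedure.
    """
    vocab = {tuple(term) for term in vocabulary}
    max_len = max((len(term) for term in vocab), default=0)
    labels = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = 0
        # Try the longest candidate span first so nested terms do not win.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            if tuple(tokens[i:i + n]) in vocab:
                matched = n
                break
        if matched:
            labels[i] = "B-TERM"
            for j in range(i + 1, i + matched):
                labels[j] = "I-TERM"
            i += matched
        else:
            i += 1
    return labels


if __name__ == "__main__":
    sentence = ["the", "hypersonic", "glide", "vehicle", "completed", "a", "test"]
    terms = [["hypersonic", "glide", "vehicle"], ["glide", "vehicle"]]
    for token, label in zip(sentence, bio_tag(sentence, terms)):
        print(token, label)
```

For Chinese text the tokens would more likely be characters or segmented words. The resulting token and label pairs could then be fed, together with hand-designed pattern features, to any CRF toolkit.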

Abstract

The invention discloses a national defense science and technology hot word discovery method and system based on big data. The method comprises: inputting dynamic news text from a specific time period into a pre-built CRF-based Chinese national defense science and technology term extraction model, and outputting a candidate set of national defense science and technology hot words; ranking the terms in the candidate set by popularity using a method based on Newton's law of cooling, and outputting a national defense science and technology hot word set; and inputting the hot word set into a pre-established national defense science and technology hot word classification model, and outputting category information for the hot words. Technologies such as term extraction, hot word sorting, and entity classification are applied in combination. A hot word discovery method oriented to the national defense science and technology field is proposed for the first time, and the results show that the method can effectively mine the national defense science and technology hot words appearing in Chinese dynamic news, helping researchers track and grasp the latest hotspots and key knowledge clues in the field in a timely manner.
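The abstract does not give the formula used for popularity sorting. As a minimal sketch, assuming a score in the style of Newton's law of cooling, a term's frequency can be treated as a "temperature" that changes exponentially between two time windows, and terms can be ranked by the fitted rate. The function names, the add-one smoothing, and the sample terms are illustrative assumptions, not the patent's method.

```python
import math
from collections import Counter


def cooling_rate(prev_count, curr_count, elapsed_days=1.0):
    """Exponential rate k in count(t) = count(0) * exp(k * t).

    Positive k means the term is heating up, negative k means it is cooling.
    Add-one smoothing (an assumption) keeps the ratio finite for new terms.
    """
    return math.log((curr_count + 1) / (prev_count + 1)) / elapsed_days


def rank_hot_words(prev_terms, curr_terms, elapsed_days=1.0, top_n=10):
    """Rank terms seen in the current window by their heating rate."""
    prev = Counter(prev_terms)
    curr = Counter(curr_terms)
    scores = {
        term: cooling_rate(prev[term], count, elapsed_days)
        for term, count in curr.items()
    }
    # Highest rate first; ties are broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


if __name__ == "__main__":
    previous_window = ["radar"] * 5 + ["hypersonic"]
    current_window = ["radar"] * 5 + ["hypersonic"] * 7 + ["uav"]
    for term, score in rank_hot_words(previous_window, current_window):
        print(f"{term}\t{score:.3f}")
    # Order printed: hypersonic, uav, radar
```

A term with a steady frequency ("radar" above) scores zero no matter how common it is, which is the property that separates a hot word from a merely frequent one.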

Description

Technical field

[0001] The present invention relates to the technical field of natural language processing, specifically to information extraction, and in particular to a method and system for discovering hot words of national defense science and technology based on big data.

Background art

[0002] Traditional term extraction methods can be roughly divided into three categories: (1) Rule-based methods. These summarize matching rules for terms, mainly on the basis of linguistic knowledge; examples include the FASTR system and the Terms system. The advantage of this approach is that it is simple to implement and has high recognition accuracy, but the matching rules must be summarized manually, which is time-consuming and laborious, and incomplete rule coverage easily leads to missed terms. (2) Statistics-based methods. One is an unsupervised statistical method, which relies entirely on statistical quan...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/242, G06F40/284, G06F40/289
CPC: Y02D10/00
Inventors: 田昌海, 罗威, 赵超阳, 谭玉珊, 罗准辰, 武帅, 毛彬, 叶宇铭, 宋宇
Owner: 中国人民解放军军事科学院军事科学信息研究中心