Keyword extraction method and device based on information entropy, equipment and medium

An information-entropy-based keyword extraction method, applied in fields such as instruments, unstructured text data retrieval, and electronic digital data processing. It achieves accurate keyword extraction, avoids misjudgment, and reduces redundancy.

Active Publication Date: 2021-04-20
PING AN TECH (SHENZHEN) CO LTD
Cites: 4; Cited by: 2

AI Technical Summary

Problems solved by technology

[0003] However, because traditional keyword extraction considers only the number of documents in which a word appears, it fails easily when the texts share a similar linguistic context, and it struggles to surface differentiating keywords. A word may appear only once in the documents of the other crowd categories, yet because it then appears in every document it is treated as a "common word". This reduces the weight of the word in the vocabulary of the category to which it actually belongs and results in recognition errors.
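The failure mode described in [0003] can be illustrated with a minimal sketch. The toy corpus, the category names, and the helper `idf` are hypothetical and are not taken from the patent; the formula is the textbook inverse document frequency, log(N / df), which depends only on how many documents contain a word.

```python
import math

def idf(term, docs):
    """Textbook inverse document frequency: log(N / df)."""
    n_docs = len(docs)
    df = sum(1 for doc in docs if term in doc)
    return math.log(n_docs / df) if df else 0.0

# Hypothetical corpus: one tokenized document per crowd category.
corpus = {
    "new_agents": ["exam"] * 8 + ["policy", "client"],
    "senior_agents": ["exam", "client", "renewal", "renewal"],
    "managers": ["exam", "team", "recruiting", "client"],
}
docs = list(corpus.values())

# "exam" dominates the first category but occurs once in each of the others,
# so it is present in every document and its IDF collapses to zero.
idf_exam = idf("exam", docs)
# "renewal" occurs in a single document and keeps a positive IDF.
idf_renewal = idf("renewal", docs)
print(idf_exam, idf_renewal)
```

Because the document count ignores how often the word occurs inside each document, "exam" is weighted the same as a word spread evenly across all categories.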

Examples

Detailed Description of Embodiments

[0060] To make the objectives, technical solutions and advantages of the present invention clearer, the present invention is described in detail below with reference to the accompanying drawings and specific embodiments.

[0061] Figure 1 is a flow chart of a preferred embodiment of the keyword extraction method based on information entropy of the present invention. Depending on requirements, the order of the steps in the flow chart may be changed, and some steps may be omitted.

[0062] The keyword extraction method based on information entropy is applied to one or more electronic devices. An electronic device is a device that can automatically perform numerical calculation and/or information processing according to preset or stored instructions. Its hardware includes, but is not limited to, microprocessors, application-specific integrated circuits (ASICs), field-programmable gate arrays (FPGAs), digital processors (Di...

Abstract

The invention relates to the field of artificial intelligence and provides an information-entropy-based keyword extraction method, apparatus, device and medium. The received tag text is preprocessed, which standardizes the text while reducing character redundancy and improves the speed and quality of data processing. An inverse information entropy vector is then used to improve the accuracy of keyword extraction. On the one hand, it can reproduce the effect of TF-IDF when data quality is high; on the other hand, it can effectively process noisy data and mine keywords that distinguish between categories. This addresses the failure of traditional TF-IDF to distinguish categories and thereby avoids misjudgment. The method is also highly interpretable and suitable for large-scale application. Target keywords are extracted according to the word weight matrix, which achieves automatic and accurate keyword extraction. In addition, the invention involves blockchain technology: the target keywords can be stored in a blockchain node.
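The excerpt does not give the formula for the inverse information entropy vector, so the following is only a sketch under a stated assumption: the weight of a word is taken to be log(N) minus the Shannon entropy of its occurrence counts across the N documents. That choice is an assumption made here because it reduces to the textbook IDF, log(N / df), whenever a word occurs equally often in each document that contains it, which is one way to read the claim that the TF-IDF effect is reproduced on clean data. The corpus and the names `inverse_entropy`, `weight_matrix` and `top_keywords` are hypothetical.

```python
import math
from collections import Counter

def inverse_entropy(term, docs):
    """Assumed weight: log(N) - H, where H is the Shannon entropy of the
    term's occurrence counts across the N documents."""
    counts = [Counter(doc)[term] for doc in docs]
    total = sum(counts)
    if total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return math.log(len(docs)) - entropy

def weight_matrix(docs):
    """One row per document: term frequency times the assumed inverse entropy."""
    vocab = sorted({word for doc in docs for word in doc})
    ie = {word: inverse_entropy(word, docs) for word in vocab}
    rows = []
    for doc in docs:
        tf = Counter(doc)
        rows.append({word: tf[word] / len(doc) * ie[word] for word in vocab})
    return rows

def top_keywords(weights, k=2):
    """Return the k highest-weighted words of one row."""
    return sorted(weights, key=weights.get, reverse=True)[:k]

# Hypothetical corpus: one tokenized document per crowd category.
docs = [
    ["exam"] * 8 + ["policy", "client"],         # new agents
    ["exam", "client", "renewal", "renewal"],    # senior agents
    ["exam", "team", "recruiting", "client"],    # managers
]

rows = weight_matrix(docs)
print(top_keywords(rows[0], k=1))
```

In this toy corpus "exam" appears in every document, so its IDF would be zero, but its occurrences are concentrated in the first document, so its entropy is below log(N) and it keeps a positive weight there. "client" is spread evenly and still receives a weight of approximately zero.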

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, and in particular to a keyword extraction method, apparatus, device and medium based on information entropy.

Background

[0002] In the field of artificial intelligence, keyword extraction plays an important role. For example, in enterprise employee training, more and more stages have gradually shifted from offline to online. Taking the professional training of life insurance agents as an example: for the training to be practical and effective, so that agents can apply what they have learned to actual life insurance sales, customer retention and other scenarios, the training and production departments must conduct in-depth research on differentiating both the audience and the training. The traditional method mainly revolves around regular communication with the business department to collect the demands and points ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/216, G06F40/289, G06F16/33
Inventor: 许丹
Owner: PING AN TECH (SHENZHEN) CO LTD