Keyword extraction method and device, encoder and decoder

A keyword and coding technology, applied in the field of information processing, can solve problems such as insufficient overall effect, and achieve the effect of reducing repeated keywords, improving accuracy, and increasing the total number of

Pending Publication Date: 2022-07-01
ALIBABA GRP HLDG LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This approach relies too much on human-written rules, and the overall effect is not good enough
[0004] Another way is to extract keywords based on sequence-to-sequence model, however, this way faces two problems: 1) How to generate a document representation good enough to reflect the most important key words in the original document for keyword extraction Semantic information; 2) How to model the relationship between keywords in the keyword set, that is, how to better learn the conditional probability P(y n |y ), where y n is the keyword that needs to be generated currently, y is the generated keyword sequence

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Keyword extraction method and device, encoder and decoder
  • Keyword extraction method and device, encoder and decoder
  • Keyword extraction method and device, encoder and decoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0093] In order to make the objectives, technical solutions and advantages of the present application clearer, the embodiments of the present application will be described in detail below with reference to the accompanying drawings. It should be noted that, the embodiments in the present application and the features in the embodiments may be arbitrarily combined with each other if there is no conflict.

[0094] In a typical configuration of the present application, a computing device includes one or more processors (CPUs), input / output interfaces, network interfaces, and memory.

[0095] Memory may include non-persistent memory in computer readable media, random access memory (RAM) and / or non-volatile memory in the form of, for example, read only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.

[0096] Computer-readable media includes both persistent and non-permanent, removable and non-removable media, and storage of information ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a keyword extraction method and device, an encoder and a decoder, and the method comprises the steps: transmitting a keyword generation process, namely decoding information, to an encoding processing part in a manner of dynamically modifying a syntactic graph structure of an original document through the generated keyword information; therefore, the two problems of modeling the keyword relationship and obtaining better semantic representation are solved at the same time, and the document coding effect is improved. Furthermore, more accurate and diversified keyword sequences are obtained by matching with a diversity inference process. In other words, the keyword generation accuracy is improved, the generated repeated keywords are reduced, and the total number of the generated keywords is increased.

Description

technical field [0001] The present application relates to, but is not limited to, information processing technology, especially a keyword extraction method and apparatus, encoder and decoder. Background technique [0002] Keyword extraction is to extract the core content to be expressed in the text by some means given a long text, so as to accurately and quickly extract key information from a large amount of information. These keywords can be entities with specific meanings, or some basic concepts or events. The extracted keywords can be represented by a keyword sequence, and the keyword sequence can be listed in order according to the degree of confidence. The higher the degree of confidence, the higher the ranking. The extracted keyword sequences can be applied to the subject tags of articles in different fields such as travel notes, notes, news, etc., as well as document retrieval and recommendation systems. [0003] In the related art, one way is to extract keywords by...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/258G06F40/211G06F16/35G06F40/126G06F40/216G06F40/284
CPCG06F40/258G06F40/211G06F40/284G06F40/126G06F40/216G06F16/355
Inventor 张浩宇龙定坤徐光伟王潇斌谢朋峻黄非
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products