Abstract automatic generation method based on concept pointer network

An automatic abstract generation technology, applied to neural learning methods, biological neural network models, instruments, etc., that addresses the insufficient abstraction of generated abstracts and achieves strong adaptability and generalization ability.

Active Publication Date: 2019-11-12
BEIJING INSTITUTE OF TECHNOLOGY

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to solve the problem of insufficient abstraction in the automatic summarization task by proposing a method for automatically generating abstracts based on a concept pointer network.


Examples

Embodiment

[0120] This embodiment describes the concrete implementation process of the present invention, as shown in Figure 1.

[0121] As can be seen from Figure 1, the method proceeds as follows:

[0122] Step A: Preprocessing. In this embodiment, the corpus is segmented into words and stop words are removed.

[0123] Word segmentation is performed with the PTB tokenizer, and stop words are removed with the nltk toolkit.
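The preprocessing in Step A can be sketched roughly as follows. This is a minimal, standard-library-only illustration, not the embodiment's actual pipeline: the regex tokenizer is a stand-in for the PTB tokenizer and does not reproduce its rules, the small `STOP_WORDS` set is a hypothetical stand-in for the nltk stop word list, and the function names are assumptions introduced here.

```python
import re

# Hypothetical stand-in for the nltk English stop word list (assumption).
STOP_WORDS = {"a", "an", "the", "of", "is", "are", "to", "and", "in", "on"}


def tokenize(text):
    """Split text into word and punctuation tokens.

    A rough regex tokenizer standing in for the PTB tokenizer;
    it is not equivalent to the PTB tokenization rules.
    """
    return re.findall(r"\w+|[^\w\s]", text)


def preprocess(text):
    """Tokenize the text, then drop tokens that are stop words.

    Stop words are matched case-insensitively (an assumption);
    the surviving tokens keep their original casing.
    """
    return [tok for tok in tokenize(text) if tok.lower() not in STOP_WORDS]
```

For example, `preprocess("The cat is on the mat.")` keeps only the content tokens and the final period. In a real setup the two stand-ins would presumably be swapped for the tools named in the embodiment.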

[0124] Step B: Initialize the concept word vectors and the input text word vectors, each with 128 dimensions. For example, the word vector of one concept word is [8.9154e-05, 6.2667e-05, 6.4418e-05, ..., 7.1736e-05, -2.4704e-05, 1.2438e-04], and the word vector of one word in the input text is [2.0672e-04, 1.1223e-04, 6.8911e-05, ..., 7.5825e-06, -7.2777e-06, 9.8726e-05].
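The initialization in Step B might look like the sketch below. Only the 128-dimension size comes from the embodiment; the uniform distribution, the `scale` bound, the fixed seed, and the function name `init_embeddings` are assumptions made for illustration, since the text does not state how the initial values are drawn.

```python
import random

EMBED_DIM = 128  # vector size stated in the embodiment


def init_embeddings(words, dim=EMBED_DIM, scale=1e-3, seed=0):
    """Return a dict mapping each word to a small random vector.

    Values are drawn uniformly from [-scale, scale]. The distribution
    and the scale are assumptions, not taken from the patent text.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    return {w: [rng.uniform(-scale, scale) for _ in range(dim)] for w in words}
```

The same helper can serve both tables, for example `init_embeddings(["animal", "pet"])` for concept words and `init_embeddings(["dog", "runs"], seed=1)` for input text words. A trainable model would normally hold these vectors in a framework tensor rather than in plain lists.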

[0125] Step C: Use a multi-layer encoder to learn document content repr...

Abstract

The invention relates to a method for automatically generating abstracts based on a concept pointer network, and belongs to the technical field of natural language processing. Building on the pointer network, the method proposes a concept pointer network that first finds multiple candidate concepts for the words of the input text. It then selects the most appropriate concepts according to the semantic information of the current input text, the text word information, and the concept information, and assigns them appropriate output probabilities. Finally, the concept pointer network is added to an encoder-decoder model with attention and combined with the pointer-generator mechanism; starting from a model trained with cross entropy, the model is further optimized using reinforcement learning and distant (remote) supervision, and is then used to generate the abstract. By representing document content at the more abstract level of concepts, and by training the model with a distant supervision strategy, the method gives the abstract generation model stronger adaptability and generalization ability and provides a way to generate high-quality abstracts.
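One way to picture how concept probabilities could be combined with a pointer-generator output is the toy mixture below. It is a sketch under stated assumptions, not the patent's formulation: the function name, the argument layout, the rule that splits each source word's copy mass between the word and its single best concept, and all example numbers are hypothetical. The attention weights, `p_gen`, and the per-concept probabilities are assumed to come from a trained model.

```python
def final_distribution(p_gen, vocab_dist, attention, source_words, candidates):
    """Blend generation, copy, and concept probabilities into one distribution.

    p_gen        -- probability of generating from the vocabulary
    vocab_dist   -- dict: vocabulary word -> generation probability (sums to 1)
    attention    -- attention weight per source position (sums to 1)
    source_words -- the source word at each position
    candidates   -- per position, dict: candidate concept -> probability in [0, 1]
                    (hypothetical model output; may be empty)
    """
    final = {w: p_gen * p for w, p in vocab_dist.items()}
    for a, word, cands in zip(attention, source_words, candidates):
        copy_mass = (1.0 - p_gen) * a
        # Simplification (assumption): keep only the most probable concept.
        if cands:
            concept = max(cands, key=cands.get)
            q = cands[concept]
        else:
            concept, q = None, 0.0
        # Split the copy mass between the source word and its concept.
        final[word] = final.get(word, 0.0) + copy_mass * (1.0 - q)
        if concept is not None:
            final[concept] = final.get(concept, 0.0) + copy_mass * q
    return final
```

With a source word "dog" whose candidate concepts are "animal" and "pet", part of the probability that a plain pointer network would copy to "dog" is redirected to "animal", which lets the output use a more abstract word than the one that appears in the input. Because the copy mass is only redistributed, the result still sums to 1 whenever `vocab_dist` and `attention` each sum to 1.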

Description

Technical field

[0001] The invention relates to a method for automatically generating abstracts based on a concept pointer network, and belongs to the technical field of natural language processing.

Background art

[0002] With the development and progress of society, the information on the Internet is increasing rapidly. This growth brings people a wide variety of information, but it also forces them to spend a lot of time understanding and searching for useful information. Information explosion has become a very serious problem in today's society. A method that can extract key information from long texts would help people understand a large amount of information quickly and conveniently. Automatic summarization is the task of extracting key information from text. Summarization can be done manually, but it will consume a lot of manpower and material resources...

Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F16/34, G06F16/9038, G06N3/04, G06N3/08
CPC: G06F16/345, G06F16/9038, G06N3/08, G06N3/044, G06N3/045
Inventor: 高扬, 王文博, 周宇翔
Owner: BEIJING INSTITUTE OF TECHNOLOGY