Chart English abstract generation method based on fusion spatial position attention mechanism

A technology of spatial location and attention, applied in machine learning, special data processing applications, instruments, etc., can solve problems such as illogical English summary generation, difficulty in obtaining data, and poor performance

Pending Publication Date: 2022-02-08
GUILIN UNIV OF ELECTRONIC TECH
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Among them, the traditional English abstract generation method uses the corpus to focus on the data table "what is said" and "in what way". They use the method of statistical learning to search for the abstract describing the data table. Once the style of the data table changes, the The method will cause its description to be distorted, so this type of method lacks versatility; the time-series English summary generation method can analyze the key information expressed in the data table through the time in the data table and the causal relationship between the data, although the time-series English summary generation method method can also generate a descriptive summary of the data, but there are the following problems:
[0004] (1) This method is different from other NLG tasks. For example, sentences and words in machine translation appear in pairs, and training data is easier to generate. However, data visualization tasks require structured data, and there are problems in the process of obtaining data. certain difficulties;
[0005] (2) This method equivalently marks the instance data (name, value, etc.) in the abstract, which may lead to incorrect abstracts predicted by the model;
[0006] (3) In the process of generating data description summaries, each group of source sentences and target sentences are equivalent to each other, without considering the possible spatial position embedding relationship between each group of sentences, and the word vector search algorithm is not used in the generation process, May generate illogical English summaries
Although statistically-learned models can also generate data descriptions, they use pre-defined templates to generate summaries, resulting in a lack of versatility in these methods and fewer choices in terms of grammatical style and vocabulary
In the model based on the encoder-decoder framework, the role of the encoder is to identify the data of the input table, and the role of the decoder is to use the long short-term memory network to create a description based on the table data. However, this method does not perform well in content selection, and lacks coherence between sentences
The model for generating text summarization based on structured data first encodes the summarization text into a record table, and then combines the content selection and planning mechanism into the neural network model for description. However, this method not only does not encode the potential space between words The positional relationship cannot cover the positional relationship between various sentences, and does not classify and mark different types of data, resulting in illogical situations in the generated summary
[0008] Traditional natural language algorithms RNN and LSTM can only extract features sequentially from left to right or from right to left, which may lead to two problems: (1) The feature calculated at time t depends on the value of the feature at time t-1, which is extremely large. This limits the parallel capability of the model; (2) In the long-term dependence process between eigenvalues, there may be a loss of characteristic information
[0009] Once the style of the data table of the traditional English summary generation method changes, it will lead to distortion of the English summary description and lack of versatility
The generation method of English summary based on time series also has the problems of difficulty in obtaining structured data, incorrect model prediction, and illogical English summary generation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chart English abstract generation method based on fusion spatial position attention mechanism
  • Chart English abstract generation method based on fusion spatial position attention mechanism
  • Chart English abstract generation method based on fusion spatial position attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0049] refer to figure 1 , a graph English summary generation method based on the fusion of spatial location attention mechanism, including the following steps:

[0050]1) Create a graph English summary description dataset: select data from multiple websites as the data source for model training, create a graph English summary description dataset, which consists of bar graphs and line graphs, and use the crawler framework to capture 8,300 records Data, 8300 pieces of data respectively include advertising industry, agriculture, chemical industry, construction industry, consumer industry, e-commerce industry, economic industry, energy and environment industry, finance and insurance industry, health and medical industry, Internet industry, life industry, media industry Data statistics tables, table titles and For the English summary of the chart description, the English summary generation task of TransChartText is used to generate a descriptive English summary through the given ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a chart English abstract generation method based on a fusion spatial position attention mechanism. The method comprises the following steps: 1) creating a chart English abstract description data set; 2) replacing a chart data value with a data variable; 3) performing word vector position coding based on a spatial relationship; and (4) searching a word vector result by adopting Diverse Beam Search. The method comprises replacing a chart data value with a data variable based on a fusion spatial position attention mechanism; learning a relationship between words by adopting a spatial attention mechanism; enhancing the spatial position relationship between the word vectors and correct word position sorting; and using a Diverse Beam Search to search for a better word vector result, so that the quality of generating the chart English abstract can be improved.

Description

technical field [0001] The invention relates to a computer natural language generation technology, in particular to a method for generating English summaries of graphs based on a fusion spatial position attention mechanism. Background technique [0002] Data visualization presents high-dimensional and complex data in intuitive forms such as bar charts and line charts. However, according to relevant research, there are certain difficulties in the analysis and utilization of charts in practice and are not fully utilized. Research on the graph corpus shows that 35% of the descriptions cannot express the key information conveyed by the text through the traditional visual way of observing graphs, and 26% of the descriptions can only express a small part of the expected information of the graph. However, using English abstracts to analyze and describe the content of the chart can achieve the purpose of reducing the difficulty of chart analysis and making the chart more intuitive a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/34G06F16/33G06F16/31G06N20/00
CPCG06F16/345G06F16/3346G06F16/313G06N20/00Y02D10/00
Inventor 王鑫许文全覃琴冯意颜靖柯王琴
Owner GUILIN UNIV OF ELECTRONIC TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products