
Document title generation method and device, equipment and storage medium

A document title generation technology applied in the field of artificial intelligence. It addresses two problems: adding titles to documents consumes a lot of manpower and time, and existing methods do not consider the influence of other information in the document, so title accuracy is not high enough. The stated effect is improved accuracy of the generated titles.

Pending Publication Date: 2022-04-26
PING AN TECH (SHENZHEN) CO LTD
Cites: 0; Cited by: 2

AI Technical Summary

Problems solved by technology

Visually rich documents have become a very common and important file form in people's daily work and life because they contain a large amount of text, layout, and format information. However, processing such documents, for example adding corresponding titles to them, consumes a lot of manpower and time.
[0003] Existing document title generation methods usually compare the document to be processed with existing documents that already have titles, and generate a title for the document to be processed according to the comparison. Such methods do not take into account that the text content of a document is influenced by other information in the document, so the accuracy of the generated titles is not high enough.



Examples


Specific embodiments

[0112] In detail, the specific implementation of each module of the document title generation device 100 is as follows:

[0113] Step 1: Obtain original document information, which includes original text information, original image information, and original location information.

[0114] In the embodiment of the present invention, the original document information refers to a visually rich document, that is, text data whose semantic structure is determined not only by the text content but also by visual elements such as typesetting, table structure, and font.

[0115] Specifically, the original document information includes original text information, original image information, and original location information. The original text information refers to the text content in the original document information, and the original image information refers to the o...
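The three kinds of information named in Step 1 can be pictured with a small data-structure sketch. Everything here is an assumption made for illustration, not taken from the patent text: the names `OriginalDocument`, `DocumentBlock`, and `split_into_blocks` are hypothetical, the location is assumed to be a bounding box `(x0, y0, x1, y1)`, the image information is a placeholder list of floats, and an upstream layout step is assumed to have already produced aligned regions.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Assumed layout of a location entry: (x0, y0, x1, y1) bounding box.
Box = Tuple[int, int, int, int]


@dataclass
class DocumentBlock:
    """One block of the document (hypothetical structure)."""
    text: str                   # text sub-information
    image_feature: List[float]  # image sub-information (placeholder values)
    position: Box               # position sub-information


@dataclass
class OriginalDocument:
    """Original document information: text, image, and location parts."""
    texts: List[str]
    image_features: List[List[float]]
    positions: List[Box]


def split_into_blocks(doc: OriginalDocument) -> List[DocumentBlock]:
    """Pair up the i-th text, image feature, and location into one block.

    Assumes the three lists are already aligned region by region.
    """
    if not (len(doc.texts) == len(doc.image_features) == len(doc.positions)):
        raise ValueError("text, image, and location lists must have equal length")
    return [
        DocumentBlock(text, feature, box)
        for text, feature, box in zip(doc.texts, doc.image_features, doc.positions)
    ]
```

Keeping the three parts together per block makes it straightforward to feed each block's text, image feature, and position into separate encoders later.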


Abstract

The invention relates to artificial intelligence technology and discloses a document title generation method comprising the following steps: dividing original document information into blocks to obtain multiple pieces of text sub-information, multiple pieces of image sub-information, and multiple pieces of position sub-information; inputting the pieces of text sub-information into a text coding model for text coding to obtain a text feature vector; performing weighted addition on the text feature vector, the image features in the pieces of image sub-information, and the multi-dimensional position vector obtained by position coding to obtain a final input vector, and inputting the final input vector into a transformer encoder model for fusion coding to obtain a final output feature; and performing feature decoding on the final output feature with a decoder module to obtain the document block containing the title. The invention also relates to blockchain technology: the image features can be stored in nodes of a blockchain. The invention further provides a document title generation device, electronic equipment, and a storage medium. The invention can improve the accuracy of document title generation.
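The weighted-addition step in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names `weighted_add` and `build_input_vectors` and the weight values are hypothetical, the three feature vectors are assumed to share one dimension already, and the text coding model, position coding, transformer encoder, and decoder are all left out.

```python
from typing import List, Sequence, Tuple

Weights = Tuple[float, float, float]


def weighted_add(
    text_vec: Sequence[float],
    image_vec: Sequence[float],
    position_vec: Sequence[float],
    weights: Weights = (1.0, 1.0, 1.0),
) -> List[float]:
    """Element-wise weighted sum of text, image, and position vectors."""
    if not (len(text_vec) == len(image_vec) == len(position_vec)):
        raise ValueError("all three vectors must have the same dimension")
    w_text, w_image, w_pos = weights
    return [
        w_text * t + w_image * i + w_pos * p
        for t, i, p in zip(text_vec, image_vec, position_vec)
    ]


def build_input_vectors(
    text_vecs: Sequence[Sequence[float]],
    image_vecs: Sequence[Sequence[float]],
    position_vecs: Sequence[Sequence[float]],
    weights: Weights = (1.0, 1.0, 1.0),
) -> List[List[float]]:
    """Apply weighted_add block by block to get one input vector per block."""
    return [
        weighted_add(t, i, p, weights)
        for t, i, p in zip(text_vecs, image_vecs, position_vecs)
    ]


# Toy example with hypothetical weights (0.5, 0.25, 0.25).
fused = weighted_add([1.0, 2.0], [0.5, 0.5], [0.0, 1.0], weights=(0.5, 0.25, 0.25))
print(fused)  # [0.625, 1.375]
```

In a full system, the resulting per-block vectors would be the sequence passed to the transformer encoder for fusion coding; whether the weights are fixed or learned is not stated in the text above.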

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, in particular to a document title generation method, device, electronic equipment, and computer-readable storage medium.

Background

[0002] With the acceleration of digitalization, the structural analysis and content extraction of documents, images, and other carriers have become key to the success or failure of the digital transformation of enterprises. Automatic, accurate, and fast information processing is crucial to improving productivity. Visually rich documents have become a very common and important file form in people's daily work and life because they contain a large amount of text, layout, and format information. However, processing such documents, for example adding corresponding titles to them, consumes a lot of manpower and time.

[0003] The existing document title generation method...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06V30/40, G06K9/62, G06N3/04, G06N3/08, G06V30/148, G06V10/74, G06V10/774
CPC: G06N3/08, G06N3/045, G06F18/22, G06F18/214
Inventor: 唐小初, 张祎頔, 舒畅, 陈又新
Owner: PING AN TECH (SHENZHEN) CO LTD