Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for automatically obtaining news headline, computer equipment and storage medium

An automatic acquisition and title technology, applied in the field of data processing, can solve the problems of increasing labor costs, inability to accurately distinguish the headlines of news, etc., and achieve the effect of saving labor costs.

Active Publication Date: 2020-11-13
CHENGDU SOBEY DIGITAL TECH CO LTD
View PDF10 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] However, in practical engineering applications, just getting the OCR recognition results in the news scene cannot accurately distinguish the headlines of the news.
The main reason is that there are scrolling subtitles and news titles in the news, which cannot be achieved by using location information and text information alone. The title can be extracted well. If the template and location information are roughly used to determine the title , then changing a piece of news requires changing the template and position threshold. This method increases the labor cost and is still not advisable.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for automatically obtaining news headline, computer equipment and storage medium
  • Method and system for automatically obtaining news headline, computer equipment and storage medium
  • Method and system for automatically obtaining news headline, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] In order to have a clearer understanding of the technical features, purposes and effects of the present invention, specific implementations of the present invention are now described. It should be understood that the specific embodiments described here are only used to explain the present invention, and are not intended to limit the present invention, that is, the described embodiments are only some of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without making creative efforts belong to the protection scope of the present invention.

[0048] The relevant terms involved in the present invention are described as follows:

[0049] OCR: Optical Character Recognition, optical character recognition;

[0050] PSENET: Progressive Scale Expansion Network, progressive scale expansion network;

[0051] CRNN: Convolutional Recurrent Neural Ne...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and system for automatically obtaining news headlines, computer equipment and a storage medium, and the method comprises the steps: obtaining the coordinate information of each single-row textbox of a single-frame picture in a news video and the text information in the textbox through OCR, and determining a to-be-selected headline through employing a textbox clustering and character similarity comparison method; using BERT, LSTM and CRF jointly to extract entities of text information, screening out non-headline information through entity recognition results, and determining finally news headlines according to in-out point information of a single piece of news. The method has a good effect on extracting different types of news headlines; meanwhile, manual auxiliary operations such as manual marking and template making are not needed, so that the labor cost can be greatly saved, and the method has profound significance in news headline extraction work.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method, system, computer equipment and storage medium for automatically obtaining news headlines. Background technique [0002] In recent years, TV news programs have developed and expanded rapidly, and with the popularity of TV, TV news has gradually replaced paper news as the first way for people to obtain news. Among them, news headlines, as a high-level summary of news and topic essence, can be used as content identification and indexing of video clips, which is of great significance for understanding the content of news. However, manual identification of news headlines is time-consuming and laborious. Therefore, automatic positioning, extraction and identification of news headlines are the corresponding It provides a practical and effective way to perform advanced semantic annotation, video database establishment and intelligent retrieval on video streams. It has ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/32G06K9/62G06N3/04G06F40/258G06F40/295G06F16/903
CPCG06F40/258G06F40/295G06F16/90344G06V20/635G06V30/10G06N3/044G06N3/045G06F18/22G06F18/23
Inventor 温序铭牟骏杰谢超平
Owner CHENGDU SOBEY DIGITAL TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products