Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Video caption recognition method and device, equipment and storage medium

A recognition method and subtitle technology, applied in the field of computer vision, can solve problems such as poor accuracy, complex video background images, and low detection efficiency, and achieve the effect of improving accuracy

Active Publication Date: 2020-08-25
TENCENT TECH (SHENZHEN) CO LTD
View PDF11 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the related art, when the OCR recognition model is used for subtitle text recognition, due to the complexity of the video background image, some character recognition errors may occur, and the accuracy of the subtitle recognition result is low
In related technologies, CTPN or EAST algorithms based on deep learning are used for text region detection. In relatively simple scenarios, the detection effect is better, but it takes a long time and the detection efficiency is low.
The OCR recognition model in the related art has limitations when identifying specific application scenarios. For example, when recognizing video subtitles, the background of the video subtitles is complex, and the accuracy of subtitle recognition using the OCR recognition model in the related art is poor. ; Another example is that there is no OCR recognition method for small-language video subtitles, and the subtitles of small-language videos cannot be recognized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video caption recognition method and device, equipment and storage medium
  • Video caption recognition method and device, equipment and storage medium
  • Video caption recognition method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The drawings are merely schematic illustrations of the present disclosure and are not necessarily drawn to scale. The same reference numerals in the drawings denote the same or similar parts, and thus repeated descriptions thereof will be omitted.

[0061] Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided in order to give a thorough understanding of embodiments of the present disclosure. However, those skilled in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a video caption recognition method and device, equipment and a storage medium, and relates to the technical field of computer vision. The method comprises the following steps: acquiring multiple frames of images from a to-be-identified video containing subtitles; identifying subtitles in the multiple frames of images to obtain an initial subtitle identification result of each frame of image; obtaining an editing distance between initial subtitle identification results of two adjacent frames of images in the multiple frames of images; obtaining a plurality of frames of continuous similar images based on the editing distance between the initial subtitle recognition results of the two adjacent frames of images; obtaining semantic credibility of an initial subtitle recognition result of the multi-frame continuous similar images; and determining a final caption identification result of the multi-frame continuous similar images according to the semantic credibility. The invention improves the accuracy of the identification result of the video subtitles to a certain extent.

Description

technical field [0001] The present disclosure relates to the technical field of computer vision, and in particular, to a video subtitle recognition method, device, equipment and readable storage medium. Background technique [0002] With the development of computer technology and the Internet, the language types of videos available to users are also becoming more and more abundant. When users process videos in various languages, they can extract and identify subtitles from videos through video subtitle extraction technology for various purposes, such as video classification. [0003] Optical character recognition (Optical Character Recognition, OCR) technology is usually used to identify video subtitles. OCR solutions generally include two steps: 1) text area detection: find the area containing text; 2) text recognition: identify the text in the area. In the related art, when the OCR recognition model is used for subtitle text recognition, due to the complexity of the vide...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/32G06K9/62G06N3/04
CPCG06N3/049G06V20/41G06V20/635G06V30/10G06N3/047G06N3/045G06F18/22G06F18/2415
Inventor 彭俊石吴飞彭艺
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products