Video caption recognition method and device, equipment and storage medium

A subtitle recognition method, applied in the field of computer vision, that addresses poor recognition accuracy on complex video background images and low detection efficiency, and achieves the effect of improving accuracy.

Active Publication Date: 2020-08-25

TENCENT TECH (SHENZHEN) CO LTD

11 Cites, 19 Cited by


AI Technical Summary

Problems solved by technology

In the related art, when an OCR recognition model is used for subtitle text recognition, the complexity of the video background image can cause some characters to be misrecognized, so the accuracy of the subtitle recognition result is low.

In the related art, deep-learning-based algorithms such as CTPN or EAST are used for text region detection. In relatively simple scenarios they detect text well, but they take a long time, so detection efficiency is low.

The OCR recognition model in the related art has limitations in specific application scenarios. For example, the background behind video subtitles is complex, so subtitle recognition with the related-art OCR model has poor accuracy. As another example, there is no OCR recognition method for video subtitles in minority languages, so the subtitles of minority-language videos cannot be recognized.



Embodiment Construction

[0060] Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The drawings are merely schematic illustrations of the present disclosure and are not necessarily drawn to scale. The same reference numerals in the drawings denote the same or similar parts, and thus repeated descriptions thereof will be omitted.

[0061] Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided in order to give a thorough understanding of embodiments of the present disclosure. However, those skilled in...


Abstract

The invention provides a video caption recognition method and device, equipment and a storage medium, and relates to the technical field of computer vision. The method comprises the following steps: acquiring multiple frames of images from a to-be-recognized video containing subtitles; recognizing the subtitles in the frames to obtain an initial subtitle recognition result for each frame; obtaining the edit distance between the initial subtitle recognition results of each two adjacent frames; obtaining multiple consecutive similar frames based on those edit distances; obtaining the semantic credibility of the initial subtitle recognition results of the consecutive similar frames; and determining a final subtitle recognition result for the consecutive similar frames according to the semantic credibility. The invention improves the accuracy of video subtitle recognition results to a certain extent.
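The grouping and selection steps in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the `max_distance` threshold, and the sample OCR strings are all hypothetical, and `toy_credibility` is only a placeholder because the text available here does not say how semantic credibility is computed.

```python
from typing import Callable, List


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum inserts, deletes, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def group_similar_frames(texts: List[str], max_distance: int = 2) -> List[List[str]]:
    """Split per-frame OCR results into runs of consecutive similar frames.

    Two adjacent frames fall in the same run when the edit distance between
    their initial results is at most max_distance (an assumed threshold).
    """
    groups: List[List[str]] = []
    for text in texts:
        if groups and edit_distance(groups[-1][-1], text) <= max_distance:
            groups[-1].append(text)
        else:
            groups.append([text])
    return groups


def pick_final_subtitle(group: List[str], score: Callable[[str], float]) -> str:
    """Return the candidate in the run with the highest credibility score."""
    return max(group, key=score)


# Placeholder scorer (assumption): share of words found in a tiny vocabulary.
# A real system would use some language-model-based measure instead.
VOCAB = {"hello", "world", "good", "morning"}


def toy_credibility(text: str) -> float:
    words = text.lower().split()
    return sum(w in VOCAB for w in words) / len(words) if words else 0.0


# Hypothetical initial OCR results for five consecutive frames.
frames = ["hello world", "hel1o world", "hello world",
          "good morning", "good morning"]
groups = group_similar_frames(frames)
final = [pick_final_subtitle(g, toy_credibility) for g in groups]
print(final)
```

Comparing only adjacent frames keeps the grouping to a single pass over the video, and passing the scorer in as a function leaves the choice of credibility measure open.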

Description

Technical field

[0001] The present disclosure relates to the technical field of computer vision, and in particular, to a video subtitle recognition method, device, equipment and readable storage medium.

Background

[0002] With the development of computer technology and the Internet, videos are available to users in a growing range of languages. When users process videos in various languages, they can extract and recognize the subtitles through video subtitle extraction technology for various purposes, such as video classification.

[0003] Optical character recognition (OCR) technology is usually used to recognize video subtitles. OCR solutions generally include two steps: 1) text area detection: find the area containing text; 2) text recognition: identify the text in the area. In the related art, when the OCR recognition model is used for subtitle text recognition, due to the complexity of the vide...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G06K9/00, G06K9/32, G06K9/62, G06N3/04

CPC: G06N3/049, G06V20/41, G06V20/635, G06V30/10, G06N3/047, G06N3/045, G06F18/22, G06F18/2415

Inventor: 彭俊石吴飞彭艺

Owner: TENCENT TECH (SHENZHEN) CO LTD
