Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Audio and video processing method, device, equipmentand medium

A technology of audio and video processing and equipment, applied in the computer field, can solve problems such as increased labor costs, low accuracy, and limitations, and achieve the effects of reducing labor costs, improving accuracy and recall, and improving experience

Inactive Publication Date: 2019-01-22
GUANGZHOU BAIGUOYUAN INFORMATION TECH CO LTD
View PDF12 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the accuracy of short video tag classification is limited by the performance of the algorithm. If the algorithm performance is relatively poor, the accuracy of classifying short videos into different tags will be relatively low, which will consume a lot of manpower for review and increase labor costs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio and video processing method, device, equipmentand medium
  • Audio and video processing method, device, equipmentand medium
  • Audio and video processing method, device, equipmentand medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] The present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, but not to limit the present invention. In addition, it should be noted that, for the convenience of description, only parts related to the present invention are shown in the drawings but not all structures or components.

[0058] The existing technology uses three-dimensional convolution for label classification of video content. It is necessary to transform the two-dimensional convolutional neural network that processes a single image into a three-dimensional convolutional neural network that can process multiple images, so as to be directly used for convolution of image classification. Neural network, but the three-dimensional convolution leads to very large network parameters, making network training difficult, that is, there ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an audio and video processing method, a device, equipment and a medium, which relate to the computer technical field. The method comprisesseparating image frame information andaudio information from the video file; extracting image feature information and audio feature information from the image frame information and the audio information, respectively; fusing the image feature information and the audio feature information into video content feature information; determining a classification result corresponding to the video file according to the video content characteristic information. The invention combines the audio characteristic information in the video and the image characteristic information of the video frame for video classification, improves the accuracyand recall rate of the video classification, thereby reducing the labor cost of the video classification examination.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to an audio and video processing method, device, equipment and medium. Background technique [0002] With the rapid development of computer technology, deep learning technology has made great progress in many fields of image understanding. For example, deep learning technology is applied to tasks such as object classification, object detection, and object segmentation in images. So far, deep learning technology has been very mature in the field of image understanding, and has been gradually applied to video content understanding tasks. However, compared with image content understanding, video content understanding still has a long way to go. In video content understanding tasks, video classification is the most basic task, and the field of video classification has become a hotspot for many researchers. [0003] Specifically, video classification is mainly to classify vid...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): H04N21/234H04N21/239H04N21/439H04N21/44H04N21/466
CPCH04N21/23418H04N21/239H04N21/4394H04N21/44008H04N21/4665
Inventor 刘文奇刘运梁柱锦
Owner GUANGZHOU BAIGUOYUAN INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products