Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Video transcoding method and device based on geometric generation model, medium and equipment

A generative model and video transcoding technology, applied in the field of video transcoding, can solve the problems of GAN mode collapse and convergence, learning multiple modes, generator difficulty, etc., to achieve favorable preservation and transmission, good zoom effect, and rich images realistic effect

Active Publication Date: 2021-09-03
SOUTH CHINA UNIV OF TECH
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Among them, the super-resolution method based on the generative confrontation network (GAN) can generate more realistic and clear texture and other details, so it is widely used in the field of image generation. More commonly used are super-resolution, image translation and human pose generation, etc., but In practical applications, confrontation learning is required between the discriminator D and the generator G. If the data distribution has multiple clusters or is distributed in multiple isolated manifolds, it is difficult for the generator to learn multiple patterns well, and it is easy to Phenomena that cause GAN mode collapse and convergence difficulties

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video transcoding method and device based on geometric generation model, medium and equipment
  • Video transcoding method and device based on geometric generation model, medium and equipment
  • Video transcoding method and device based on geometric generation model, medium and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0070] This embodiment discloses a video transcoding method based on a geometric generative model. In this method, the common MPEG-4 standard-definition video is reconstructed through video super-resolution, and then encoded into an H.265 standard ultra-high-definition video, so as to facilitate super-resolution Transmission and storage of high-definition video. Such as figure 1 As shown, the steps of the method include:

[0071] S1. Build a geometric generation model; in this embodiment, such as figure 2 As shown in , the constructed geometric generative model is an improved Encoder-Decoder (encoding-decoding) structure, including an encoding part and a decoding part; where:

[0072] The encoding part includes two down-sampling layers connected in sequence, namely the first down-sampling layer and the second down-sampling layer;

[0073] The decoding part includes the first upsampling layer, the second upsampling layer and the convolution layer; the upsampling processing ...

Embodiment 2

[0124] This embodiment discloses a video transcoding device based on a geometric generation model. The device includes a model building module, a model training module, an acquisition module, a video decoding module, a video reconstruction module, and a video encoding module. The functions of each module are as follows:

[0125] A model building module is used to build a geometric generation model; in this embodiment, the constructed geometric generation model is as follows figure 2 Shown in, the specific structure description sees embodiment 1.

[0126] The model training module is used to train the constructed geometric generation model to obtain a super-resolution reconstruction model.

[0127] Get module for getting MPEG-4 video.

[0128] The video decoding module is used to decode the MPEG-4 video, and save it as continuous still pictures after decoding.

[0129] The video reconstruction module is used for performing super-resolution zoom-in reconstruction on each fram...

Embodiment 3

[0133] This embodiment discloses a storage medium, which stores a program. When the program is executed by a processor, the video transcoding method based on the geometric generation model described in Embodiment 1 is implemented as follows:

[0134] Construct a geometric generation model, and obtain a super-resolution reconstruction model after training;

[0135] get MPEG-4 video;

[0136] Decode MPEG-4 video and save it as continuous still pictures after decoding;

[0137] For each frame of picture obtained after MPEG-4 video decoding, the super-resolution reconstruction model is used to perform super-resolution zoom-in reconstruction of each frame of picture;

[0138] For super-resolution enlarged and reconstructed images, encode them into H.265 format video.

[0139] In this embodiment, reference may be made to the above-mentioned Embodiment 1 for specific implementation of the above-mentioned processes, and details are not repeated here.

[0140] In this embodiment, th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a video transcoding method and device based on a geometric generation model, a medium and equipment, and the method comprises the steps: constructing the geometric generation model, and obtaining a super-resolution reconstruction model after training; obtaining an MPEG-4 video; decoding the MPEG-4 video, and storing the decoded MPEG-4 video in a continuous static picture form; for each frame of picture obtained after the MPEG-4 video is decoded, performing super-resolution amplification reconstruction on each frame of picture through a super-resolution reconstruction model; and encoding the image after super-resolution amplification and reconstruction into a video in an H.265 format. According to the method, the problem of mode collapse in the generative adversarial network is solved from the perspective of geometry through the geometric generative model, adversarial learning is avoided, and the generated image is richer and more vivid.

Description

technical field [0001] The present invention relates to a video transcoding method, in particular to a video transcoding method, device, medium and equipment based on a geometric generation model. Background technique [0002] With the widespread popularity of the Internet, the Internet has increasingly become the main way for us to obtain information in our daily life. With the rapid rise of industries such as online video and live broadcast, people have put forward higher requirements for the quality of video, and the demand for high-definition or even ultra-high-definition video getting bigger. In recent years, domestic 4K and 8K video devices have been sought after by the public, but there are not enough ultra-high-definition video resources on the Internet, so these ultra-high-definition video devices can only play ordinary high-definition videos most of the time, and cannot take advantage of their hardware. The lack of ultra-high-definition video resources is due to t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06T3/40G06T7/13G06T7/136G06T9/20
CPCG06T3/4053G06T7/13G06T7/136G06T9/20G06T2207/10016
Inventor 刘文顺孙季丰赵帅
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products