3D video intelligent multi-domain joint predictive coding method and device

A technology for predictive coding and viewpoint synthesis prediction, applied in the field of 3D video coding, to achieve the effect of improving coding efficiency and saving bit rate

Active Publication Date: 2020-09-15
TIANJIN UNIV
View PDF11 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The invention provides a 3D video intelligent multi-domain joint predictive coding method and device. The invention comprehensively analyzes and mines the time domain, space domain and viewpoint domain correlation of 3D video, proposes to use CNN to fuse multi-domain reference information, and proposes a hierarchical A multi-domain prediction mechanism is used to solve the problem of multi-domain reference information fusion; in addition, in the hierarchical prediction mechanism, an effective multi-domain joint prediction network is constructed, and a multi-scale coding unit is designed in the network to extract features. Use CNN to solve the multi-domain joint prediction problem of 3D video, see the description below for details:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • 3D video intelligent multi-domain joint predictive coding method and device
  • 3D video intelligent multi-domain joint predictive coding method and device
  • 3D video intelligent multi-domain joint predictive coding method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] The embodiment of the present invention proposes a 3D video intelligent multi-domain joint predictive coding method, see figure 1 , the method builds a suitable multi-domain joint prediction network, takes multi-domain reference information as input, and outputs the multi-domain prediction result of the current coding block. The specific implementation steps are as follows:

[0041] 1. Obtain multi-domain reference information

[0042] 3D video sequences have rich multi-domain correlations within frames, between frames, and between viewpoints. For the current coded block, it has spatial correlation with the adjacent coded pixel area in the frame, has temporal correlation with the co-located block in the inter-frame coded reference frame, and has a viewpoint with the co-located block in the coded reference frame of the adjacent view domain dependencies. The representation of the multi-domain reference information in the present invention and how to obtain it are explai...

Embodiment 2

[0070] Combine below Figure 1-Figure 5 Carry out feasibility verification to the scheme in embodiment 1, see the following description for details:

[0071] figure 1 The technical flow chart of the present invention is given, which mainly includes obtaining multi-domain reference information, constructing a hierarchical multi-domain prediction mechanism, constructing a spatio-temporal prediction network, obtaining spatio-temporal domain prediction results, obtaining viewpoint synthesis prediction blocks, constructing a multi-domain joint prediction network, obtaining There are six parts: comparison of multi-domain prediction results and rate-distortion cost, and selection of the optimal mode.

[0072] figure 2 The hierarchical prediction framework proposed by the present invention is given. It can be seen from the figure that the method includes a spatio-temporal prediction network and a multi-domain joint prediction network. Together with the view synthesis prediction bl...

Embodiment 3

[0077] A 3D video intelligent multi-domain joint predictive coding device, the device includes: a memory, a processor, and a computer program stored on the memory and operable on the processor, and the method described in Embodiment 1 is implemented when the processor executes the program step.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a 3D video intelligent multi-domain joint prediction coding method and device. The method comprises the following steps: (1) obtaining multi-domain reference information: taking reconstructed pixel regions on the left side, above and above the left side of a current coding block in a step length range as spatial domain reference information, taking an inter-frame predictionblock of the time domain correlation of adjacent frames as time domain reference information, and taking a viewpoint synthesis prediction block obtained by a viewpoint synthesis prediction technologyas inter-viewpoint reference information; (2) constructing a time-space prediction network, and obtaining a time-space domain prediction result by taking time-space domain reference information as input; and (3) constructing a multi-domain joint prediction network according to the time-space domain prediction result and the viewpoint synthesis prediction block to obtain a final multi-domain prediction result. The device comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor, and the processor implements the method steps when executingthe program.

Description

technical field [0001] The present invention relates to the field of 3D video coding, in particular to a 3D video intelligent multi-domain joint predictive coding method and device. Background technique [0002] With the development of 3D technology, 3D video coding has become a major research hotspot in the field of multimedia. Compared with 2D video, 3D video has more data volume, which brings great challenges to video storage and transmission. Therefore, how to realize efficient 3D video compression coding has important theoretical research significance and practical application value. [0003] As a new generation video coding standard, HEVC (High Efficiency Video Coding) effectively improves compression efficiency. As a 3D extension of HEVC, 3D-HEVC adopts a coding architecture based on the MVD (Multiview Videoplus Depth, multi-viewpoint plus depth) video format. Based on HEVC’s existing technology, it adds a new technology for multi-viewpoint video and depth video cod...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04N19/597H04N19/147H04N19/149H04N19/103H04N19/50G06N3/04G06N3/08
CPCH04N19/597H04N19/147H04N19/149H04N19/103H04N19/50G06N3/08G06N3/045
Inventor 雷建军石雅南侯春萍张宗千彭勃
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products