Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Uni-modal based fast half-pel and fast quarter-pel refinement for video encoding

a video encoder and quarter-pixel technology, applied in the field of video compression, can solve the problems of large increase in complexity, inability to implement design, and difficulty in encoding time, so as to reduce the impact of aliasing, improve the efficiency of estimation of motion vectors with quarter-pixel accuracy, and focus on quality or speed

Inactive Publication Date: 2011-06-09
SONY GRP CORP +1
View PDF27 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0015]A method of half-pixel interpolation and quarter-pixel interpolation are adapted for reducing the impact of aliasing within motion estimation. To estimate a motion vector with quarter-pixel accuracy more efficiently, the improved method is able to skip checking certain points using the uni-modal assumption. In an embodiment, a diamond based refinement is implemented. Within the diamond based refinement are half-pel refinement and quarter-pel refinements. Furthermore, within the half-pel refinement are on-the-fly interpolation and pre-computed interpolation. Within quarter-pel refinement, the method depends on whether four neighbor half-pel points are checked or just one or two half-pel points. Moreover, within each of the different embodiments is the ability to focus on quality or speed wherein different methods are implemented to maximize the desired function. In another embodiment, a square based refinement is implemented.

Problems solved by technology

Furthermore, it was desired to make these improvements without such a large increase in complexity that the design is impractical to implement.
However, there is a trade-off between the size to which a frame can be compressed versus the processing time and resources required to encode such a compressed frame.
The ratio of I, P and B-frames in the GOP structure is determined by the nature of the video stream and the bandwidth constraints on the output stream, although encoding time may also be an issue.
This is particularly true in live transmission and in real-time environments with limited computing resources, as a stream containing many B-frames can take much longer to encode than an I-frame-only file.
One of the most time-consuming components within the encoding process is motion estimation.
Motion estimation-related aliasing is not able to be avoided by using inter-pixel motion estimation, and the aliasing deteriorates the prediction efficiency.
As a consequence, the computation complexity of searching the half-pixel motion vector and quarter-pixel motion vector becomes dominant.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Uni-modal based fast half-pel and fast quarter-pel refinement for video encoding
  • Uni-modal based fast half-pel and fast quarter-pel refinement for video encoding
  • Uni-modal based fast half-pel and fast quarter-pel refinement for video encoding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033]FIG. 1 shows a block diagram of the video coding layer 100 of a macroblock. The video coding layer 100 (e.g. the encoder) includes a combination of temporal and spatial predictions along with transform coding. An input video 102 is received and split into a plurality of blocks. The first picture of a sequence is usually “intra” coded using only information contained within itself. Each part of a block in an intra frame is then predicted at the intra prediction module 110 using spatially neighboring samples of previously coded blocks. The encoding process chooses which neighboring samples are utilized for intra prediction and how they are used. This process is conducted at the local decoder 118 as well as at the encoder 100. For the rest of the pictures of a sequence, usually “inter” coding is used. Inter coding implements motion compensation 112 from other previously decoded pictures. The encoding process for inter prediction / motion estimation at the motion estimation module 1...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method of half-pixel interpolation and quarter-pixel interpolation are adapted for reducing the impact of aliasing within motion estimation. To estimate a motion vector with quarter-pixel accuracy more efficiently, the improved method is able to skip checking certain points using the uni-modal assumption. In an embodiment, a diamond based refinement is implemented. Within the diamond based refinement are half-pel refinement and quarter-pel refinements. Furthermore, within the half-pel refinement are methods for on-the-fly interpolation and pre-computed interpolation. Within quarter-pel refinement, the method depends on whether four neighbor half-pel points are checked or just one or two half-pel points. Moreover, within each of the different embodiments is the ability to focus on quality or speed wherein different methods are implemented to maximize the desired function. In another embodiment, a square based refinement is implemented.

Description

FIELD OF THE INVENTION[0001]The present invention relates to the field of video compression. More specifically, the present invention relates to improved motion estimation in digital video encoders.BACKGROUND OF THE INVENTION[0002]A video sequence consists of a number of pictures, usually called frames. Subsequent frames are very similar, thus containing a lot of redundancy from one frame to the next. Before being efficiently transmitted over a channel or stored in memory, video data is compressed to conserve both bandwidth and memory. The goal is to remove the redundancy to gain better compression ratios. A first video compression approach is to subtract a reference frame from a given frame to generate a relative difference. A compressed frame contains less information than the reference frame. The relative difference can be encoded at a lower bit-rate with the same quality. The decoder reconstructs the original frame by adding the relative difference to the reference frame.[0003]A...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(United States)
IPC IPC(8): H04N7/26
CPCH04N19/577H04N19/523
Inventor ZHANG, XIMIN
Owner SONY GRP CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products