There is disclosed a system and method for video coding, and more particularly video coding that uses structural similarity (SSIM) based rate-distortion optimization methods to improve the perceptual quality of decoded video without increasing the data rate, or to reduce the data rate of a compressed video stream without sacrificing the perceived quality of the decoded video. In an embodiment, the video coding
system and method may be an SSIM-based rate-distortion optimization approach that involves minimizing a joint cost function defined as the sum of a data rate term and a distortion function. The distortion function may be defined to be monotonically increasing as SSIM decreases, and a Lagrange parameter may be utilized to control the trade-off between rate and distortion. The optimal Lagrange parameter may be found by utilizing the ratio between a reduced-reference SSIM model with respect to the quantization step and a data rate model with respect to the quantization step. In an embodiment, a group-of-picture (GOP) level quantization parameter (QP) adjustment method may be used in multi-pass encoding to reduce the bit rate while maintaining similar perceptual
video quality. In another embodiment, a frame level QP adjustment method may be used in single-pass encoding to achieve constant SSIM quality. In accordance with an embodiment, the present invention may be implemented entirely at the encoder side and may or may not require any change at the decoder, and may be made compatible with existing video coding standards.
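
By way of non-limiting illustration only, the joint cost minimization described above may be sketched in Python as follows. The sketch assumes a distortion of the form 1 - SSIM, which is one possible function that increases as SSIM decreases, and it interprets the ratio used for the Lagrange parameter as a ratio of derivatives with respect to the quantization step, which is an assumption rather than a statement of the disclosed method. The names `toy_ssim_model`, `toy_rate_model`, `lagrange_parameter`, and `best_candidate`, as well as all numeric values, are hypothetical and are not taken from the disclosure.

```python
from typing import Callable, List, Tuple

# A candidate coding option: (label, predicted SSIM, predicted bits).
Candidate = Tuple[str, float, float]


def distortion(ssim: float) -> float:
    """Assumed distortion: grows monotonically as SSIM decreases."""
    return 1.0 - ssim


def derivative(f: Callable[[float], float], x: float, h: float = 1e-4) -> float:
    """Central-difference estimate of df/dx."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def lagrange_parameter(ssim_model: Callable[[float], float],
                       rate_model: Callable[[float], float],
                       q_step: float) -> float:
    """Assumed form: lambda = -(dD/dQ) / (dR/dQ), with D = 1 - SSIM(Q)."""
    d_dist = -derivative(ssim_model, q_step)  # dD/dQ, positive when SSIM falls with Q
    d_rate = derivative(rate_model, q_step)   # dR/dQ, negative when rate falls with Q
    return -d_dist / d_rate


def best_candidate(candidates: List[Candidate], lam: float) -> Candidate:
    """Pick the candidate minimizing the joint cost J = D + lambda * R."""
    return min(candidates, key=lambda c: distortion(c[1]) + lam * c[2])


# Hypothetical toy models of SSIM and data rate versus quantization step.
def toy_ssim_model(q: float) -> float:
    return 1.0 / (1.0 + 0.001 * q * q)


def toy_rate_model(q: float) -> float:
    return 2000.0 / q


lam = lagrange_parameter(toy_ssim_model, toy_rate_model, q_step=10.0)

# Hypothetical per-block candidates with made-up SSIM and bit counts.
candidates = [
    ("mode_a", 0.95, 120.0),
    ("mode_b", 0.93, 80.0),
    ("mode_c", 0.85, 50.0),
]
choice = best_candidate(candidates, lam)
print(lam, choice)
```

In this sketch, a larger Lagrange parameter weights the data rate term more heavily and so favors candidates with fewer bits, whereas a smaller value favors candidates with higher SSIM. An actual encoder would substitute its own SSIM and rate models for the toy functions.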