Psychoacoustic model processing method based on advanced audio encoder
A psychoacoustic model and audio encoder technology, applied in instruments, speech analysis, etc. It addresses the inaccurate prediction of sub-band bit consumption, which reduces the efficiency of rate-distortion control, and aims to improve encoding quality and weaken the influence of pre-echo.
Active Publication Date: 2008-11-19
ZTE CORP
AI Technical Summary
Problems solved by technology
In the conventional approach, the predicted bit consumption of each sub-band is obtained simply by multiplying the normalized perceptual entropy by the number of bits available in the current frame. This prediction is not very accurate, which in turn reduces the efficiency of rate-distortion control.
Moreover, because the traditional psychoacoustic model considers only the simultaneous masking effect of the human ear and ignores the non-simultaneous (temporal) masking effect, the encoder cannot exploit temporal masking to improve encoding quality. Once pre-masking fails, the quantization noise is no longer masked and pre-echo occurs. The sound quality will be greatly degraded when...
Although the AAC standard provides Temporal Noise Shaping (TNS) to weaken the influence of pre-echo, actual tests show that using this module worsens the sound quality.
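The baseline prediction criticized above can be sketched in a few lines. This is a minimal illustration of "normalized perceptual entropy times available bits", not code from the patent; the function name `predict_subband_bits` and the sample values are hypothetical.

```python
def predict_subband_bits(perceptual_entropy, available_bits):
    """Baseline prediction: share the frame's bits in proportion to each sub-band's PE."""
    total = sum(perceptual_entropy)
    if total <= 0:
        # Silent frame: no sub-band asks for bits.
        return [0.0] * len(perceptual_entropy)
    # Normalized PE (pe / total) multiplied by the bits available in the frame.
    return [available_bits * pe / total for pe in perceptual_entropy]


# Hypothetical per-sub-band perceptual entropies and frame budget.
bits = predict_subband_bits([12.0, 30.0, 6.0], available_bits=960)
print(bits)  # [240.0, 600.0, 120.0]
```

Because the split depends only on the relative perceptual entropies, it takes no account of temporal effects such as pre-echo, which is the weakness the text points out.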
Abstract
The invention discloses a psychoacoustic model processing method based on an advanced audio encoder. The method includes the following steps:
- A. The perceptual entropy threshold and the masking threshold of each coding sub-band are obtained from the psychoacoustic spectral energy of the sub-bands of the bit stream to be encoded, using a masking spreading matrix algorithm.
- B. The predicted bit consumption of each sub-band is calculated from the perceptual entropy threshold and the masking threshold of the coding sub-band, applying a time-frequency correction and a pre-echo correction.
- C. The psychoacoustic model outputs the predicted bit consumption of each sub-band, which then serves as a parameter for rate-distortion control in the encoding process.
The method obtains the sub-band bit consumption from the perceptual entropy more accurately, and the encoder uses the predicted value as a parameter for rate-distortion control, which improves the efficiency and quality of quantization and encoding.
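Step A can be pictured with a generic spreading-matrix calculation. The sketch below is not the patented algorithm: the spreading slopes (15 dB and 30 dB per band), the 18 dB signal-to-mask offset, the simplified perceptual-entropy formula, and all function names are assumptions chosen for illustration. The corrections of step B are left out because this abstract does not give their formulas.

```python
import math


def spreading_matrix(n_bands, up_db=15.0, down_db=30.0):
    """S[i][j]: fraction of band j's energy that masks band i (assumed slopes)."""
    matrix = []
    for i in range(n_bands):
        row = []
        for j in range(n_bands):
            # Assumption: masking spreads more easily toward higher bands
            # (i > j) than toward lower bands.
            atten_db = (i - j) * up_db if i >= j else (j - i) * down_db
            row.append(10.0 ** (-atten_db / 10.0))
        matrix.append(row)
    return matrix


def masking_thresholds(energies, snr_db=18.0):
    """Spread the sub-band energies, then lower them by an assumed required SNR."""
    spread = spreading_matrix(len(energies))
    offset = 10.0 ** (-snr_db / 10.0)
    return [offset * sum(s * e for s, e in zip(row, energies)) for row in spread]


def perceptual_entropy(energies, thresholds, widths):
    """Simplified PE per band: spectral lines times log2(energy / threshold)."""
    pe = []
    for e, t, w in zip(energies, thresholds, widths):
        # A band whose energy is below its threshold needs no bits.
        pe.append(w * math.log2(e / t) if e > t > 0 else 0.0)
    return pe


# Hypothetical sub-band energies and widths (spectral lines per band).
energies = [1000.0, 10.0, 1.0]
thresholds = masking_thresholds(energies)
print(perceptual_entropy(energies, thresholds, widths=[4, 4, 8]))
```

In this toy input the loud first band raises the thresholds of its neighbours, so the quieter bands receive a smaller perceptual entropy than they would in isolation.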
Description
Technical field
The invention relates to an advanced audio coder, and in particular to a processing method for the psychoacoustic model of the advanced audio coder.
Background
Advanced Audio Coding (AAC) is a transform-domain, lossy perceptual audio coding scheme. Lossy perceptual audio coding can achieve a high compression ratio, but its coding error (quantization noise) is inevitably high. To reduce the impact of quantization noise, lossy perceptual audio coding controls the distribution of coding errors according to the psychoacoustic properties of the human ear, so that the noise produced by quantization errors is difficult to detect. In lossy perceptual coding this is done by the psychoacoustic model, which controls the distribution of quantization errors by exploiting the auditory masking phenomenon of the human ear. Masking is a common psychoacoustic phenomenon, which is determined by the huma...
Application Information
IPC(8): G10L19/02, G10L19/00, G10L19/002
Inventor: 吴晟, 邱小军, 黎家力, 陈强
Owner: ZTE CORP