A method for objective assessment of speech quality based on auditory perception characteristics

An objective voice quality evaluation technology, applied in voice analysis, instruments, etc. It addresses problems such as high computational complexity and poor suitability for real-time evaluation of voice quality, with the effects of improving correlation, facilitating performance analysis, and avoiding energy leakage.

Active Publication Date: 2018-03-06
HUNAN INST OF METROLOGY & TEST +1
4 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0005] P.862, Perceptual Evaluation of Speech Quality (PESQ), released by ITU-T in 2001, is currently a high-performance objective method for evaluating voice quality, and it handles communication delay, environmental noise, and errors well. However, it is based on a perceptual model of the Bark spectrum and has high computational complexity, which is not conducive to real-time evaluation of voice quality.

Examples

Embodiment Construction

[0044] 1. Gammatone filter

[0045] The Gammatone filter is a standard cochlear auditory filter, and the time-domain impulse response of the filter is:

[0046] $$g(t) = B^{n}\, t^{\,n-1}\, e^{-2\pi B t} \cos(2\pi f_0 t + \varphi)\, u(t) \tag{1}$$

[0047] Here $u(t)$ is the unit step function: $u(t)=0$ when $t<0$ and $u(t)=1$ when $t>0$. The parameter $B = b_1\,\mathrm{ERB}(f_0)$, where $\mathrm{ERB}(f_0)$ is the equivalent rectangular bandwidth (ERB) of the Gammatone filter, that is, the width of the rectangular filter that passes the same energy as the specified filter for the same white-noise input. Its relation to the Gammatone filter center frequency $f_0$ is $\mathrm{ERB}(f_0) = 24.7 + 0.108 f_0$. The parameter $b_1 = 1.019$ is introduced to make the function more consistent with physiological data. $n$ is the order of the filter; research shows that a Gammatone filter with $n=4$ simulates the filtering characteristics of the basilar membrane well. The parameter $\varphi$ is the initial phase of the filter.
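As an illustration of equation (1), the following minimal Python sketch samples the impulse response of a single Gammatone filter using the parameter values given above (b_1 = 1.019, n = 4, ERB(f_0) = 24.7 + 0.108 f_0). The function names, the sampling rate, the 25 ms duration, and the 1000 Hz center frequency are assumptions chosen for illustration and do not come from the patent. The sketch is unnormalized and is not the filter bank implementation used in the invention.

```python
import math

B1 = 1.019  # bandwidth scaling parameter b_1 from paragraph [0047]


def erb(f0):
    """Equivalent rectangular bandwidth in Hz: ERB(f0) = 24.7 + 0.108 * f0."""
    return 24.7 + 0.108 * f0


def gammatone_ir(f0, fs, duration=0.025, n=4, phi=0.0):
    """Sample g(t) = B^n t^(n-1) exp(-2 pi B t) cos(2 pi f0 t + phi) for t >= 0.

    f0: center frequency in Hz; fs: sampling rate in Hz (both illustrative).
    """
    b = B1 * erb(f0)
    num_samples = int(round(duration * fs))
    samples = []
    for k in range(num_samples):
        t = k / fs  # only t >= 0 is sampled, where the step u(t) contributes 1
        envelope = b ** n * t ** (n - 1) * math.exp(-2.0 * math.pi * b * t)
        samples.append(envelope * math.cos(2.0 * math.pi * f0 * t + phi))
    return samples


def envelope_peak_time(f0, n=4):
    """Time at which the envelope t^(n-1) exp(-2 pi B t) peaks: (n-1)/(2 pi B)."""
    return (n - 1) / (2.0 * math.pi * B1 * erb(f0))


# Hypothetical example: one channel centered at 1000 Hz, sampled at 16 kHz.
ir = gammatone_ir(f0=1000.0, fs=16000)
```

Because B grows with f_0, channels at higher center frequencies have wider bandwidths and shorter impulse responses, which is the behavior the ERB relation is meant to capture.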

[0048] The frequency response charac...

Abstract

A method for objective evaluation of speech quality based on auditory perception characteristics, characterized in that the method adds Gammatone filter bank filtering to the Bark spectrum module in spectrum mapping. The concrete steps are: 1) the reference signal and the degraded signal are processed by POLQA and then enter the core model; 2) in the spectrum mapping of the core model, the Bark spectrum module adds the Gammatone filter bank for filtering and then performs an auditory transformation, so that the extracted auditory spectrum is closer to human hearing; 3) after the auditory transformation, interference analysis is performed to analyze the distortion of the degraded signal relative to the reference signal, and the objective evaluation MOS score is obtained. Compared with other methods, the present invention effectively improves the correlation between the objective evaluation result and the subjective evaluation result.

Description

Technical field

[0001] The invention relates to the technical field of speech signal processing, in particular to a method for objectively evaluating speech quality based on auditory perception characteristics.

Background technique

[0002] Voice quality evaluation can be divided into two categories according to the evaluation subject: subjective evaluation and objective evaluation.

[0003] Subjective evaluation uses people as the main body to evaluate the quality of speech. Although this method is relatively complicated, since people are the final recipients of speech, this evaluation is a true reflection of speech quality. The Mean Opinion Score (MOS) proposed by the ITU organization in 1996 is a widely used subjective evaluation method, which uses the average opinion score of testers to intuitively reflect people's perception of voice quality. The advantage of subjective evaluation is that it conforms to people's feelings about voice quality, but the disadvantages are that it i...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G10L25/60
Inventor: 李庆先, 刘良江, 卞昕, 柏文琦, 周鑫, 彭正梁, 徐昱
Owner: HUNAN INST OF METROLOGY & TEST