Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Device and method for expanding speech bandwidth based on audio watermarking

A technology of audio watermarking and bandwidth expansion, applied in speech analysis, instruments, etc., can solve problems such as equalization and inability to achieve

Inactive Publication Date: 2013-08-14
DALIAN UNIV OF TECH
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] Disadvantages of the prior art: the above methods cannot achieve a good balance in the three aspects of robustness, concealment and the number of embedded watermarks, and each has its own shortcomings, so it cannot be better used for voice bandwidth expansion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Device and method for expanding speech bandwidth based on audio watermarking
  • Device and method for expanding speech bandwidth based on audio watermarking
  • Device and method for expanding speech bandwidth based on audio watermarking

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0130] The present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.

[0131] figure 1 A complete functional block diagram of the present invention is given. At the beginning, the human voice is a wideband signal. Before transmission through the telephone line, the high frequency parameters are embedded into the narrowband code stream, and the narrowband voice signal is transmitted through the telephone line; A-law decoding is performed at the receiving end, and then the high frequency parameters are used The extraction module extracts high-frequency parameters, uses the high-frequency parameter synthesis module to restore the high-frequency part in the broadband speech, and finally synthesizes the high-frequency speech and the low-frequency speech into broadband speech.

[0132] Each module involved in the principle block diagram of the present invention is introduced as follows:

[0133] 1. QMF analysis filter bank ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention disclosed a device and a method for expanding speech bandwidth based on an audio watermarking. The device and the method are as follows: in the starting part, the speech sent by a person is a bandwidth signal; before the speech is transmitted by a telephone line, high-frequency parameters are embedded to a narrow band code stream; the narrow band speech signal is transmitted by the telephone line; A-law decoding is performed on a receiving end, and then the high-frequency parameters are extracted; the high-frequency part in the bandwidth speech is recovered by the high-frequencyparameters; finally, the high-frequency speech and low-frequency speech are synthesized to be a bandwidth speech. The device and the method use characteristics of the audio watermarking to build a hidden channel in the narrow band speech and uses the channel to transmit the parameters of the high-frequency speech to achieve band extension of the speech signal without changing the original networkprotocol.

Description

technical field [0001] The invention relates to speech processing technology, in particular to a device and method for extending speech bandwidth based on audio watermark. Background technique [0002] The main energy of the human speech signal is concentrated in 0.3-3.4KHz, and the bandwidth of 4KHz can guarantee sufficient intelligibility. Therefore, the sampling frequency of the public telephone network (PSTN) coding standard G.711 (that is, A law and μ law) formulated by the International Telecommunication Union (ITU) is 8KHz, and it has been used until now. [0003] Narrowband speech reduces the demand for communication bandwidth while ensuring a certain intelligibility, but this is at the expense of the naturalness of speech. Narrowband speech loses the high-frequency components of the original speech, so it doesn't sound natural. In order to improve voice quality, ITU-T proposed the first wideband voice codec G.722 for teleconferencing. Broadband voice communicatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L19/018G10L21/038G10L21/04
Inventor 陈喆殷福亮赵承勇
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products