Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Background noise excitation signal generating method and apparatus

A technology of excitation signal and background noise, applied in the field of communication, to achieve the effect of comfortable and natural transition to the human ear

Active Publication Date: 2009-01-07
HUAWEI TECH CO LTD
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0035] Embodiments of the present invention provide a method and device for generating background noise excitation signals to solve the problem of a more natural, smooth and continuous transition when the signal frame is converted from speech to background noise

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Background noise excitation signal generating method and apparatus
  • Background noise excitation signal generating method and apparatus
  • Background noise excitation signal generating method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0068] Embodiment 1 is the implementation process of the present invention applied in 729B CNG. It should be noted that, in 729B, the maximum value of the pitch delay T is 143, and the specific process is as follows:

[0069] (1) The speech codec receives each speech coding frame, and saves the coding parameters of the speech coding frame, and the coding parameters include the excitation signal and the pitch delay Pitch of the last subframe. The excitation signal can be stored in the excitation signal memory old_exc(i) in real time, where i∈[0, 142], since the frame length of 729B is 80, the excitation signal of the last two frames is cached in the excitation signal memory old_exc(i) Of course, the excitation signal memory old_exc(i) may also cache the latest frame, multiple frames or less than one frame according to the actual situation.

[0070] (2), when the signal frame is converted from the speech coding frame to the background noise coding frame, the transition length N ...

Embodiment 2

[0082] Embodiment 2 is the implementation process of the embodiment of the present invention applied in Adaptive Multirate Codec (AMR, AdaptiveMultirate Codec) CNG. It should be noted that, in AMR, the maximum value of the pitch delay T is 143. The specific implementation process for:

[0083] (1) The speech codec receives each speech coded frame, and saves the coding parameters of the speech coded frame, including the excitation signal and the pitch delay Pitch of the last subframe. The excitation signal is stored in the excitation signal memory old_exc(i) in real time, where i∈[0, 142], since the frame length of AMR is 160, only the excitation signal of the latest frame is cached in the excitation signal memory old_exc(i) Of course, the excitation signal memory old_exc(i) may also cache the latest frame, multiple frames or less than one frame according to the actual situation.

[0084] (2), when converting from speech coding frame to background noise coding frame, the trans...

Embodiment 3

[0096] The third embodiment is the implementation process of the present invention applied in G.729.1CNG.

[0097] G.729.1 is a speech coder recently released by the International Telecommunication Union (ITU, International Telecommunication Union). It is a broadband speech coder, that is, the bandwidth of the speech signal to be processed is 50-7000 Hz. During specific processing, the input signal is processed by It is divided into high-frequency band (4000-7000Hz) and low-frequency band (50-4000Hz) for processing respectively. The low-frequency band adopts the CELP model, which is the basic model of speech processing, and 729, AMR and other encoders use the this model. The basic signal processing frame length of G.729.1 is 20 ms, which is called a super frame. Each super frame has 320 signal sampling points. After frequency band division, each frequency band signal sampling point in the super frame is 160 points. At the same time, G.729.1 also defines the CNG system for pro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a generating method and device for a background noise excitation signal, and the method comprises: excitation signals bound are generated by using the coding parameters in the phase of voice coding and decoding and the transitional length of the excitation signals; the excitation signals bound and the random excitation signals of the background noise coding frame are carried out weighted sum, and the excitation signals of the background noise in the transitional phase are obtained. Meanwhile, the device comprises: an excitation signals bound generation unit and a unit for obtaining excitation signals in the transitional phase. After the scheme of the synthesis of the comfortable background noise of the invention is adopted, when the synthesis signals are transferred from the voice to the background noise, the transition is more natural, smooth and continuous, and the feelings of ears of the hearers are more comfortable.

Description

technical field [0001] The present invention relates to the communication field, in particular to a method and device for generating background noise excitation signals. Background technique [0002] In voice communication, the voice processing is mainly done by the voice codec. Because the voice signal has short-term stability, the voice codec generally processes the voice signal by frame, and each frame is 10-30ms. The initial speech codecs are all fixed-rate, that is, each speech codec has only a fixed coding rate, such as the coding rate of the speech codec G.729 is 8kbit / s, and the rate of G.728 is 16kbit / s. These traditional fixed-rate speech codecs generally speaking, the encoding algorithm of higher rate can guarantee the encoding quality more easily, but occupies larger communication channel resources; The encoding algorithm of lower rate occupies less communication channel resources, but It is not easy to guarantee the encoding quality. [0003] Speech signals ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L19/00G10L19/08G10L19/14H03M7/30G10L19/012
CPCG10L19/012G10L19/08
Inventor 艾雅·舒默特代金良汪林张立斌
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products