A method and device for voice adaptive discontinuous transmission

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An adaptive and non-continuous technology, applied in speech analysis, instruments, etc., can solve problems such as high computational complexity and inability to flexibly track signal changes, and achieve low average bit rate and guaranteed sound quality

Active Publication Date: 2017-04-12

ZTE CORP

View PDF5 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0008] The technical problem to be solved by the present invention is to provide a method and device for voice adaptive discontinuous transmission, which overcomes the problems in the prior art that the fixed interval method cannot flexibly track signal changes, and the variable interval method must have linear prediction, etc. The calculation of multiple parameters leads to the disadvantage of high computational complexity

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment approach 1

[0040] In Embodiment 1, the silence insertion description frame processing unit is further configured to determine that the absolute value of the spectrum energy of the speech signal frame and / or the absolute value of the spectrum energy of the last silence insertion description frame is greater than a single frame energy threshold and the When the difference between the spectrum energy of the speech signal frame and the spectrum energy of the previous silence insertion description frame is greater than a preset limit 1, the silence insertion description frame is sent.

[0041] The silence insertion description frame processing unit is also used to judge that the absolute value of the spectrum energy of the speech signal frame and / or the absolute value of the spectrum energy of the last silence insertion description frame is greater than the single frame energy threshold and the speech signal frame The difference between the spectrum energy and the spectrum energy of the last s...

Embodiment approach 2

[0044] In Embodiment 2, when the silence insertion description frame processing unit is used to judge that the absolute value of the spectrum energy of the speech signal frame and / or the absolute value of the spectrum energy of the last silence insertion description frame is greater than the single frame energy threshold, according to Calculate the spectral correlation value of the current speech signal frame and the spectrum energy of the last silence insertion description frame, and send the silence insertion description frame when it is judged that the spectrum correlation value is smaller than the spectrum correlation threshold.

Embodiment approach 3

[0045] In the third embodiment, the silence insertion description frame processing unit is used to simultaneously determine whether to send the silence insertion description frame based on the difference between the spectrum energy and the spectrum correlation value of the two.

[0046] Such as figure 2 As shown, the device may also include a smoothing filter unit; the smoothing filter unit is used to smooth and filter the frequency domain signal of the voice signal, and then input it to the silence insertion description frame processing unit, and the silence insertion description frame processing unit performs smoothing processing After the frequency domain signal is subjected to the above processing, the silence insertion description frame storage unit also needs to store the smoothed frequency domain signal.

[0047] The method for performing speech adaptive discontinuous transmission includes: in performing speech adaptive discontinuous transmission, deciding whether to s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A method and an apparatus for performing voice adaptive discontinuous transmission. The method comprises: when voice adaptive discontinuous transmission is performed, determining whether to send a silence insertion descriptor frame according to frequency spectrum information of a current voice signal frame and frequency spectrum information of a previous silence insertion descriptor frame. This scheme may overcome disadvantages that in related technologies, with a fixed interval manner, signal changes cannot be flexibly tracked, and with a variable interval manner, necessary multi-parameter calculation such as linear prediction results in high calculation complexity. This scheme is directly performed in a frequency domain, and can well track signal changes, thereby ensuring the tone quality at the same time of maintaining a low average code rate.

Description

technical field [0001] The present invention relates to the field of digital signal processing, in particular to a method and device for voice adaptive discontinuous transmission (Discontinuous Transmission, DTX for short). Background technique [0002] In the actual user communication process, under normal circumstances, less time is used to transmit user voice, and more time is used to transmit non-voice background sound. If the entire communication process is encoded according to the encoding method of the voice signal, a great waste of resources will be caused. In order to reduce this waste in the prior art, the sending end uses the Voice Activity Detector (VAD) algorithm for signal detection. The important information of the signal is encoded, that is, the signal is encoded into a Silence Insertion Description (Silence Insertion Descriptor, SID for short) frame, and the SID frame is sent in a discontinuous manner. The decoding end performs decoding in a manner of comf...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L19/012G10L19/02

CPCG10L19/012

Inventor 顾彩霞袁浩江东平黎家力

Owner ZTE CORP

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A method and device for voice adaptive discontinuous transmission

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment approach 1

Embodiment approach 2

Embodiment approach 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology