Nonnegative matrix decomposition method for speech signal characteristic waveform

A technology of non-negative matrix decomposition and characteristic waveform, which is applied in the field of speech signal processing and can solve problems such as low complexity and high precision

Inactive Publication Date: 2006-11-15
BEIJING UNIV OF TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In order to solve the above problems, the present invention provides a non-negative matrix decomposition method of the characteristic waveform of the speech signal. The problem to be solved by the method is to achieve high precision and low complexity when decomposing the characteristic waveform of the speech signal in the waveform interpolation speech encoder degree and without additional delay for the purpose of eigenwave decomposition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Nonnegative matrix decomposition method for speech signal characteristic waveform
  • Nonnegative matrix decomposition method for speech signal characteristic waveform
  • Nonnegative matrix decomposition method for speech signal characteristic waveform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0085] In the specific implementation manner of the present invention, how to use the standard non-negative matrix decomposition method to train the base matrix and how to obtain the coding matrix are respectively discussed below.

[0086] A. Training base matrix.

[0087] a. At first, the voice feature waveform is divided into 9 categories according to the size of the pitch period (pitch) of the frame voice signal, and the classification basis is as shown in Table 1:

[0088] Class 1

20≤pitch<30

Class 2

30≤pitch<40

Class 3

40≤pitch<50

Class 4

50≤pitch<60

Class 5

60≤pitch<70

Class 6

70≤pitch<80

Class 7

80≤pitch<90

Class 8

90≤pitch<100

Class 9

100≤pitch≤120

[0089] Table 1 Classification of characteristic waveforms

[0090] B, then all select about 10000 frames of experimental samples for each type of characteristic waveform, form matrix V, according to the iterative ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to a nonnegative matrix decomposition method of speech signal characteristic waveform, belonging to speech signal processing technology. Said method includes the following several steps: firstly, utilizing fundamental tone pitch of speech signal to divide the speech characteristic waveform into 9 classes, for every class of characteristic waveform utilizing iteration method of standard nonnegative matrix decomposition to train out base matrix W, then for given a frame characteristic waveform utilizing its fundamental tone pitch to make subsumption, then taking out the trained base matrix W correspondent to said class of characteristic waveform, and utilizing iteration method to obtain code matrix H correspondent to said frame characteristic waveform, so that said frame characteristic waveform can be approximately decomposed into the product of base matrix W and code matrix H.

Description

technical field [0001] The invention relates to a method for non-negative matrix decomposition of characteristic waveforms of speech signals, which belongs to the field of speech signal processing. Background technique [0002] With the rapid development of wireless mobile communication, secure telephone communication and network VoIP communication system, people's demand for high-quality speech coding technology below 4kb / s is increasing day by day. At present, the international speech coding models for rates below 4kb / s mainly include binary excitation linear prediction model (LPC-10-Linear Prediction Coding-10), mixed excitation linear prediction model (MELP-Mixed Excitation Linear Prediction), multi-band excitation model (MBE-Muliti-BandExcitation) and waveform interpolation model (WI-Waveform Interpolation). These models are all based on the source-system model of the speech signal, that is, the excitation source signal (simulating the airflow from the lungs) is used t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L19/00
Inventor 鲍长春张鹏
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products