Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-person speech separation method and system for chatting robot

A chat robot and multi-person voice technology, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as large amount of calculation

Inactive Publication Date: 2018-11-30
BEIJING UNION UNIVERSITY
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The present invention mainly solves the problem of the initial value sensitivity of the FastICA algorithm and the problem of a large amount of calculation when separating multiple people's mixed voices, introduces the negative gradient descent method, overcomes the initial value sensitivity, and enhances the convergence stability of the algorithm; proposes an improved difference quotient method to replace FastICA's optimization algorithm - Newton's method, avoids the problem of large amount of calculation caused by derivation and Jacobian matrix calculation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-person speech separation method and system for chatting robot
  • Multi-person speech separation method and system for chatting robot
  • Multi-person speech separation method and system for chatting robot

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0071] like figure 1 , 2 As shown, step 100 is executed, and the voice collection module 200 uses a microphone to collect voice signals to obtain a mixed signal. Step 110 is executed, the speech collection module 200 performs preprocessing on the mixed signal x. Execute step 111, perform centralized processing on the mixed signal x,

[0072]

[0073] Among them, i=1.....n, n is a real coefficient. Execute step 112, carry out PCA whitening process,

[0074] z=Vx=ED -1 / 2 E. T x

[0075] Among them, V is the whitening matrix, E is the orthogonal matrix composed of the eigenvectors of the centralized data, D is the diagonal matrix composed of the eigenvalues ​​corresponding to the eigenvectors, and E T Transpose the matrix for E.

[0076] Step 120 is executed, and the unmixing matrix generating module 210 randomly generates an unmixing matrix w.

[0077] Execute step 130, using the negative gradient descent method to find the negative gradient direction of the unmixing...

Embodiment 2

[0089] The purpose of this application is to solve the initial value sensitivity problem of the FastICA algorithm and the problem of a large amount of calculation when separating multiple people's mixed voices, and proposes a FastICA voice separation method based on the negative gradient descent method and the improved difference quotient method: (1) introduce The negative gradient descent method overcomes the initial value sensitivity and enhances the convergence stability of the algorithm; (2) An improved difference quotient method is proposed to replace FastICA's optimization algorithm - Newton's method, to avoid the large amount of calculation caused by derivation and Jacobian matrix calculation question.

[0090] Independent component analysis (ICA) is a representative method for dealing with multi-person mixed speech similar to the "cocktail party problem". The ICA method assumes that each source signal is statistically independent. According to the statistical character...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a multi-person speech separation method and system for a chatting robot. The multi-person speech separation method comprises the step that a microphone is used for collecting speech signals to obtain a mixed signal and further comprises the following steps that the mixed signal x is preprocessed; a demixing matrix w is generated randomly; the negative gradient direction of the demixing matrix w is solved through a negative gradient descent method; whether the demixing matrix w is converged or not is judged; the optimal demixing matrix is solved through an improved difference quotient method; an estimation signal of a source signal is solved; and separated speech is output. According to the multi-person speech separation method and system for the chatting robot, the problem of initial value sensitivity of a FastICA algorithm and the problem that the calculation amount is large when multi-person mixed speech is separated are solved, the negative gradient descent method is introduced, thus the initial value sensitivity is overcome, and algorithm convergence stability is improved; and the improved difference quotient method is put forward to replace the FastICA optimizing algorithm, namely a Newton method, and the problem of large calculation amount caused by derivation and Jacobian matrix calculation is avoided.

Description

technical field [0001] The invention relates to the technical field of digital signal processing and computer hearing, in particular to a multi-person voice separation method and system for a chat robot. Background technique [0002] With the rapid development of the Internet and artificial intelligence technology, our way of life and work have changed a lot. The best proof is the wide application of intelligent voice technology in human-computer interaction. In real life, the speech signal of interest is often disturbed by various problems such as ambient background noise from other sources, speech from other speakers, and reverberation from surface reflections. These problems will greatly reduce the intelligibility of speech, and lead to a decline in the performance of subsequent speech recognition and voiceprint recognition. In view of the influence of background noise and other people interfering with speech, it is necessary to perform multi-person mixed speech separati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L21/0272G10L15/22
CPCG10L15/22G10L21/0272
Inventor 刘宏哲张启坤
Owner BEIJING UNION UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products