Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Underdetermined blind source separation (UBSS) method based on maximum matrix diagonal rate

An underdetermined blind separation and diagonal rate technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as poor separation quality, inconvenient processing of 0 norm, and noise sensitivity

Inactive Publication Date: 2012-07-11
DALIAN UNIV OF TECH
View PDF4 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The advantage of this method is that the traditional overdetermined separation technology can be used. The disadvantage is that the separation effect depends on the quality of the newly constructed observation signal. Once the signal is not well constructed, the separation quality will deteriorate.
The two-step method is actually an extension of the basis tracking method. It obtains the optimal solution by solving the linear equation and constraining the solution. Using the sparsity of the signal, it minimizes the 0 norm and then constrains the solution. The 0 norm is very easy to deal with. Inconvenient and especially sensitive to noise
In 1999, D.L.Donoho demonstrated the equivalence of using the smallest norm of 1 and the smallest norm of 0 under certain conditions. The norm of 1 is easier to deal with than the norm of 0. The optimal solution can be easily obtained by using linear programming, and the anti-noise performance Although it is better than the 0-norm criterion, the effect is still unsatisfactory. In addition, the algorithm is based on the sparsity of the signal in the time domain, so the general separation effect is very poor.
The time-frequency masking method was first proposed by Sam T.Roweis in 2000. In 2004, Yilmaz and Rickard combined the DUET algorithm to further develop the time-frequency masking algorithm. However, this type of method strictly requires that the source signal approximately satisfies W-dislocation orthogonality, harsh conditions
In 2005, Abrard proposed the TIFROM algorithm. This method requires that only one source signal exists in the observation signals of several adjacent time-frequency windows. The length of the time-frequency window is difficult to determine, and it is impossible to search for all single-source time-frequency frequency domain
In 2011, Zhou Guoxu and others proposed a new method - nonlinear projection and column masking (NPCM) to estimate the mixing matrix when the signal is not strictly sparse; Lu Fengbo et al. based on the matrix diagonal realizes underdetermined blind source separation under weak sparse conditions, but this method can only deal with uncorrelated signals

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Underdetermined blind source separation (UBSS) method based on maximum matrix diagonal rate
  • Underdetermined blind source separation (UBSS) method based on maximum matrix diagonal rate
  • Underdetermined blind source separation (UBSS) method based on maximum matrix diagonal rate

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0121] Select 4 voice signals of different speakers from the voice library as the source signal, take 50000 points respectively, and perform linear instantaneous mixing in the time domain. The mixing matrix is ​​as follows:

[0122] A = 0.7875 0.6368 0.6117 0.4277 0.4161 0.4792 0.7146 0.7530 0.4545 0.6039 0.3391 0.5000

[0123] The absolute values ​​of the determinants of the respective sub-matrices of the mixing matrix A are 0.0744, 0.0697, 0.0838, and 0.0815, respectiv...

Embodiment 2

[0129] One of the advantages of the present invention is that it lowers the requirement on the statistical characteristics of the source signal, and enables underdetermined separation of related source signals. Select one voice signal from the voice library, and then take different time periods of the voice to form four related source signals. The time-domain waveforms of the original four-way correlation signals are attached Figure 4 As shown in (a), the time-domain waveforms of the three-way observation signals after linear instantaneous mixing are shown in the attached Figure 4 Shown in (b), the time-domain waveforms of the four-way signals separated by the present invention are as attached Figure 4 (c) shown. The output signal-to-noise ratio of the separated signals is shown in Table 2.

[0130] Table 2 Four-channel correlation speech output SNR

[0131]

Embodiment 3

[0133] The invention can better solve the underdetermined separation of weak and sparse signals. The advantages of the present invention will be described below by taking white Gaussian noise with poor sparsity as an example. Select one path of Gaussian white noise and three paths of different speech signals from the speech library. The original time-domain waveforms of one channel of Gaussian white noise and three channels of speech signals are attached Figure 5 As shown in (a), the time-domain waveforms of the three-way observation signals after linear instantaneous mixing are shown in the attached Figure 5 Shown in (b), the four-way signal time-domain wave form that the present invention separates is as attached Figure 5 (c) shown. The output signal-to-noise ratio of the separated signals is shown in Table 3.

[0134] Table 3 One-way noise, three-way voice output signal-to-noise ratio SNR

[0135]

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an underdetermined blind source separation (UBSS) method based on a maximum matrix diagonal rate. The method comprises the following steps of: constructing inverse matrixes of C*M / N M*M-dimensional sub matrixes of a mixed matrix (wherein M and N are respectively the number of sensors and the number of source signals); multiplying the inverse matrixes by observation signal vectors to acquire initial estimation signal vectors; and sequentially calculating the covariance matrix, the solid matrix, the absolute value matrix and the diagonal rate of each initial estimation signal vector, selecting the initial estimation signal vector corresponding to the maximum diagonal rate as estimation of a source signal vector, and thus realizing underdetermined separation of sourcesignals. By the method, the requirement for source signal sparseness is reduced, aliasing of road source signals is allowed at each time frequency point at most, and the underdetermined separation problem of music signals and noise signals is solved. The requirement for the statistical property of the source signals is low, and the underdetermined separation problem of Gaussian signals and related signals is solved. In addition, by the method, processing of each time frequency point and each sub matrix can be executed in parallel, and hardware implementation is facilitated.

Description

technical field [0001] The invention relates to a method for blindly separating instantaneous mixed signals under the condition of underdetermination. The method can separate sparse, weakly sparse or correlated signals, and can be applied in the fields of signal processing, biomedicine and communication. Background technique [0002] Blind Source Separation (BSS) is a technology that determines a transformation based on the observed mixed data vector to restore the original signal or source. Typically, the observed data vector is the output of a set of sensors, where each sensor receives a different combination of source signals. The term "blind" has two meanings: a. the source signal cannot be observed; b. how the source signal mixes is unknown. When the number of source signals is more than the number of observation signals, it is an underdetermined blind separation (UBSS) problem, which is closer to practical applications, and it is also a technical difficulty of linear...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/02G10L15/20G10L19/02G10L21/0224
Inventor 马晓红魏亮生
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products