Robust monolingual sound source DOA estimation method based on acoustic vector sensor and bispectrum transformation

A vector sensor and bispectral transformation technology, applied in the field of robust monolingual sound source DOA estimation, can solve the problems of high computational complexity, large microphone array aperture, and degraded DOA estimation performance, and achieve the effect of low algorithm complexity

Active Publication Date: 2016-06-15
PEKING UNIV SHENZHEN GRADUATE SCHOOL
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the actual complex acoustic environment, when the microphone collects voice signals, it will inevitably be interfered by factors such as ambient noise, transmission medium noise, internal electrical noise of communication equipment, and room reverberation. Therefore, the quality of the acquired voice decreases, resulting in DOA Estimate performance degradation
The DOA technology based on the traditional microphone array has the limitations of large microphone array aperture, spatial aliasing, and high computational complexity, which limits the application of this type of technology on small mobile devices (robots).

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Robust monolingual sound source DOA estimation method based on acoustic vector sensor and bispectrum transformation
  • Robust monolingual sound source DOA estimation method based on acoustic vector sensor and bispectrum transformation
  • Robust monolingual sound source DOA estimation method based on acoustic vector sensor and bispectrum transformation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0025] Assuming that there is only one speaker's sound source in the space, the present invention adopts 8kHz sampling rate to obtain the bispectral data to the AVS four-channel output signal ((1)-(4)), which is expressed as follows:

[0026]

[0027]

[0028]

[0029]

[0030] Define the BISDR value as follows:

[0031]

[0032]

[0033]

[0034] where I uo (Ω 1 ,Ω 2 ), I vo (Ω 1 ,Ω 2 ) and I wo (Ω 1 ,Ω 2 ) are the BISDR values ​​of u channel and o channel, v channel and o channel, w channel and o channel respectively. with I uo (Ω 1 ,Ω 2) as an example, the relationship between BISDR and DOA of the speaker's sound source is derived as follows. Substitute (5) and (8) into (9) to get

[0035] I uo (Ω 1 ,Ω 2 ) = u s +ε u (Ω 1 ,Ω 2 )(12)

[0036] in

[0037]

[0038] In the same way:

[00...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a robust monolingual sound source DOA estimation method based on an acoustic vector sensor and bispectrum transformation. The method adopts an acoustic vector sensor (AVS), realizes four-channel voice signal acquisition, performs bispectrum transformation on collected data, and obtains corresponding bispectrum data. A trigonometric function relation between AVS receiving components on bispectrum is utilized to calculate a data ratio between AVS sensor components. A bispectrum mask spectrum is calculated, frequency points with high signal to interference ratios are extracted, a KDE method is utilized to cluster the points, and on this basis, the direction of arrival (DOA) of a voice sound source is obtained by calculation. The method provided by the invention can estimate the DOA of the monolingual sound source with high precision under the conditions of different room reverberation, interference noise types and interference noise intensity. In addition, the AVS sensor used by the method which is only 1cm3 in size is very suitable for voice technology application on a portable device.

Description

technical field [0001] The invention relates to a robust monolingual sound source DOA estimation method based on an acoustic vector sensor and bispectral transformation, and belongs to the technical field of human-computer voice interaction. Background technique [0002] Direction of Arrival (DOA) estimation of spatial speakers is a key technology of human-computer interaction for intelligent robots and has great application value. It has always been a research hotspot in the field of array signal processing. In the actual complex acoustic environment, when the microphone collects voice signals, it will inevitably be interfered by factors such as ambient noise, transmission medium noise, internal electrical noise of communication equipment, and room reverberation. Therefore, the quality of the acquired voice decreases, resulting in DOA Estimated performance degradation. The DOA technology based on the traditional microphone array has the limitations of large microphone arra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G01S3/78
CPCG01S3/78
Inventor 邹月娴金彦含
Owner PEKING UNIV SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products