Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

SCSS (Single Channel Speech Separation) algorithm based on DNN (Deep Neural Network)

A deep neural network and speech separation technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of speech separation performance impact, time-consuming, poor separation effect, etc., to achieve improved intelligibility, high separation efficiency, and reduced Effect of Distortion Rate

Active Publication Date: 2019-12-31
NANJING UNIV OF POSTS & TELECOMM
View PDF8 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When using a single-output DNN to separate mixed voices, only one voice can be separated at a time. When using this method to separate multiple voices, it takes a long time; the traditional method based on a multi-output deep neural network can separate multiple voices at the same time. However, for this reason, the output mapped by the multi-output DNN is not as targeted as the single-output DNN, and the separation effect is worse than that of the single-output DNN.
Both of the above two deep neural networks need to be trained by a loss function. The basic loss function used by the traditional dual-output DNN is only used to map the relationship between input and output, but ignores the joint relationship between outputs, and this joint relation has a large impact on the final speech separation performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • SCSS (Single Channel Speech Separation) algorithm based on DNN (Deep Neural Network)
  • SCSS (Single Channel Speech Separation) algorithm based on DNN (Deep Neural Network)
  • SCSS (Single Channel Speech Separation) algorithm based on DNN (Deep Neural Network)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0046] Such as figure 1 As shown, the present invention provides a kind of single channel speech separation algorithm based on deep neural network, mainly comprises the following steps:

[0047] Step 1: Preprocess the training speech samples and extract their feature information;

[0048] Step 2: Use the loss function to train the deep neural network to obtain the deep neural network model;

[0049] Step 3: Preprocess the speech sample to be tested, extract its feature information, and conduct speech separation through the trained deep neural network model, and then obtain the separation result through speech reconstruction.

[0050] Step 1-Step 3 will be described in detail below.

[0051] Among them, step 1 specifically includes:

[0052] St...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an SCSS (Single Channel Speech Separation) algorithm based on a DNN (Deep Neural Network). The SCSS algorithm based on the DNN mainly comprises the following steps of preprocessing a training speech sample, and extracting feature information of the training speech sample; training the DNN by using a loss function so as to obtain a DNN model; preprocessing a speech sample tobe tested; extracting feature information of the speech sample to be tested; performing speech separation through the trained DNN model; and then obtaining a separation result through speech reconstruction. The DNN is trained by using the nonlinear relationship between input and output. Compared with a conventional separation method based on single-output DNN, the SCSS algorithm has the advantagesthat the combined relationship between the output is sufficiently mined; the separation efficiency is high; two source speech signals can be separated in one step; the voice distortion rate is effectively reduced; and meanwhile, the understandability of the separation speech is improved.

Description

technical field [0001] The invention relates to a single-channel speech separation algorithm based on a deep neural network, belonging to the field of speech separation. Background technique [0002] Single channel speech separation (Single channel speech separation, SCSS) is the process of recovering multiple speech from one-dimensional mixed speech. Single-channel speech separation technology is widely used in speech enhancement, preprocessing of speech recognition, hearing aids or smart home and other fields. In these fields, the sensor usually receives a mixed voice from a microphone, and the human ear can easily obtain useful information from this mixed voice, but it is difficult for a computer to accurately obtain the desired voice. Therefore, it has very important practical significance to obtain the target speech accurately and efficiently. [0003] Deep neural network (DNN) has powerful data mining capabilities. In the field of speech separation, it is mainly used...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0272G10L15/02G10L15/06G10L15/16
CPCG10L21/0272G10L15/02G10L15/063G10L15/16
Inventor 孙林慧朱阁傅升邹博
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products