Voice extraction method, device and equipment

A speech extraction technology for mixed speech, applied in speech analysis, speech recognition, instruments, etc., that reduces structural detail and parameter count, shortens training time, and improves user experience

Pending Publication Date: 2021-12-03
TSINGHUA UNIV
0 Cites, 1 Cited by

AI Technical Summary

Problems solved by technology

[0004] The purpose of the embodiments of this specification is to provide a speech extraction method, device, and equipment that address how to reduce the scale of the speech extraction model so that speech can be extracted quickly and effectively.

Examples

Embodiment Construction

[0015] The following will clearly and completely describe the technical solutions in the embodiments of the present specification in combination with the drawings in the embodiments of the present specification. Obviously, the described embodiments are only some of the embodiments of the present specification, not all of them. Based on the embodiments in this specification, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the protection scope of this specification.

[0016] To solve the above technical problems, a speech extraction method according to an embodiment of this specification is introduced. The method is executed by a voice extraction device, which includes but is not limited to a server, an industrial computer, a PC, and the like. As shown in Figure 1, the speech extraction method may include the following specific implementation ...

Abstract

The embodiments of the invention provide a voice extraction method, device, and equipment. The method comprises the following steps: acquiring mixed voice sample data, wherein the mixed voice sample data comprises a target voice signal and at least one of a noise signal, an interfering voice signal, and a reverberation signal; training a preset voice separation model on the mixed voice sample data to obtain a pre-trained voice separation model; constructing a strategy (policy) network and an evaluation network based on the pre-trained voice separation model, each with its own network parameters; determining a target quantization strategy based on those network parameters; updating the pre-trained voice separation model with the target quantization strategy to obtain a voice extraction model; and extracting the voice signal of a target object from to-be-processed voice data using the voice extraction model. Because the method reduces the scale of the voice extraction model, the voice of the target object in single-channel speech can be separated quickly and effectively.
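The model-update step in the abstract can be illustrated with a small sketch. It assumes that the target strategy assigns a bit width to each layer of the pre-trained model and that weights are quantized with a symmetric uniform quantizer; the source confirms neither detail, and the search carried out by the strategy and evaluation networks is not shown. The names `quantize_uniform`, `apply_strategy`, and `model_bits`, as well as the toy layer names, are hypothetical.

```python
def quantize_uniform(weights, bits):
    """Symmetric uniform quantization (an assumed quantizer, not from the source).

    Returns the de-quantized float values, i.e. each weight snapped to the
    nearest of the representable levels for the given bit width.
    """
    if bits < 2:
        raise ValueError("bits must be at least 2 for a signed quantizer")
    levels = 2 ** (bits - 1) - 1          # e.g. 127 for 8 bits
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0:
        return list(weights)              # all-zero layer: nothing to scale
    scale = max_abs / levels              # step size between adjacent levels
    return [round(w / scale) * scale for w in weights]


def apply_strategy(model, strategy):
    """Quantize every layer with the bit width the strategy assigns to it."""
    return {name: quantize_uniform(w, strategy[name]) for name, w in model.items()}


def model_bits(model, strategy):
    """Total storage in bits for the weights under a given strategy."""
    return sum(len(w) * strategy[name] for name, w in model.items())


# Toy "pre-trained model": layer name -> flat list of weights (hypothetical).
pretrained = {"encoder": [0.5, -1.0, 0.25], "decoder": [0.1, 0.2]}
# Hypothetical strategy: more bits for the encoder, fewer for the decoder.
strategy = {"encoder": 8, "decoder": 4}

extraction_model = apply_strategy(pretrained, strategy)
full_precision_bits = sum(len(w) * 32 for w in pretrained.values())
compressed_bits = model_bits(pretrained, strategy)
```

In this toy case the weights shrink from `full_precision_bits` (32 bits per weight) to `compressed_bits`, which is the sense in which a per-layer strategy reduces the scale of the model; choosing good bit widths is the part the abstract delegates to the strategy and evaluation networks.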

Description

Technical field

[0001] The embodiments of this specification relate to the technical field of speech signal processing, and in particular to a speech extraction method, device, and equipment.

Background

[0002] With the development of technologies such as computers and artificial intelligence, automatic speech recognition on smart devices has come into wide use. In practice, while a smart device collects the voice of the target object, it often also picks up the voices of other people, environmental noise, and other interfering signals. Therefore, before speech recognition is performed, the speech signal corresponding to the target object must be extracted from the acquired speech signals.

[0003] Currently, when processing multi-channel speech signals, speech extraction can be performed by comparing the speech signals of different channels. However, when processing single-channel speech signals, it is more difficult to directly extract ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/02, G10L15/06, G10L21/0272
CPC: G10L15/02, G10L15/063, G10L21/0272
Inventor: 史慧宇, 尹首一, 韩慧明, 刘雷波, 魏少军
Owner: TSINGHUA UNIV