Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice enhancement method and device based on convolutional neural network, equipment and medium

A convolutional neural network and speech enhancement technology, which is applied in the field of speech enhancement based on convolutional neural network, can solve the problem of low computational efficiency and accuracy of speech enhancement models, and achieve the goal of refining image features, enhancing speech, and improving accuracy. Effect

Pending Publication Date: 2021-09-03
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Embodiments of the present invention provide a method, device, device, and medium for speech enhancement based on convolutional neural networks to solve the problems of low computational efficiency and low accuracy of current speech enhancement models

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice enhancement method and device based on convolutional neural network, equipment and medium
  • Voice enhancement method and device based on convolutional neural network, equipment and medium
  • Voice enhancement method and device based on convolutional neural network, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0031] The speech enhancement method based on convolutional neural network can be applied in such as figure 1 An application environment in which a computer device communicates with a server over a network. Computing devices may include, but are not limited to, various personal computers, laptops, smartphones, tablets, and portable wearable devices. The server can be implemented as an independent server.

[0032] In one embodiment, as figure 2 Show...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of artificial intelligence, and particularly relates to a speech enhancement method and device based on a convolutional neural network, equipment and a medium. The speech enhancement method based on the convolutional neural network comprises the following steps: acquiring a time domain oscillogram of speech to be denoised and a speech enhancement model, wherein the speech enhancement model comprises a Gabor convolution layer, a simple recursion layer, a feature masking layer and a deconvolution layer which are connected in sequence; carrying out Gabor transformation on the time domain oscillogram through a complex filter, and extracting Gabor transformation features; inputting the Gabor transformation features into a simple recursion layer for prediction so as to determine a masking vector corresponding to a feature masking layer; filtering the Gabor transformation features according to the masking vector through the feature masking layer to obtain denoised Gabor transformation features; and restoring the denoised Gabor transformation features through a deconvolution layer to obtain a target denoised voice. According to the speech enhancement method based on the convolutional neural network, the model calculation efficiency and accuracy can be effectively improved.

Description

technical field [0001] The present invention relates to the technical field of artificial intelligence, in particular to a speech enhancement method, device, equipment and medium based on a convolutional neural network. Background technique [0002] Speech enhancement refers to a technology that enhances the quality and clarity of useful speech signals and suppresses and reduces noise interference when speech signals are interfered or even submerged by various noises. Due to the simple design process, the end-to-end neural network model is widely used in the field of speech enhancement, but most of the current research does not effectively consider the local and sequential characteristics of speech, resulting in the computational efficiency and accuracy of the current speech enhancement model. . Contents of the invention [0003] Embodiments of the present invention provide a speech enhancement method, device, device, and medium based on a convolutional neural network, so...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0224G10L21/0216G10L21/0264G10L25/30
CPCG10L21/0224G10L21/0216G10L21/0264G10L25/30Y02T10/40
Inventor 张之勇王健宗
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products