Speech enhancement method, device, storage medium and electronic equipment

A speech enhancement and speech synthesis technology, applied to speech enhancement methods, devices, storage media and electronic equipment. It addresses two problems: whispered speech data is difficult for machines to recognize accurately, and the pronunciation data has low intelligibility. The effects are increased intelligibility, easier interaction between users, easier machine recognition, and avoidance of repeated decoding.

Active Publication Date: 2022-05-17
BEIJING BYTEDANCE NETWORK TECH CO LTD
Cites: 6. Cited by: 0.

AI Technical Summary

Problems solved by technology

Voice data obtained when users whisper is difficult for machines to recognize accurately.
In addition, the pronunciation of users with damaged vocal cords is close to a whisper, so the intelligibility of their pronunciation data is low during interaction.

Detailed Description of Embodiments

[0024] Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the drawings, it should be understood that the disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are for exemplary purposes only and are not intended to limit the protection scope of the present disclosure.

[0025] It should be understood that the various steps described in the method embodiments of the present disclosure may be executed in different orders and/or in parallel. Additionally, method embodiments may include additional steps and/or omit some of the illustrated steps. The scope of the present disclosure is not limited in this regard. ...

Abstract

The present disclosure relates to a speech enhancement method, device, storage medium and electronic equipment. The method comprises: acquiring whisper data to be processed; processing the whisper data through a speech enhancement model to obtain acoustic feature information corresponding to the whisper data, wherein the speech enhancement model includes an encoding sub-model and a decoding sub-model, the encoding sub-model encodes the whisper data to obtain target encoding information, and the decoding sub-model decodes the target encoding information using a stepwise monotonic attention mechanism to obtain the acoustic feature information; and performing speech synthesis according to the acoustic feature information to obtain audio information corresponding to the whisper data. In this way, the whisper data can be enhanced, which increases its intelligibility and facilitates both interaction between users and machine recognition. In addition, the amount of data processing can be reduced, the processing efficiency of the speech enhancement method can be improved, and the user experience can be further improved.
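
The decoding step named in the abstract can be illustrated with a minimal sketch. The abstract does not state the attention update rule, so the recurrence below is an assumption: it follows the commonly published form of stepwise monotonic attention, in which at each decoder step the attention either stays on the current encoder position or advances by exactly one position. The function names `stepwise_monotonic_step` and `attend`, and the use of a sigmoid over per-position energies, are hypothetical and are not taken from the patent text.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic function mapping an energy to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def stepwise_monotonic_step(prev_alignment, energies):
    """One decoder step of an (assumed) stepwise monotonic attention update.

    prev_alignment[j]: probability that the previous decoder step attended
        to encoder position j.
    energies[j]: unnormalized score for position j at this decoder step
        (hypothetical input; a real model would compute it from the decoder
        state and the target encoding information).
    """
    # Probability of staying on position j rather than moving to j + 1.
    stay = [sigmoid(e) for e in energies]
    new_alignment = []
    for j in range(len(prev_alignment)):
        # Mass that was on j and stays there.
        mass = prev_alignment[j] * stay[j]
        # Mass that was on j - 1 and advances by exactly one position.
        if j > 0:
            mass += prev_alignment[j - 1] * (1.0 - stay[j - 1])
        new_alignment.append(mass)
    return new_alignment


def attend(alignment, encoded):
    """Context vector: alignment-weighted sum of the encoder outputs."""
    dim = len(encoded[0])
    return [sum(a * h[d] for a, h in zip(alignment, encoded)) for d in range(dim)]


# Toy usage: four encoder frames with one feature each.
encoded = [[1.0], [2.0], [3.0], [4.0]]
alignment = [1.0, 0.0, 0.0, 0.0]  # decoding starts on the first frame
for _ in range(2):
    alignment = stepwise_monotonic_step(alignment, [0.0, 0.0, 0.0, 0.0])
context = attend(alignment, encoded)
```

Because the attention can only stay or move forward by one position, each encoder frame is visited in order and none is skipped. This is one plausible reading of how the approach avoids repeated decoding, but the abstract itself does not spell out the mechanism.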

Description

Technical field

[0001] The present disclosure relates to speech synthesis technology, and in particular to a speech enhancement method, device, storage medium and electronic equipment.

Background

[0002] In scenarios where loud noise is prohibited, normal voice conversation is inconvenient for users, so some users choose to interact by whispering. However, the voice data obtained in this way is difficult for machines to recognize accurately. In addition, the pronunciation of users with damaged vocal cords is close to a whisper, so the intelligibility of their pronunciation data is low during interaction.

Summary of the invention

[0003] This Summary is provided to introduce, in simplified form, concepts that are described in detail later in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed technical solution, ...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L21/02, G10L19/16, G10L13/04
CPC: G10L21/02, G10L19/16, G10L13/04
Inventor: 殷翔
Owner: BEIJING BYTEDANCE NETWORK TECH CO LTD