Neural network vocoder training method based on short-time spectrum consistency
A neural network and short-time spectrum technology, applied in the field of speech signal processing, to improve the quality of synthesized speech, improve inconsistency, and improve quality.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0052] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.
[0053]The neural network vocoder training method based on short-term spectrum consistency provided by the present invention is applied to the neural network vocoder HiNet for hierarchical prediction of amplitude-phase spectrum, and is used to alleviate the existence of short-term spectrum combining the predicted amplitude spectrum and phase spectrum inconsistency problem. The HiNet vocoder consists of a magnitude spectrum predictor and a phase spectrum predictor.
[0054] Since the magnitude spectrum and phase spectrum of the HiNet vocoder are predicted separately, it is difficult for the short-time spectrum composed of the two to meet the consistency condition, that is, the composed short-time spectrum falls outside the ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com