Many-to-many voice conversion method and system based on speaker style feature modeling
A speech conversion and speaker technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of inability to provide, information loss and noise, lack of speaker identity information, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0076] The following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
[0077] The present invention proposes a many-to-many speech conversion method based on speaker style feature modeling, which is to add a multi-layer perceptron and a style encoder to the traditional StarGAN neural network to achieve effective extraction and constraints on the speaker's style features. Using the speaker style feature instead of the speaker label feature overcomes the shortcomings of the limited speaker information carried by the ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com