Voice conversion method and device with emotion and rhythm
A speech conversion and prosody technology, applied in speech analysis, neural learning methods, biological neural network models, etc., can solve the problems of low naturalness of speech, complicated extraction engineering, limited speech conversion effect, etc., and achieve high speech quality and high similarity. degree of effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0054] For ease of understanding, in this embodiment, the source speaker can be understood as himself, and the target speaker can be understood as a celebrity. This invention is used to transform one's own voice into that of a certain star.
[0055] This embodiment discloses a voice conversion method with emotion and rhythm, including a training phase and a conversion phase, such as figure 1 As shown, the training phase includes the following steps:
[0056] S11. Obtain training corpus of multiple speakers, including a source speaker and a target speaker;
[0057]Optionally, some existing high-quality public data sets can be used as training corpus, such as VCTK, LibriSpeech, etc., or self-recorded voice data containing multiple speakers.
[0058] S12. Performing acoustic feature extraction on the acquired training corpus;
[0059] Optionally, extract the Mel spectrum features from the training corpus. Specifically, the parameters are selected as follows: the window size is...
Embodiment 2
[0093] The voice conversion device with emotion and rhythm described in the embodiment of the present invention includes:
[0094] The acoustic feature extraction module is used for extracting acoustic features from the input speech.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com