Front end design-based speech synthesis method
A speech synthesis and phoneme technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of data dependence and uncontrollable synthesis effect.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Examples
preparation example Construction
[0029] The speech synthesis method based on front-end design of the present invention comprises the following steps:
[0030] Step 1, preprocessing the Chinese text data;
[0031] Step 2, extracting the relevant linguistic features of the Chinese text;
[0032] Step 3, extracting at least two acoustic features of the audio file;
[0033] Step 4, training the duration model and the acoustic model according to the linguistic features and the acoustic features;
[0034] Step 5. After processing the Chinese text to be synthesized in step 1 and step 2, call the duration model obtained in step 4 to obtain the duration information corresponding to the text, and then combine linguistic features and duration information as the input of the acoustic model to obtain the corresponding acoustic characteristics;
[0035] Step 6. Using a vocoder to synthesize corresponding audio data for the acoustic features obtained in step 5.
Embodiment
[0037] The embodiment of the present invention is based on the speech synthesis method of front-end design, specifically comprises the following steps:
[0038] (1) Data processor, which preprocesses Chinese text data.
[0039] Preprocess special characters and numbers in Chinese text, such as "0.1%" is parsed as "0.1%". "2018" is parsed as "2018", "2018 times" is parsed as "2018 times" and so on. The parsed text is then converted into pinyin with tones. The Chinese text set needs to cover all Chinese pinyin.
[0040] (2) Linguistic feature generator, which extracts linguistic features related to Chinese text.
[0041] a) The pinyin with phonetic symbols obtained in step (1) is split into corresponding phonemes according to a custom dictionary. It includes the setting of conversion rules for special syllables. Some pinyin split rules are as follows:
[0042] a1 a1
gua1 g ua1
na1 n a1
sui1 s uei1
ai1 ai1
guai1 g uai1
nai1 n ai1
sun1...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com