Non-periodic component syllable model building and speech synthesizing method and device
A non-periodic component and model building technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of poor spectral coherence of aperiodic components, large amount of data, and low quality of synthesized audio.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0069] like figure 1 As shown, it is a schematic flow chart of a method for establishing a non-periodic component syllable model in Embodiment 1 of the present invention, and the method includes:
[0070] Step 101: Obtain the original voice wave file in the voice database.
[0071] Specifically, in step 101, the voice database includes a large number of original voice waveform files and annotation files corresponding to the original voice waveform files, for example: files in Wav format and corresponding file identifiers (ie Lable).
[0072] Wherein, there is a one-to-one correspondence between the annotation file and the original voice waveform file, that is to say, each original voice waveform file corresponds to a unique annotation file.
[0073] Before preparing to build the aperiodic component syllable model, a large number of original speech waveform files are obtained from the speech database, and after analysis and processing, the required language parameter model, th...
Embodiment 2
[0123] like figure 2 As shown, it is a schematic flowchart of a speech synthesis method based on an aperiodic component syllable model in Embodiment 2 of the present invention. Embodiment 2 of the present invention is implemented on the basis of Embodiment 1 of the present invention. The method includes:
[0124] Step 201: Use a text analysis device to convert the acquired text information to be speech-synthesized into an original speech waveform file, and obtain an annotation file of the original speech waveform file according to the converted original speech waveform file.
[0125] Specifically, in step 201, after acquiring the text information to be synthesized into speech, it is necessary to use a text analysis device to convert the acquired text information to be synthesized into an original waveform file, and obtain the original voice according to the converted original voice waveform file. Annotation file for wave files.
[0126] Step 202: According to the correspondi...
Embodiment 3
[0136] like image 3 As shown, it is a schematic structural diagram of an aperiodic component syllable model building device in Embodiment 3 of the present invention. Embodiment 3 of the present invention is an invention under the same concept as Embodiment 1 of the present invention and Embodiment 2 of the present invention. The equipment includes: aperiodic component representative value determination module 11, aperiodic component spectrum fitting curve generation module 12 and aperiodic component syllable model building module 13, wherein:
[0137] Aperiodic component representative value determining module 11 is used to decompose the original speech waveform file in the voice database, and obtain the aperiodic component spectrum information, fundamental frequency information and vocal tract spectrum information of each syllable in the original speech waveform file; and according to Preset at least one frequency band information divided for each frame of the syllable and t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com