Speech synthesis method and device, equipment and storage medium
A speech synthesis technology, applied in the fields of speech synthesis methods, devices, computer equipment, and storage media. It addresses problems such as poor user experience and a low degree of fit between synthesized and intended speech, and achieves the effect of improving user experience.
Examples
Embodiment 1
[0031] Refer to Figure 1, which is a schematic flowchart of a speech synthesis method disclosed in an embodiment of the present invention. As shown in Figure 1, the speech synthesis method may include the following operations:
[0032] 101. Input the reference speech sequence into a preset speech prosody analysis model for analysis to obtain speech prosody feature information.
[0033] In step 101 above, the reference speech sequence may be the speech that the user wants the synthesized speech to resemble. For example, if the user wants the synthesized speech to sound closer to the voice of person A, a recording of person A speaking can be converted into a reference speech sequence. The prosody of speech includes the intensity, pitch, and duration of the speech, and the prosody of different speakers usually differs to some degree. The speech prosody analysis model analyzes the reference speech sequence, and the speech prosody fe...
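To make the three prosody attributes named above concrete, the following is a minimal sketch that computes a rough intensity, pitch, and duration from a raw signal. It is not the preset speech prosody analysis model of this embodiment, whose internals are not given here. The function name `prosody_features`, the RMS and autocorrelation estimators, the 60 to 400 Hz search range, and the synthetic sine-wave input are all assumptions made for illustration.

```python
import math

def prosody_features(samples, sample_rate, fmin=60.0, fmax=400.0):
    """Return rough intensity, pitch, and duration for a mono signal.

    Hypothetical helper: hand-crafted estimators standing in for a
    learned prosody analysis model.
    """
    n = len(samples)
    # Duration: number of samples divided by the sampling rate.
    duration_s = n / sample_rate
    # Intensity: root-mean-square amplitude of the signal.
    intensity = math.sqrt(sum(x * x for x in samples) / n)
    # Pitch: the lag with the highest autocorrelation inside the
    # assumed voice range [fmin, fmax] gives the fundamental period.
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(samples[i] * samples[i + lag] for i in range(n - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return {
        "intensity": intensity,
        "pitch_hz": sample_rate / best_lag,
        "duration_s": duration_s,
    }

# Assumed stand-in for a reference speech sequence: 0.1 s of a 200 Hz tone.
sample_rate = 8000
reference = [0.5 * math.sin(2 * math.pi * 200 * t / sample_rate)
             for t in range(800)]
features = prosody_features(reference, sample_rate)
print(features)
```

On real speech, these values would normally be computed per frame rather than once per recording, since prosody varies over the course of an utterance.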
Embodiment 2
[0066] Refer to Figure 2, which is a schematic structural diagram of a speech synthesis device disclosed in an embodiment of the present invention. As shown in Figure 2, the speech synthesis device may include:
[0067] A speech prosody analysis module 201, configured to input the reference speech sequence into a preset speech prosody analysis model for analysis to obtain speech prosody feature information;
[0068] A text prosody analysis module 202, configured to input the target text sequence into a preset text prosody analysis model for analysis to obtain text prosody feature information;
[0069] A merge processing module 203, configured to perform preset merge processing on the speech prosody feature information and the text prosody feature information, to obtain prosody information for recording the prosody of the target speech to be synthesized;
[0070] A speech synthesis module 204, configured to synthesize the target speech based on the target text...
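The data flow between modules 201 to 204 can be sketched as follows. This is an illustrative outline only: the class name `SpeechSynthesisDevice`, the toy stand-in functions, and the use of concatenation as the "preset merge processing" are assumptions, since the actual models and merge operation are not specified in this excerpt. Because paragraph [0070] is truncated, passing the merged prosody information to the synthesizer alongside the target text is also an assumption.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Vector = List[float]

@dataclass
class SpeechSynthesisDevice:
    """Hypothetical container wiring the four modules together."""
    speech_prosody_model: Callable[[Sequence[float]], Vector]  # module 201
    text_prosody_model: Callable[[str], Vector]                # module 202
    merge: Callable[[Vector, Vector], Vector]                  # module 203
    synthesizer: Callable[[str, Vector], Vector]               # module 204

    def synthesize(self, reference_speech: Sequence[float],
                   target_text: str) -> Vector:
        speech_prosody = self.speech_prosody_model(reference_speech)
        text_prosody = self.text_prosody_model(target_text)
        prosody = self.merge(speech_prosody, text_prosody)
        return self.synthesizer(target_text, prosody)

# Toy stand-ins (assumptions), used only to exercise the data flow.
def toy_speech_prosody(speech: Sequence[float]) -> Vector:
    return [max(abs(x) for x in speech)]      # peak amplitude

def toy_text_prosody(text: str) -> Vector:
    return [float(len(text.split()))]         # word count

def concat_merge(a: Vector, b: Vector) -> Vector:
    return a + b                              # assumed merge: concatenation

def toy_synthesizer(text: str, prosody: Vector) -> Vector:
    # One placeholder "sample" per character, scaled by the first
    # merged prosody value (the reference speech's peak amplitude).
    return [prosody[0]] * len(text)

device = SpeechSynthesisDevice(toy_speech_prosody, toy_text_prosody,
                               concat_merge, toy_synthesizer)
target_speech = device.synthesize([0.1, -0.4, 0.2], "hello world")
print(len(target_speech), target_speech[0])
```

Keeping each module as an injected callable mirrors the modular structure of the device: any of the four stand-ins could be replaced by a trained model without changing `synthesize`.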
Embodiment 3
[0086] Refer to Figure 3, which is a schematic structural diagram of a computer device disclosed in an embodiment of the present invention. As shown in Figure 3, the computer device may include:
[0087] A memory 301 storing executable program codes;
[0088] A processor 302 connected to the memory 301;
[0089] The processor 302 invokes the executable program code stored in the memory 301 to execute the steps in the speech synthesis method disclosed in Embodiment 1 of the present invention.