Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A speech synthesis method and device

A technology of speech synthesis and speech, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of poor expressiveness and low modeling precision, and achieve the effect of strong expressiveness

Active Publication Date: 2019-12-24
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the existing implementation, HMM (Hidden Markov Model, Hidden Markov Model) is used in the pre-selection process of the speech unit and the search process of the candidate space, but since the states in the HMM model are independent of each other, and it is based on The shallow modeling of the decision tree and the linear division of the feature space lead to low modeling accuracy in the case of complex text context features, resulting in smoother and poorer expressiveness in the final synthesized speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A speech synthesis method and device
  • A speech synthesis method and device
  • A speech synthesis method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0065] The core idea of ​​the present invention is to use a neural network model in at least one of the process of preselecting the speech unit and the search process of the candidate space. That is, use the pre-trained first model to select candidate speech units from the speech library for the speech to be synthesized to form an alternative space; use the pre-trained second model to select speech units from the alternative space for splicing, so that the selected speech The search cost of the sequence composed of units is optimal; wherein at least one of the first model and the second model is a neural network model.

[0066] figure 1 It is a flow chart of the method provided by Embodiment 1 of the present invention. In this embodiment, the pre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice synthesis method and a voice synthesis device. The method comprises the following steps: selecting candidate voice units for a voice to be synthesized from a voice base to form a candidate space through a first pre-trained model; and selecting the voice unit from the candidate space for splicing through a second pre-trained model, so that the search cost of a sequence consisting of the selected voice units is the best, wherein at least one of the first model and the second model is a neural network model. The voice synthesis method and the voice synthesis device can improve the naturalness and the expressive force of a finally synthesized voice.

Description

【Technical field】 [0001] The invention relates to the field of computer application technology, in particular to a speech synthesis method and device. 【Background technique】 [0002] With the advent of the mobile era, people's demand for speech synthesis is increasing, such as reading aloud novels, voice navigation, etc., all require speech synthesis. Moreover, people are not only satisfied with clarity and intelligibility for speech synthesis, but also require the synthesized speech to have better naturalness and expressiveness. [0003] For speech synthesis, it is first necessary to process the input text, including preprocessing, word segmentation, part-of-speech tagging, phonetic notation, prosodic level prediction, etc., then predict the acoustic features corresponding to each unit through the acoustic model, and finally use the acoustic parameters to pass the acoustic Speech is synthesized by the coder, or the appropriate speech unit is selected from the corpus for sp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L13/02G10L13/10
CPCG10L13/02G10L13/10
Inventor 盖于涛李秀林康永国
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products