Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Acoustic model establishing method and device, speech synthesis method and device, facility and storage medium

An acoustic model and building method technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of poor sound modeling, sound modeling, and high cost of corpus recording, and achieve good synthesis and modeling performance. Good, the effect of reducing the recording cost

Active Publication Date: 2019-01-29
出门问问创新科技有限公司 +1
View PDF8 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

During the specific implementation process, the inventor found the following problems in the prior art: if the common application scenarios are covered, more corpus needs to be recorded to establish an acoustic model with better effect of synthesizing tones, but the cost of corpus recording is relatively high ; If there are fewer recordings of Erhuayin, it is easy to cause the problem of poor Erhuayin modeling in the acoustic model; it is also impossible to borrow the existing final phonemes in the corpus to model Erhuayin, and it is impossible to synthesize the speech synthesis library. Er Hua Yin

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Acoustic model establishing method and device, speech synthesis method and device, facility and storage medium
  • Acoustic model establishing method and device, speech synthesis method and device, facility and storage medium
  • Acoustic model establishing method and device, speech synthesis method and device, facility and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0032] figure 1 It is a flow chart of an acoustic model establishment method provided by an embodiment of the present invention, the method is executed by an acoustic model establishment apparatus, and the apparatus is executed by software and / or hardware. The apparatus can be configured in equipment such as terminals and computers. The method can be applied in the scenario of acoustic model modeling.

[0033] Such as figure 1 A...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses an acoustic model establishing method, an acoustic model establishing device, a speech synthesis method, a speech synthesis device, a facility and a storage medium. The acoustic model establishing method comprises the following steps: acquiring a phoneme sequence sample of a plurality of training samples from a corpus, and acquiring the context feature ofeach phoneme and the duration of each phoneme in the phoneme sequence sample, wherein the rhotic accent phoneme in the phoneme sequence sample is split to two phonemes; extracting acoustic features from the training samples; and by adopting the phoneme sequence sample, training the acoustic model by taking the context feature of each phoneme and the duration of each phoneme in the phoneme trainingsample as the input of the acoustic model and the acoustic features as the output of the acoustic model, so that the pretrained acoustic model is obtained. The modeling performance of the rhotic accent is good, the synthesis of the rhotic accent can be well realized, then the rhotic accent not appearing in the corpus can be synthesized, and meanwhile, the recording cost of linguistic data in thecorpus can be reduced.

Description

technical field [0001] Embodiments of the present invention relate to the field of information-to-speech synthesis, and in particular, relate to an acoustic model establishment, speech synthesis method, device, equipment, and storage medium. Background technique [0002] With the continuous development of multimedia communication technology, speech synthesis technology, which is one of the important ways of human-computer interaction, has attracted extensive attention of researchers because of its convenience and speed. Speech synthesis is a technology that generates artificial voice through mechanical and electronic methods. It is a technology that converts text information generated by the computer itself or externally input into intelligible and fluent spoken language output. The purpose of speech synthesis is to convert text into speech and play it to users, and the goal is to achieve the effect of live text broadcasting. [0003] Speech synthesis technology has been wi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/04G10L13/06G10L13/10
CPCG10L13/04G10L13/06G10L13/10G10L2013/105
Inventor 张冉
Owner 出门问问创新科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products