Acoustic model establishing method and device, speech synthesis method and device, facility and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An acoustic model and building method technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of poor sound modeling, sound modeling, and high cost of corpus recording, and achieve good synthesis and modeling performance. Good, the effect of reducing the recording cost

Active Publication Date: 2019-01-29

出门问问创新科技有限公司 +1

View PDF8 Cites 16 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

During the specific implementation process, the inventor found the following problems in the prior art: if the common application scenarios are covered, more corpus needs to be recorded to establish an acoustic model with better effect of synthesizing tones, but the cost of corpus recording is relatively high ; If there are fewer recordings of Erhuayin, it is easy to cause the problem of poor Erhuayin modeling in the acoustic model; it is also impossible to borrow the existing final phonemes in the corpus to model Erhuayin, and it is impossible to synthesize the speech synthesis library. Er Hua Yin

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0031] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0032] figure 1 It is a flow chart of an acoustic model establishment method provided by an embodiment of the present invention, the method is executed by an acoustic model establishment apparatus, and the apparatus is executed by software and / or hardware. The apparatus can be configured in equipment such as terminals and computers. The method can be applied in the scenario of acoustic model modeling.

[0033] Such as figure 1 A...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention discloses an acoustic model establishing method, an acoustic model establishing device, a speech synthesis method, a speech synthesis device, a facility and a storage medium. The acoustic model establishing method comprises the following steps: acquiring a phoneme sequence sample of a plurality of training samples from a corpus, and acquiring the context feature ofeach phoneme and the duration of each phoneme in the phoneme sequence sample, wherein the rhotic accent phoneme in the phoneme sequence sample is split to two phonemes; extracting acoustic features from the training samples; and by adopting the phoneme sequence sample, training the acoustic model by taking the context feature of each phoneme and the duration of each phoneme in the phoneme trainingsample as the input of the acoustic model and the acoustic features as the output of the acoustic model, so that the pretrained acoustic model is obtained. The modeling performance of the rhotic accent is good, the synthesis of the rhotic accent can be well realized, then the rhotic accent not appearing in the corpus can be synthesized, and meanwhile, the recording cost of linguistic data in thecorpus can be reduced.

Description

technical field [0001] Embodiments of the present invention relate to the field of information-to-speech synthesis, and in particular, relate to an acoustic model establishment, speech synthesis method, device, equipment, and storage medium. Background technique [0002] With the continuous development of multimedia communication technology, speech synthesis technology, which is one of the important ways of human-computer interaction, has attracted extensive attention of researchers because of its convenience and speed. Speech synthesis is a technology that generates artificial voice through mechanical and electronic methods. It is a technology that converts text information generated by the computer itself or externally input into intelligible and fluent spoken language output. The purpose of speech synthesis is to convert text into speech and play it to users, and the goal is to achieve the effect of live text broadcasting. [0003] Speech synthesis technology has been wi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L13/04G10L13/06G10L13/10

CPCG10L13/04G10L13/06G10L13/10G10L2013/105

Inventor 张冉

Owner 出门问问创新科技有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Acoustic model establishing method and device, speech synthesis method and device, facility and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology