Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice synthesis method and device and electronic equipment

A technology of speech synthesis and speech, which is applied in speech synthesis, speech analysis, instruments, etc., and can solve problems such as inconsistency in timbre

Active Publication Date: 2018-04-03
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
View PDF11 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a speech synthesis method, device and electronic equipment, which are used to solve the technical problem of inconsistency in timbre in parametric speech synthesis in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice synthesis method and device and electronic equipment
  • Voice synthesis method and device and electronic equipment
  • Voice synthesis method and device and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0081] Please refer to figure 1 , The embodiment of the present application provides a method for speech synthesis, the method includes:

[0082] S101: Extract the fundamental frequency parameter and amplitude parameter of the fixed component text audio from the recording of the fixed component text;

[0083] S102: Perform audio compression and filtering processing according to the amplitude parameter to obtain the frequency spectrum parameter of the fixed component text audio;

[0084] S103: When synthesizing speech, synthesize the speech based on the fundamental frequency parameters and frequency spectrum parameters of the fixed component text in the speech to be synthesized.

[0085] In the specific implementation process, the embodiment of the present application may establish a template library before synthesizing speech to store the fundamental frequency parameters and frequency spectrum parameters of the fixed component text. When building a template library, record frequently...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voice synthesis method and device and electronic equipment. The method comprises the steps: extracting the fundamental frequency parameter and amplitude parameter of a fixedcomponent text audio from the record of a fixed component text; carrying out the audio limiting and filtering according to the amplitude parameter, and obtaining the spectrum parameters of the fixed component text audio; and synthesizing the voice based on the fundamental frequency parameter and amplitude parameter of the fixed component text in a to-be-synthesized voice during voice synthesis. According to the technical scheme, the method enables the amplitude of the audio to be more balanced and the audio to be coordinative and consistent through the audio limiting and filtering, enables thespectrum parameters to be consistent with the tone of a pure-parameter synthesized voice (a non-fixed component text), and achieves the synthesis of the voice based on the fundamental frequency parameter and amplitude parameter of the fixed component text audio. The tone of the fixed component text and the tone of the non-fixed component text are consistent with each other, thereby solving a technical problem that the parameter voice synthesis tones are not consistent in the prior art.

Description

Technical field [0001] The present invention relates to the technical field of speech signal processing, in particular to a method, device and electronic equipment for speech synthesis. Background technique [0002] Parametric speech synthesis is currently a mainstream speech synthesis technology. Parametric speech synthesis occupies less space and has high real-time operation. It has a wide range of application prospects on intelligent terminals and embedded devices. [0003] Parametric speech is completed by synthetic text, which is usually composed of fixed and invariant components (that is, fixed component text) and variable parameter components (that is, non-fixed component text). In the prior art, during speech synthesis, a fixed component text is obtained by pre-recording natural speech to obtain part of the speech fragment, the variable component text is subjected to speech synthesis to obtain another speech fragment, and then the two speech fragment signals are spliced ​​...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L13/04G10L13/08
CPCG10L13/04G10L13/08
Inventor 宋阳
Owner BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products