Embedded equipment, bimodule voice synthesis system and method

An embedded device and speech synthesis technology, applied in speech synthesis, speech analysis, instruments, etc. It addresses problems such as low user satisfaction and degraded sound quality and naturalness of synthesized speech, and achieves lower resource occupation, faster speed, and low demands on computing capability.

Active Publication Date: 2012-05-09
PANASONIC CORP

AI Technical Summary

Problems solved by technology

The disadvantage of splicing synthesis is that a large amount of voice data must be stored in advance to obtain good results, and synthesis quality depends directly on the size of the stored data: when the voice library is greatly reduced, sound quality drops sharply as well.
Given the characteristics of embedded devices, researchers (see reference [6]) have ported splicing speech synthesis to embedded devices by simplifying the text analysis and prosody prediction modules and reducing the number of speech units in the sound library. The result, however, was a drastic drop in the quality and naturalness of the synthesized speech.
When parametric synthesis is applied on embedded devices (see reference [7]), resource occupation is not a problem, but in many cases the synthesized speech does not satisfy users.


Examples

Embodiment Construction

[0037] Figure 1 is a schematic diagram of a dual-mode speech synthesis system for embedded devices according to the present invention. In Figure 1, User 1 owns embedded device 1 and User 2 owns embedded device 2. Both embedded devices 1 and 2 can connect to servers 1 and 2 over wireless connections.

[0038] Figure 2 is a system block diagram of the dual-mode speech synthesis system for embedded devices according to the present invention. As shown in Figure 2, the system is divided into an embedded device side 100 and a server side 200. The embedded device side 100 mainly includes a text preprocessing module 110, a network availability detection module 120, a parameter synthesis module 130, a real-time detection module 140, a sound quality detection module 150 and a voice output module 160. The server side 200 mainly includes a splicing and synthesis module 210. Of...
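The module layout named in paragraph [0038] can be written down as a small lookup table. This is only an illustrative sketch: the `Module` class, the `MODULES` list and the `modules_on` helper are hypothetical names that do not appear in the patent; only the module names and reference numerals come from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Module:
    ref: int    # reference numeral used in Figure 2
    name: str   # module name as given in paragraph [0038]
    side: str   # "device" (side 100) or "server" (side 200)


# Modules listed in paragraph [0038]; the grouping by side follows the text.
MODULES = [
    Module(110, "text preprocessing", "device"),
    Module(120, "network availability detection", "device"),
    Module(130, "parameter synthesis", "device"),
    Module(140, "real-time detection", "device"),
    Module(150, "sound quality detection", "device"),
    Module(160, "voice output", "device"),
    Module(210, "splicing and synthesis", "server"),
]


def modules_on(side: str) -> list:
    """Return the modules that live on the given side of the system."""
    return [m for m in MODULES if m.side == side]
```

The split makes the resource argument visible: only the splicing and synthesis module, which needs the large voice library, sits on the server side.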



Abstract

The invention provides an embedded device, a bimodal speech synthesis system and a bimodal speech synthesis method. The embedded device comprises a network availability detecting unit, a parameter synthesizing unit and a timbre detecting unit. The network availability detecting unit detects whether the network is available. If the network is available, the received text is transmitted over the network to a splicing and synthesizing unit on the server side; if the network is unavailable, the text is input to the parameter synthesizing unit. The parameter synthesizing unit performs parametric speech synthesis on the input text and outputs the synthesized speech to the timbre detecting unit. The timbre detecting unit receives the parameter synthesis result from the parameter synthesizing unit and, within the time allowed by the real-time requirement, the splicing synthesis result from the splicing and synthesizing unit on the server side. It evaluates the speech quality of the results and selects the synthesis result with the best timbre quality for output.
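The selection logic described in the abstract can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the function and parameter names are hypothetical, audio is represented as plain `bytes`, the quality metric is supplied by the caller, and it is assumed that local parametric synthesis also runs when the network is available so that a fallback exists if the server misses the deadline.

```python
import concurrent.futures as cf
import time
from typing import Callable


def synthesize_dual_mode(
    text: str,
    network_available: Callable[[], bool],
    parametric_synth: Callable[[str], bytes],
    server_concat_synth: Callable[[str], bytes],
    quality_score: Callable[[bytes], float],
    deadline_s: float,
) -> bytes:
    """Return the better of the local and server results within deadline_s."""
    # No network: the on-device parametric result is the only candidate.
    if not network_available():
        return parametric_synth(text)

    start = time.monotonic()
    pool = cf.ThreadPoolExecutor(max_workers=1)
    try:
        # Request splicing synthesis from the server in the background.
        remote = pool.submit(server_concat_synth, text)
        # Assumption: synthesize locally as well, so a fallback always exists.
        local_audio = parametric_synth(text)
        remaining = max(0.0, deadline_s - (time.monotonic() - start))
        try:
            remote_audio = remote.result(timeout=remaining)
        except Exception:
            # Server error or deadline exceeded: fall back to the local result.
            return local_audio
    finally:
        # Do not block on a late server reply.
        pool.shutdown(wait=False)

    # Both results arrived in time: keep the one that scores higher.
    return max((local_audio, remote_audio), key=quality_score)
```

A caller would pass in its own synthesis and scoring functions. The deadline is measured from the start of the call, so time spent on local synthesis counts against the wait for the server.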

Description

Technical field

[0001] The present invention relates to speech synthesis technology for converting arbitrary text to natural speech on embedded devices, and more specifically to an embedded device, a dual-mode speech synthesis system and a dual-mode speech synthesis method that provide users with high-quality synthesized speech output according to the user's requirements for real-time performance and sound quality.

Background art

[0002] With the advent of the digital age, voice interaction technology is used more and more widely. As an important part of voice interaction, text-to-speech synthesis has attracted growing attention from academia and industry. Many companies, universities and research institutes at home and abroad have carried out extensive and in-depth research on speech synthesis, and have proposed waveform splicing synthesis based on a pre-recorded speech database (see references [1][2]), spe...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L13/00, G10L13/02, G10L13/04, G10L13/08, G10L11/00, G10L25/60
Inventor: 夏海荣
Owner: PANASONIC CORP