Embedded equipment, bimodule voice synthesis system and method

An embedded device and speech synthesis technology, applied in speech synthesis, speech analysis, and related fields, that addresses user dissatisfaction and the degradation of synthesized sound quality and naturalness, while achieving low resource occupation, fast synthesis, and low demands on computing capability.

Active Publication Date: 2012-05-09

PANASONIC CORP

Cites: 3; Cited by: 0


Problems solved by technology

The disadvantage of splicing (concatenative) synthesis is that a large amount of voice data must be stored in advance to obtain good results, and the synthesis quality is directly tied to the size of the stored data: when the voice library is greatly reduced, sound quality drops sharply as well.

To suit the characteristics of embedded devices, researchers (see reference [6]) have ported splicing speech synthesis to embedded devices by simplifying the text analysis and prosody prediction modules and by reducing the number of speech units in the sound bank. The result, however, was a drastic drop in synthesized sound quality and naturalness.

When parametric synthesis is applied on embedded devices (see reference [7]), resource occupation is not a problem, but in many cases the synthesized voice does not satisfy users.



Embodiment Construction

[0037] Figure 1 is a schematic diagram of an embedded device dual-mode speech synthesis system according to the present invention. In Figure 1, user 1 owns embedded device 1 and user 2 owns embedded device 2. Both embedded devices 1 and 2 can connect to servers 1 and 2 over wireless connections.

[0038] Figure 2 is a system block diagram of the embedded device dual-mode speech synthesis system according to the present invention. As shown in Figure 2, the system is divided into an embedded device side 100 and a server side 200. The embedded device side 100 mainly includes a text preprocessing module 110, a network availability detection module 120, a parameter synthesis module 130, a real-time detection module 140, a sound quality detection module 150 and a voice output module 160. The server side 200 mainly includes a splicing and synthesis module 210. Of...
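
The module layout listed in paragraph [0038] can be written down as a small lookup table. This is only an illustrative sketch: the `Side` enum, the `MODULES` table and the `modules_on` helper are hypothetical names, and only the reference numerals and module names are taken from the text. Because the paragraph says each side "mainly includes" these modules, the table may not be exhaustive.

```python
from enum import Enum


class Side(Enum):
    DEVICE = "embedded device side 100"
    SERVER = "server side 200"


# Reference numerals and module names as listed in paragraph [0038].
MODULES = {
    110: ("text preprocessing", Side.DEVICE),
    120: ("network availability detection", Side.DEVICE),
    130: ("parameter synthesis", Side.DEVICE),
    140: ("real-time detection", Side.DEVICE),
    150: ("sound quality detection", Side.DEVICE),
    160: ("voice output", Side.DEVICE),
    210: ("splicing and synthesis", Side.SERVER),
}


def modules_on(side: Side) -> list[int]:
    """Return the reference numerals of the modules located on one side."""
    return sorted(num for num, (_, s) in MODULES.items() if s is side)


print(modules_on(Side.SERVER))       # [210]
print(len(modules_on(Side.DEVICE)))  # 6
```

The table makes the split visible at a glance: only the splicing and synthesis module sits on the server, while the remaining listed modules run on the device.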


Abstract

The invention provides an embedded device, a dual-mode speech synthesis system and a dual-mode speech synthesis method. The embedded device comprises a network availability detecting unit, a parameter synthesizing unit and a sound quality detecting unit. The network availability detecting unit detects whether the network is available. If the network is available, the received text is transmitted over the network to a splicing and synthesizing unit on the server side; if the network is unavailable, the text is input to the parameter synthesizing unit. The parameter synthesizing unit performs parametric speech synthesis on the input text and outputs the synthesized speech to the sound quality detecting unit. The sound quality detecting unit receives, within the time allowed by the real-time requirement, the parametric synthesis result from the parameter synthesizing unit and the splicing synthesis result from the splicing and synthesizing unit on the server side, evaluates the speech quality of these results, and outputs the synthesis result with the best sound quality.
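
The selection flow in the abstract can be sketched roughly as follows. This is not the patented implementation: `Candidate`, `dual_mode_synthesize`, the stub engines and the numeric quality scores are all hypothetical, and the sketch assumes that the on-device parametric result is always produced so that a fallback exists when the server result does not arrive within the real-time limit.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Candidate:
    source: str     # "parametric" (on device) or "splicing" (server side)
    audio: bytes    # synthesized waveform; placeholder bytes in this sketch
    quality: float  # score from a hypothetical speech-quality evaluator


def dual_mode_synthesize(
    text: str,
    network_available: Callable[[], bool],
    parametric_synth: Callable[[str], Candidate],
    server_splicing_synth: Callable[[str, float], Optional[Candidate]],
    deadline_s: float,
) -> Candidate:
    """Return the best-scoring result that is ready within the deadline."""
    # Assumption: the parametric result is always computed locally.
    candidates = [parametric_synth(text)]

    if network_available():
        # Assumed contract: the server call returns None when no result
        # arrives before the real-time deadline.
        remote = server_splicing_synth(text, deadline_s)
        if remote is not None:
            candidates.append(remote)

    # Quality detection: keep the candidate with the highest score.
    return max(candidates, key=lambda c: c.quality)


# Stub engines with made-up scores, for illustration only.
def fake_parametric(text: str) -> Candidate:
    return Candidate("parametric", b"p", 3.1)


def fake_server(text: str, deadline_s: float) -> Optional[Candidate]:
    return Candidate("splicing", b"s", 4.2)


best = dual_mode_synthesize("hello", lambda: True, fake_parametric, fake_server, 0.5)
offline = dual_mode_synthesize("hello", lambda: False, fake_parametric, fake_server, 0.5)
print(best.source)     # splicing
print(offline.source)  # parametric
```

Passing the engines in as callables keeps the selection logic independent of any particular synthesizer or network stack, which is convenient for testing the fallback path.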

Description

Technical field

[0001] The present invention relates to speech synthesis technology for converting arbitrary text to natural speech on embedded devices, and more specifically to an embedded device, a dual-mode speech synthesis system and a dual-mode speech synthesis method which, based on the user's requirements for real-time performance and sound quality, provide the user with high-quality speech synthesis output.

Background art

[0002] With the advent of the digital age, voice interaction technology is used more and more widely. As an important part of voice interaction, text-to-speech synthesis technology has attracted growing attention from academia and industry. Many companies, universities and research institutes at home and abroad have carried out extensive, in-depth research on speech synthesis and have proposed waveform splicing synthesis based on a pre-recorded speech database (see references [1][2]), spe...


Application Information


Patent Type & Authority: Patents (China)

IPC(8): G10L13/00, G10L13/02, G10L13/04, G10L13/08, G10L11/00, G10L25/60

Inventor: 夏海荣

Owner: PANASONIC CORP
