Speech segment splicing system and method for speech synthesis

A speech segment and speech synthesis technology, applied in speech synthesis, speech analysis, instruments, and related fields. It addresses spectrum jumps between spliced speech segments and the failure to find a smooth alignment point, so as to improve running speed, give good continuity at the splice, and enhance the listening experience.

Active Publication Date: 2017-11-28

BEIJING UNISOUND INFORMATION TECH

5 Cites, 0 Cited by

Problems solved by technology

With this splicing method, if the joins between spliced segments are not handled well, the frequency spectrum jumps at the join, and the result sounds unnatural to the listener.

A key technical question is therefore: which splicing method makes the spliced speech segments sound smooth?

[0004] The existing splicing method first aligns the speech segments and then accumulates and smooths them. The smoothing it achieves is mediocre, and the spectrum still jumps between speech segments.

In addition, in some cases this splicing method cannot find a smooth alignment point.
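
As a rough illustration of the existing align-then-accumulate approach described in [0004], the sketch below searches for the shift with the highest normalized correlation and then cross-fades the overlapping samples. The source gives no formulas for this prior-art method, so everything here is an assumption: the names `best_lag` and `crossfade_splice`, the `overlap`, `max_lag`, and `min_score` parameters, the normalized-correlation score, and the linear cross-fade weights are all hypothetical.

```python
import math
from typing import List, Optional, Sequence


def best_lag(tail: Sequence[float], head: Sequence[float],
             max_lag: int, min_score: float = 0.9) -> Optional[int]:
    """Return the shift of `head` that best matches `tail`, or None.

    Assumption: "alignment" means maximizing normalized correlation,
    and a match counts only if its score exceeds `min_score`.
    """
    tail_energy = sum(x * x for x in tail)
    best, best_score = None, min_score
    for lag in range(max_lag + 1):
        window = head[lag:lag + len(tail)]
        if len(window) < len(tail):
            break  # not enough samples left to compare
        energy = tail_energy * sum(x * x for x in window)
        if energy == 0.0:
            continue  # silent window: correlation is undefined
        score = sum(a * b for a, b in zip(tail, window)) / math.sqrt(energy)
        if score > best_score:
            best, best_score = lag, score
    return best


def crossfade_splice(first: Sequence[float], second: Sequence[float],
                     overlap: int, max_lag: int) -> Optional[List[float]]:
    """Align `second` to the tail of `first`, then cross-fade the overlap.

    Returns None when no acceptable alignment point is found.
    """
    if not 1 <= overlap <= len(first):
        raise ValueError("overlap must be between 1 and len(first)")
    tail = first[-overlap:]
    lag = best_lag(tail, second, max_lag)
    if lag is None:
        return None
    aligned = second[lag:]
    mixed = []
    for i in range(overlap):
        w = (i + 1) / (overlap + 1)  # linear fade-in weight for `second`
        mixed.append((1.0 - w) * tail[i] + w * aligned[i])
    return list(first[:-overlap]) + mixed + list(aligned[overlap:])
```

In this sketch the `None` return models the case where no smooth alignment point is found, and the loop over every candidate lag is where the search cost accumulates.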

Embodiment Construction

[0045] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0046] Figure 1 is a schematic diagram of the module structure of a speech segment splicing system for speech synthesis according to the present invention. As shown in Figure 1, the speech segment splicing system for speech synthesis comprises a speech library 1, a sampling point selection module 2, a speech splicing point generation module 3, and a splicing module 4. The speech library 1 contains at least two speech segments. The sampling point selection module extracts two speech segments to be spliced from the speech library 1 as the first speech segment and the second speech segment respectively, and selects the best sampling point. The speech splicing point generation module is used to perform fir...

Abstract

The present invention relates to a system and method for splicing speech segments for speech synthesis. First, two speech segments to be spliced are extracted from a speech library as a first speech segment and a second speech segment, and the best sampling point is selected in each of the first and second speech segments. Then, first-order smoothing is applied to the best sampling points to generate a speech splicing point. The first-order smoothing method is: calculate the slopes ka and kb at the best sampling points U1 and U2, and the numerical difference deltaU between U1 and U2; then predict from the slopes ka, kb and the difference deltaU to generate a speech splicing point. Finally, the speech splicing point is inserted between the first speech segment and the second speech segment to generate a third speech segment. The present invention solves the speech spectrum jumps that occur with direct splicing in the prior art, as well as the excessive computation of the autocorrelation search and accumulation smoothing method. Through first-order smoothing it obtains a well-connected frequency spectrum at the splice and improves the user's listening experience.
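
The first-order smoothing step in the abstract can be sketched as follows. This is a minimal illustration, not the patented formula: the abstract says only that the splicing point is predicted from ka, kb, and deltaU, so the prediction rule here (the average of a forward extrapolation from U1 and a backward extrapolation from U2) is an assumption. Taking U1 as the last sample of the first segment and U2 as the first sample of the second, estimating slopes by one-sample differences, and the function names `splice_point` and `splice` are likewise hypothetical.

```python
from typing import List, Sequence


def splice_point(first: Sequence[float], second: Sequence[float]) -> float:
    """Predict one splicing sample between two segments.

    Assumptions (not from the source): U1 is the last sample of `first`,
    U2 is the first sample of `second`, and slopes are one-sample
    differences. Each segment needs at least two samples.
    """
    if len(first) < 2 or len(second) < 2:
        raise ValueError("each segment needs at least two samples")
    u1, u2 = first[-1], second[0]
    ka = first[-1] - first[-2]    # slope at U1
    kb = second[1] - second[0]    # slope at U2
    delta_u = u2 - u1             # numerical difference between U1 and U2
    # Assumed rule: mean of (u1 + ka) and (u2 - kb), rewritten in terms
    # of ka, kb and delta_u.
    return u1 + 0.5 * delta_u + 0.5 * (ka - kb)


def splice(first: Sequence[float], second: Sequence[float]) -> List[float]:
    """Insert the predicted splicing point between the two segments."""
    return list(first) + [splice_point(first, second)] + list(second)


if __name__ == "__main__":
    print(splice([0.0, 1.0, 2.0], [4.0, 5.0, 6.0]))
```

Under these assumptions the cost is a handful of arithmetic operations per join, with no search over candidate alignments.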

Description

Technical field

[0001] The invention relates to the field of speech synthesis, in particular to a speech segment splicing system and method for speech synthesis.

Background art

[0002] There are two existing approaches to speech synthesis: one based on speech feature parameters and one based on waveform splicing. Compared with the parameter-based method, speech synthesis based on waveform splicing produces higher-quality synthesized speech that sounds more natural and is closer to the timbre of the original speaker. Mainstream online speech synthesis therefore currently relies on waveform splicing.

[0003] The principle of the speech synthesis method based on waveform splicing is as follows: first, appropriate speech units are selected from a pre-recorded and labeled speech library as the speech segments to be spliced; then the final synthesized speech is obtained by splicing those segments. Using this splicing method, i...

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G10L13/033

Inventor: 刘青松

Owner: BEIJING UNISOUND INFORMATION TECH
