Concatenation point smoothing method in waveform concatenation, device and storage medium

A technology of waveform splicing and splicing points, which is applied in speech synthesis, speech analysis, instruments, etc. It can solve the problems of discontinuous spectrum of synthesized speech, the influence of frequency domain parameters, and karaoke sound of synthesized speech, etc.

Pending Publication Date: 2019-08-30
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The judgment error of the pitch period or its starting point will affect the effect of PSOLA technology
Secondly, PSOLA technology is a simple splicing synthesis of waveform mapping. Whether this splicing can maintain a smooth transition and its impact on frequency domain parameters has not been resolved.
[0003] In addition, the unit speech data used for splicing often have differences in frequency or pitch. Therefore, after splicing using the TD-PSOLA algorithm, it will bring discontinuity in the synthesized speech spectrum, and the pitch correction is relatively large. Sometimes this discontinuity will be very obvious, manifested as a karaoke sound in the synthetic voice

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Concatenation point smoothing method in waveform concatenation, device and storage medium
  • Concatenation point smoothing method in waveform concatenation, device and storage medium
  • Concatenation point smoothing method in waveform concatenation, device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0075] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0076] The present invention provides a splicing point smoothing method in waveform splicing, which is applied to an electronic device 1 . refer to figure 1 As shown, it is a schematic diagram of an application environment according to a specific embodiment of the splicing point smoothing method in waveform splicing according to the present invention.

[0077] In this embodiment, the electronic device 1 may be a server, a smart phone, a tablet computer, a portable computer, a desktop computer, and other terminal devices with computing functions.

[0078] The electronic device 1 includes: a processor 12 , a memory 11 , a network interface 14 and a communication bus 15 .

[0079] The memory 11 includes at least one type of readable storage medium. The at least one type of readable storage medium may be a non-volatile...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of voice signal processing, and provides a concatenation point smoothing method in waveform concatenation. The concatenation point smoothing method in the waveform concatenation is applied to an electronic device. The concatenation point smoothing method in waveform concatenation comprises the steps that concatenation points of two voice units to be subjected toconcatenation are determined, and voice signal segments with preset lengths at the two concatenation points are intercepted correspondingly; the two voice signal segments are correspondingly subjectedto windowing processing through a window function to correspondingly obtain corresponding short-time analysis signals; the amplitudes, phases and frequency of the two short-time analysis signals areacquired correspondingly based on the short-time Fourier transform; polynomial interpolation based on distance weight is carried out on the amplitudes, phases and frequencys of the two short-time analysis signals to acquire new amplitudes, phases and frequency; and sine wave synthesis is carried out on the new amplitudes, phases and frequency to acquire new voice signal segments. According to theconcatenation point smoothing method in waveform concatenation, data of the voice units are analyzed through a sinusoidal model, and the voice signals at the concatenation position are expressed as the sum of a series of sine waves, so that the smooth transition of synthesized voice is ensured, and the improvement of the naturalness degree of the synthesized voice is facilitated.

Description

technical field [0001] The present invention relates to the technical field of speech signal processing, in particular to a splicing point smoothing method, device and computer-readable storage medium in waveform splicing. Background technique [0002] Waveform splicing technology is a technology applied in the speech synthesis system. This technology synthesizes the required voice by splicing pre-recorded unit voice data. Among them, the PSOLA technology is a pitch-synchronous speech analysis / synthesis technology, which first requires accurate pitch period and determination of its starting point. The judgment error of the pitch period or its starting point will affect the effect of PSOLA technology. Secondly, PSOLA technology is a simple splicing synthesis of waveform mapping. Whether this splicing can maintain a smooth transition and its impact on frequency domain parameters has not been resolved. [0003] In addition, the unit speech data used for splicing often have d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L13/047G10L13/033
CPCG10L13/047G10L13/033
Inventor 彭话易程宁王健宗
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products