Voice changing method and system for changing voice tones and timbres

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A pitch and voice technology, applied in the field of voice timbre adjustment, can solve the problems of poor voice change and voice change effect, not involving independent adjustment of fundamental frequency and formant, etc.

Pending Publication Date: 2020-10-23

SHANGHAI YINGZHUO INFORMATION TECH CO LTD

View PDF1 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The above-mentioned patent documents can achieve the effect of generating a specific sound. After the generation, the voice signal still retains the pitch and speed of the original speaker corresponding to the first voice signal, and has the second voice signal corresponding to the sound of the voice-changing object, so as to overcome the inability to change the voice for a specific object. and the technical defect of poor sound changing effect, but it does not involve the independent adjustment of fundamental frequency and formant

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0033] The present invention changes the fundamental frequency based on the "resampling and WSOLA" technology in the time domain, and then extracts the spectral envelope through the cepstrum method, and uses the spectral envelope (channel system function) to move the formant without changing the fundamental frequency. Specifically, it is implemented through the following steps,

[0034] First, according to the requirement of "base frequency ratio adjustment factor b", the speech data x[n] is resampled in the time domain to obtain the speech data rs[n], and the sequence length of rs[n] is the sequence length of x[n] b times.

[0035] Secondly, using "WSOLA" and other similar pitch-keeping algorithms, the speaker's speech rate can be changed without changing the speaker's pitch, and rs[n] can be scaled to the original speech length, and ws[n] can be output, and the pitch It is the change of the base frequency, the speed change is the change of the speaking speed, rs[n] is the v...

Embodiment 2

[0043] Embodiment 2 can be regarded as a preferred example of Embodiment 1. The system for changing the voice pitch and timbre described in Embodiment 2 utilizes the steps of the method for changing the voice pitch and timbre described in Embodiment 1.

[0044] According to a kind of voice changing system that changes voice pitch and timbre provided by the present invention, comprises following module:

[0045] Module S1: According to the requirement of "base frequency ratio adjustment factor b", the first speech data is resampled in the time domain to obtain the second speech data, the sequence length of the second speech data is the sequence length of the first speech data b times;

[0046] Module S2: using the pitch hold algorithm to change the rate of speech, scaling the speech length of the second speech data, and outputting the third speech data;

[0047] Module S3: transform the third voice data by windowing to obtain a complex spectrum, perform polar coordinate conve...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice changing method and system for changing voice tones and timbres. The method comprises the steps: re-sampling first voice data in a time domain according to the requirement of a fundamental frequency proportion adjustment factor b to obtain second voice data, wherein the sequence length of the second voice data is b times the sequence length of the first voice data; changing tones by utilizing tone keeping, zooming the voice length of the second voice data, and outputting third voice data; windowing the third voice data for transformation to obtain a complex frequency spectrum, performing polar coordinate transformation on the complex frequency spectrum to obtain an amplitude spectrum and a phase spectrum, performing cepstrum transformation on the amplitude spectrum, extracting a frequency spectrum envelope, and extracting a fundamental frequency spectrum; and according to a resonance peak proportion adjustment factor f, adjusting the frequency spectrum envelope to synthesize a new amplitude spectrum, combining the new amplitude spectrum and a phase spectrum, converting the polar coordinates into rectangular coordinates, carrying out IFFT conversion, and carrying out window compensation to generate new fourth voice data. According to the invention, the problem of independent adjustment of fundamental frequency and resonance peaks is solved.

Description

technical field [0001] The present invention relates to the technical field of voice timbre adjustment, in particular to a voice changing method and system for changing voice pitch and timbre. Background technique [0002] Fundamental frequency and formant are very important features in speech. The fundamental frequency is the frequency at which the vocal cords vibrate when making voiced sounds. The fundamental frequency is directly related to the gender of the speaker. Generally, the fundamental frequency of a male voice is relatively low, and that of a female voice is relatively high. , the fundamental frequency of the elderly is lower than that of young people; the formant refers to the resonant frequency of the glottal wave in the vocal tract, the longer the vocal tract, the higher the frequency of the formant, and the male vocal tract is longer than the female vocal tract . Most of the existing voice changing schemes cannot adjust the fundamental frequency and formant ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/007G10L21/013G10L21/043

CPCG10L21/007G10L21/013G10L21/043G10L2021/0135

Inventor 邓海峰林立曹烈安张鹏飞

Owner SHANGHAI YINGZHUO INFORMATION TECH CO LTD

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice changing method and system for changing voice tones and timbres

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology