Voice changing method and system for changing voice tones and timbres

A pitch and voice technology, applied in the field of voice timbre adjustment, that addresses the poor voice-changing effect of existing schemes and their lack of independent adjustment of the fundamental frequency and the formants.

Pending Publication Date: 2020-10-23
SHANGHAI YINGZHUO INFORMATION TECH CO LTD

AI Technical Summary

Problems solved by technology

The above-mentioned patent documents can generate a specific voice. The generated voice signal retains the pitch and speaking rate of the original speaker of the first voice signal while taking on the sound of the voice-changing target that corresponds to the second voice signal. This overcomes the technical defects of being unable to change the voice toward a specific target and of a poor voice-changing effect. However, it does not involve independent adjustment of the fundamental frequency and the formants.

Examples

Embodiment 1

[0033] The present invention changes the fundamental frequency in the time domain using the "resampling and WSOLA" technique, then extracts the spectral envelope by the cepstrum method and uses the spectral envelope (the vocal tract system function) to move the formants without changing the fundamental frequency. Specifically, it is implemented through the following steps.

[0034] First, according to the "fundamental frequency ratio adjustment factor b", the speech data x[n] is resampled in the time domain to obtain the speech data rs[n]; the sequence length of rs[n] is b times the sequence length of x[n].
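The resampling step can be illustrated with a minimal sketch. The function name `resample_linear` and the use of linear interpolation are assumptions made for illustration; the source only states that rs[n] is b times as long as x[n] and does not name an interpolation method. Under this length convention, playing rs[n] back at the original sample rate scales every frequency in the signal by 1/b.

```python
def resample_linear(x, b):
    """Resample x so that the result has round(b * len(x)) samples.

    Linear interpolation is an assumption; the source does not specify
    which resampling filter is used.
    """
    if not x:
        return []
    n_out = max(1, round(b * len(x)))
    if n_out == 1 or len(x) == 1:
        return [float(x[0])] * n_out
    # Distance, measured in input samples, between consecutive output samples.
    step = (len(x) - 1) / (n_out - 1)
    rs = []
    for m in range(n_out):
        pos = m * step
        i = min(int(pos), len(x) - 2)   # left neighbour, kept in range
        frac = pos - i                  # fractional distance to the right neighbour
        rs.append((1.0 - frac) * x[i] + frac * x[i + 1])
    return rs


x = [0.0, 1.0, 2.0, 3.0]
rs = resample_linear(x, 2.0)   # twice as many samples, same start and end values
```

A production system would more likely use a band-limited (polyphase or sinc) resampler to avoid aliasing when b is less than 1.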

[0035] Second, "WSOLA" or a similar pitch-preserving algorithm, which changes the speaker's speech rate without changing the speaker's pitch, scales rs[n] back to the original speech length and outputs ws[n]. Here, a pitch change means a change of the fundamental frequency and a speed change means a change of the speaking rate; rs[n] is the v...
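The time-scaling step can be sketched with a simplified WSOLA (waveform similarity overlap-add). This is an illustrative version only, not the patent's implementation: the function name `wsola`, the frame length, the search tolerance `tol`, the Hann window, and the unnormalized cross-correlation used as the similarity measure are all assumptions.

```python
import math


def wsola(x, target_len, frame=256, tol=64):
    """Time-scale x to target_len samples while keeping its local pitch."""
    if target_len <= 0:
        return []
    hop = frame // 2
    # Periodic Hann window; at 50% overlap neighbouring windows sum to 1.
    win = [0.5 - 0.5 * math.cos(2.0 * math.pi * i / frame) for i in range(frame)]
    pad = frame + tol
    xp = [0.0] * pad + [float(v) for v in x] + [0.0] * pad
    last_start = len(xp) - frame
    ratio = len(x) / target_len          # input samples consumed per output sample
    out = [0.0] * (target_len + frame)
    norm = [0.0] * (target_len + frame)
    prev = None
    for k in range((target_len + hop - 1) // hop):
        nominal = pad + round(k * hop * ratio)
        best = min(max(nominal, 0), last_start)
        if prev is not None:
            # Template: the natural continuation of the previously copied frame.
            t = min(prev + hop, last_start)
            best_score = -math.inf
            for delta in range(-tol, tol + 1):
                c = min(max(nominal + delta, 0), last_start)
                score = sum(xp[c + i] * xp[t + i] for i in range(frame))
                if score > best_score:
                    best, best_score = c, score
        start = k * hop
        for i in range(frame):
            out[start + i] += win[i] * xp[best + i]
            norm[start + i] += win[i]
        prev = best
    # Divide by the summed window so edge frames are not attenuated.
    return [out[n] / norm[n] if norm[n] > 1e-9 else 0.0 for n in range(target_len)]


# Compress a 2000-sample sine (period 50 samples) to 1000 samples.
x = [math.sin(2.0 * math.pi * n / 50 + 0.3) for n in range(2000)]
ws = wsola(x, 1000, frame=128, tol=32)
```

For the search to align waveforms reliably, `tol` should be at least half of the longest pitch period expected in the signal.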

Embodiment 2

[0043] Embodiment 2 can be regarded as a preferred example of Embodiment 1. The system for changing the voice pitch and timbre described in Embodiment 2 utilizes the steps of the method for changing the voice pitch and timbre described in Embodiment 1.

[0044] A voice changing system for changing voice pitch and timbre provided by the present invention comprises the following modules:

[0045] Module S1: according to the "fundamental frequency ratio adjustment factor b", the first speech data is resampled in the time domain to obtain the second speech data; the sequence length of the second speech data is b times the sequence length of the first speech data;

[0046] Module S2: a pitch-preserving algorithm is used to change the speech rate, scaling the speech length of the second speech data and outputting the third speech data;

[0047] Module S3: window and transform the third speech data to obtain a complex spectrum, perform polar coordinate conve...
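The visible part of Module S3, together with the cepstral envelope extraction named in the abstract, can be sketched as follows. All names here (`fft`, `analyze_frame`, `n_lifter`) are hypothetical, the frame length must be a power of two, and the split into "envelope" and "fine structure" by low-quefrency liftering is one common reading of the cepstrum method, not necessarily the patent's exact procedure. A small pure-Python FFT is included only to keep the example self-contained.

```python
import cmath
import math


def fft(a, inverse=False):
    """Recursive radix-2 FFT (unnormalized); len(a) must be a power of two."""
    n = len(a)
    if n == 1:
        return list(a)
    even = fft(a[0::2], inverse)
    odd = fft(a[1::2], inverse)
    sign = 1.0 if inverse else -1.0
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(sign * 2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out


def analyze_frame(frame, n_lifter=30):
    """Return (magnitude, phase, envelope, fine structure) for one frame."""
    n = len(frame)
    win = [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]
    spec = fft([w * s for w, s in zip(win, frame)])
    # Rectangular -> polar coordinates.
    mag = [abs(c) for c in spec]
    phase = [cmath.phase(c) for c in spec]
    # Real cepstrum: inverse transform of the log magnitude spectrum.
    log_mag = [math.log(m + 1e-12) for m in mag]
    ceps = [c.real / n for c in fft(log_mag, inverse=True)]
    # Keep only low quefrencies (symmetric lifter) to get the smooth envelope.
    lift = [ceps[q] if q < n_lifter or q > n - n_lifter else 0.0 for q in range(n)]
    envelope = [math.exp(c.real) for c in fft(lift)]
    # What remains after dividing out the envelope carries the harmonic detail.
    fine = [m / e for m, e in zip(mag, envelope)]
    return mag, phase, envelope, fine


frame = [math.sin(2 * math.pi * 5 * i / 64) + 0.5 * math.sin(2 * math.pi * 12 * i / 64)
         for i in range(64)]
mag, phase, envelope, fine = analyze_frame(frame, n_lifter=8)
```

The lifter cutoff trades smoothness against detail: it should stay below the quefrency of the pitch period so that the harmonics end up in the fine structure rather than in the envelope.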

Abstract

The invention provides a voice changing method and system for changing voice pitch and timbre. The method comprises the following steps: resampling first voice data in the time domain according to a fundamental frequency ratio adjustment factor b to obtain second voice data, wherein the sequence length of the second voice data is b times the sequence length of the first voice data; changing the speech rate with a pitch-preserving algorithm, scaling the speech length of the second voice data, and outputting third voice data; windowing and transforming the third voice data to obtain a complex spectrum, converting the complex spectrum to polar coordinates to obtain an amplitude spectrum and a phase spectrum, performing a cepstrum transform on the amplitude spectrum, and extracting a spectral envelope and a fundamental frequency spectrum; and, according to a formant ratio adjustment factor f, adjusting the spectral envelope to synthesize a new amplitude spectrum, combining the new amplitude spectrum with the phase spectrum, converting from polar to rectangular coordinates, performing an IFFT, and applying window compensation to generate new fourth voice data. The invention solves the problem of independently adjusting the fundamental frequency and the formants.
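The envelope adjustment and polar-to-rectangular step described in the abstract can be sketched as below. The source does not say how the envelope is adjusted by f, so this sketch assumes a linear warp of the frequency axis in which f greater than 1 moves formants toward higher frequencies; the names `warp_envelope` and `synthesize_spectrum` are hypothetical. The inputs are assumed to be full-length (two-sided) spectra of a real frame: an envelope, a fine-structure (fundamental frequency) spectrum, and a phase spectrum.

```python
import cmath


def warp_envelope(envelope, f):
    """Stretch the envelope along the frequency axis by factor f (assumed convention)."""
    n = len(envelope)
    half = n // 2
    warped = [0.0] * n
    for k in range(half + 1):
        src = k / f                      # bin in the original envelope that lands on k
        i = int(src)
        if i >= half:
            warped[k] = envelope[half]   # hold the edge value beyond the Nyquist bin
        else:
            frac = src - i
            warped[k] = (1.0 - frac) * envelope[i] + frac * envelope[i + 1]
    # Mirror onto the negative-frequency bins so the spectrum stays Hermitian.
    for k in range(half + 1, n):
        warped[k] = warped[n - k]
    return warped


def synthesize_spectrum(envelope, fine, phase, f):
    """New magnitude = warped envelope * fine structure; then polar -> rectangular."""
    new_env = warp_envelope(envelope, f)
    return [cmath.rect(e * x, p) for e, x, p in zip(new_env, fine, phase)]


# Toy envelope with one "formant" peak at bin 10 of a 64-bin spectrum.
n = 64
env = [0.0] * n
for k in range(n // 2 + 1):
    env[k] = max(0.1, 1.0 - abs(k - 10) / 5.0)
for k in range(n // 2 + 1, n):
    env[k] = env[n - k]
spec = synthesize_spectrum(env, [1.0] * n, [0.0] * n, 2.0)   # peak moves to bin 20
```

Because only the envelope is warped and the fine structure is left untouched, the harmonic spacing (and therefore the fundamental frequency) stays where it was. The resulting complex spectrum would then go through the IFFT and window compensation named in the abstract.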

Description

Technical field

[0001] The present invention relates to the technical field of voice timbre adjustment, in particular to a voice changing method and system for changing voice pitch and timbre.

Background

[0002] The fundamental frequency and the formants are very important features of speech. The fundamental frequency is the frequency at which the vocal cords vibrate when producing voiced sounds. It is directly related to the gender of the speaker: generally, the fundamental frequency of a male voice is relatively low and that of a female voice is relatively high, and the fundamental frequency of the elderly is lower than that of young people. The formants are the resonant frequencies of the glottal wave in the vocal tract; the longer the vocal tract, the lower the formant frequencies, and the male vocal tract is longer than the female vocal tract. Most of the existing voice changing schemes cannot adjust the fundamental frequency and formant ...

Application Information

IPC(8): G10L21/007, G10L21/013, G10L21/043
CPC: G10L21/007, G10L21/013, G10L21/043, G10L2021/0135
Inventor: 邓海峰, 林立, 曹烈安, 张鹏飞
Owner: SHANGHAI YINGZHUO INFORMATION TECH CO LTD