Voice changing method and system for changing voice tones and timbres
This pitch and voice technology, applied in the field of voice timbre adjustment, addresses the poor voice-changing results of existing approaches, which do not adjust the fundamental frequency and the formants independently.
Examples
Embodiment 1
[0033] The present invention changes the fundamental frequency in the time domain using resampling and WSOLA, then extracts the spectral envelope by the cepstrum method and uses the spectral envelope (the vocal tract system function) to move the formants without changing the fundamental frequency. Specifically, it is implemented through the following steps:
[0034] First, according to the required "fundamental frequency ratio adjustment factor b", the speech data x[n] is resampled in the time domain to obtain the speech data rs[n], whose sequence length is b times the sequence length of x[n].
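The resampling step in [0034] can be illustrated with a minimal sketch. It assumes NumPy and plain linear interpolation; the function name `resample_by_factor`, the test tone, and the value b = 0.8 are illustrative choices, not taken from the patent, which does not name an interpolation method.

```python
import numpy as np

def resample_by_factor(x, b):
    """Resample x so the result has round(b * len(x)) samples.

    Linear interpolation is an assumption made for brevity.
    """
    x = np.asarray(x, dtype=float)
    n_out = int(round(b * len(x)))
    # Fractional positions in x at which each output sample is read.
    src_pos = np.linspace(0.0, len(x) - 1, num=n_out)
    return np.interp(src_pos, np.arange(len(x)), x)

fs = 16000                              # hypothetical sampling rate in Hz
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 200.0 * t)       # 1 s test tone at 200 Hz
rs = resample_by_factor(x, b=0.8)       # rs[n] has 0.8 times as many samples
```

Played back at the original sampling rate, a signal whose length has been scaled by b has every frequency, including the fundamental, divided by b, and its duration changes by the same factor b.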
[0035] Second, WSOLA or a similar pitch-preserving algorithm is applied. Such algorithms change the speaker's speech rate without changing the speaker's pitch, so rs[n] can be scaled back to the original speech length, giving the output ws[n]. Here, a change in pitch means a change in the fundamental frequency, and a change in speed means a change in speaking rate; rs[n] is the v...
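As a rough illustration of the time-scaling step in [0035], the sketch below implements a basic WSOLA loop in NumPy. The frame size, search tolerance, Hann window, and the function name `wsola` are assumptions for this example; the patent text shown here does not specify them, and a production implementation would differ in many details.

```python
import numpy as np

def wsola(x, target_len, frame=512, tol=128):
    """Time-scale x to target_len samples while keeping its pitch (basic WSOLA)."""
    x = np.asarray(x, dtype=float)
    hop = frame // 2                        # synthesis hop, 50% overlap
    win = np.hanning(frame + 1)[:frame]     # periodic Hann window
    ratio = len(x) / target_len             # input samples per output sample
    n_frames = max(1, -(-(target_len - frame) // hop) + 1)
    # Zero-pad so every candidate segment stays inside the buffer.
    xp = np.concatenate([np.zeros(tol), x, np.zeros(frame + hop + tol)])
    out = np.zeros((n_frames - 1) * hop + frame)
    norm = np.zeros_like(out)
    prev = 0                                # start of the last copied segment in x
    for k in range(n_frames):
        ideal = int(round(k * hop * ratio)) # nominal read position in x
        if k == 0:
            best = 0
        else:
            # Natural continuation of the previously copied segment.
            template = xp[tol + prev + hop: tol + prev + hop + frame]
            # Candidates start at ideal - tol .. ideal + tol in x coordinates.
            region = xp[ideal: ideal + 2 * tol + frame]
            corr = np.correlate(region, template, mode="valid")
            best = ideal - tol + int(np.argmax(corr))
        out[k * hop: k * hop + frame] += win * xp[tol + best: tol + best + frame]
        norm[k * hop: k * hop + frame] += win
        prev = best
    return (out / np.maximum(norm, 1e-8))[:target_len]

fs = 16000                                        # hypothetical sampling rate in Hz
rs = np.sin(2 * np.pi * 250.0 * np.arange(12800) / fs)  # stand-in for rs[n]
ws = wsola(rs, target_len=16000)                  # back to the original length
```

The search picks, near each nominal read position, the segment that best matches the natural continuation of the previous one, which is what keeps the waveform periodic across frame boundaries. In this example the 250 Hz stand-in for rs[n] is stretched from 12800 to 16000 samples while its fundamental stays near 250 Hz.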
Embodiment 2
[0043] Embodiment 2 can be regarded as a preferred example of Embodiment 1. The system for changing the voice pitch and timbre described in Embodiment 2 utilizes the steps of the method for changing the voice pitch and timbre described in Embodiment 1.
[0044] The voice changing system for changing voice pitch and timbre provided by the present invention comprises the following modules:
[0045] Module S1: according to the required "fundamental frequency ratio adjustment factor b", the first speech data is resampled in the time domain to obtain the second speech data, whose sequence length is b times the sequence length of the first speech data;
[0046] Module S2: using a pitch-preserving algorithm to change the speech rate, scaling the speech length of the second speech data, and outputting the third speech data;
[0047] Module S3: transform the third speech data with windowing to obtain a complex spectrum, perform polar coordinate conve...
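The text of Module S3 is cut off here, so the following is only a sketch of the two operations it names (a windowed transform to a complex spectrum and conversion to polar form), followed by the cepstrum-based spectral envelope extraction mentioned in [0033]. It assumes NumPy, a Hann window, a single 1024-sample frame, and a lifter of 30 cepstral coefficients; the names `to_polar` and `cepstral_envelope` and all parameter values are hypothetical.

```python
import numpy as np

def to_polar(frame):
    """Window one frame, transform it, and return (magnitude, phase)."""
    spectrum = np.fft.rfft(frame * np.hanning(len(frame)))  # complex spectrum
    return np.abs(spectrum), np.angle(spectrum)

def cepstral_envelope(magnitude, n_fft, n_lifter=30):
    """Smooth a magnitude spectrum by keeping only low-quefrency cepstral terms."""
    log_mag = np.log(magnitude + 1e-10)        # small offset avoids log(0)
    cep = np.fft.irfft(log_mag, n=n_fft)       # real cepstrum
    lifter = np.zeros(n_fft)
    lifter[:n_lifter] = 1.0                    # keep low quefrencies ...
    if n_lifter > 1:
        lifter[-(n_lifter - 1):] = 1.0         # ... and their symmetric mirror
    return np.exp(np.fft.rfft(cep * lifter).real)

fs, n_fft, f0 = 16000, 1024, 200.0             # hypothetical values
t = np.arange(n_fft) / fs
# Harmonic test frame: 20 partials of a 200 Hz fundamental.
frame = sum(np.sin(2 * np.pi * k * f0 * t) / k for k in range(1, 21))

mag, phase = to_polar(frame)
env = cepstral_envelope(mag, n_fft)
rebuilt = mag * np.exp(1j * phase)             # polar form maps back to the complex spectrum
```

Because the lifter cutoff (30 samples) lies below the pitch period of the test frame (80 samples at 16 kHz), the harmonic ripple is removed from `env` and only the slowly varying envelope remains.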