Voice signal conversion method and device, equipment and storage medium

A voice signal technology, applied in voice analysis, instruments, etc., that can solve problems such as formant envelope error, sound error, and poor voice signal quality.

Active Publication Date: 2020-07-07

BIGO TECH PTE LTD



Problems solved by technology

[0003] When the original voice is processed by a pitch-shifting algorithm, the pitch is adjusted as intended, but the speaker's voice characteristics may also change, so that the played voice differs considerably from the speaker's actual voice. For example, when a male voice signal is raised by 4 semitones, it sounds like a girl's voice, which introduces a certain sound error.
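As a rough illustration of the 4-semitone example, the sketch below computes the frequency ratio of a pitch shift, assuming equal temperament and a naive shift that scales the whole spectrum (formants included) by the same factor. The function name `semitone_ratio` and the 500 Hz formant value are hypothetical and do not come from the patent.

```python
def semitone_ratio(semitones: float) -> float:
    """Frequency ratio of a shift by `semitones` in equal temperament."""
    return 2.0 ** (semitones / 12.0)


# Assumed example: a formant near 500 Hz in the original male voice.
original_formant_hz = 500.0
ratio = semitone_ratio(4)

# A naive shift moves the formant together with the pitch,
# which is what changes the perceived voice character.
shifted_formant_hz = original_formant_hz * ratio
print(round(ratio, 3), round(shifted_formant_hz, 1))
```

Under these assumptions the formant moves by roughly 26 percent, which is why the shifted voice no longer sounds like the same speaker.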

[0004] At present, a fixed-length window function is usually applied directly to obtain the formant envelope of the speech signal before and after pitch shifting. Because formant positions and their variation differ from one speech signal to another, the formant envelope obtained in this way contains a certain error, resulting in poor quality of the final voice signal.

Examples

Embodiment 1

[0072] Figure 1A is a flowchart of a voice signal conversion method provided by Embodiment 1 of the present invention. This embodiment can be applied to any device capable of modulating a voice signal. The technical solutions in the embodiments of the present invention apply to keeping the sound features of a speech signal consistent before and after pitch modification. The voice signal conversion method provided in this embodiment can be performed by the voice signal conversion device provided in the embodiments of the present invention; the device can be implemented in software and/or hardware and integrated into the equipment that executes the method. The equipment can be an intelligent terminal configured with any application program capable of modulating a voice signal, such as a smartphone, tablet, or handheld computer.

[0073] Specifically, referring to Figure 1A, the method may include the following steps:

[...

Embodiment 2

[0098] Figure 2 is a schematic diagram of the principle of the fundamental frequency detection and window function construction process in the method provided by Embodiment 2 of the present invention. This embodiment is optimized on the basis of the foregoing embodiments. Specifically, this embodiment explains in detail how the fundamental frequency is detected for each segmented original frequency domain signal (obtained by segmenting the original speech signal and applying the Fourier transform), and how the original segmented window function corresponding to each segmented original frequency domain signal and the target segmented window function corresponding to each segmented target frequency domain signal are constructed.
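One way to picture a per-segment window, as opposed to the fixed-length window criticised in the background, is to tie the window length to the detected fundamental frequency so that it spans about one harmonic spacing. The sketch below is only an assumption for illustration: the visible text does not give the patent's construction rule, and the names `window_length_bins` and `hann_window`, as well as the choice of a Hann shape, are hypothetical.

```python
import math


def window_length_bins(f0_hz: float, sample_rate: float, fft_size: int) -> int:
    """Odd window length (in FFT bins) covering about one harmonic spacing.

    Assumption: one harmonic spacing equals f0_hz, and each FFT bin
    spans sample_rate / fft_size Hz.
    """
    bin_hz = sample_rate / fft_size
    length = max(3, round(f0_hz / bin_hz))
    # Force an odd length so the window has a well-defined centre bin.
    return length if length % 2 == 1 else length + 1


def hann_window(length: int) -> list:
    """Symmetric Hann window of the given length (length >= 3)."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (length - 1))
            for i in range(length)]


# Assumed example values: 200 Hz voice, 16 kHz sampling, 1024-point FFT.
window = hann_window(window_length_bins(200.0, 16000.0, 1024))
```

With such a rule, a segment with a higher fundamental frequency gets a wider smoothing window than a segment with a lower one, so the harmonics are averaged out in both cases.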

[0099] Optionally, the method in this embodiment may specifically include the following steps:

[0100] S201. Acquire an original voice signal.

[0101] S202. Transpose the original speech signal to obtain an initial target sp...

Embodiment 3

[0123] Figure 3 is a schematic diagram of the principle of the speech signal conversion process provided by Embodiment 3 of the present invention. This embodiment is optimized on the basis of the foregoing embodiments. Specifically, this embodiment explains in detail how the Fourier transform is performed on the speech signal segments and how the pitch-modified speech signal is determined.

[0124] Optionally, this embodiment may specifically include the following steps:

[0125] S310. Acquire an original voice signal.

[0126] S320. Transpose the original voice signal to obtain an initial target voice signal.

[0127] S330. Segment the original speech signal and the initial target speech signal according to the preset segment length and segment displacement, to obtain segmented original speech signals and segmented target speech signals.

[0128] Optionally, in this embodiment, when segmenting the original speech signal and the initial targe...
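The segmentation in step S330 can be sketched as a plain framing routine that applies the same segment length and segment displacement to both signals. This is a minimal sketch under stated assumptions: the function name `segment_signal`, the sample values, and the choice to drop a trailing partial segment are illustrative and are not taken from the patent.

```python
def segment_signal(samples, segment_length, segment_shift):
    """Split `samples` into overlapping segments.

    segment_length: number of samples per segment (preset segment length).
    segment_shift:  number of samples between segment starts
                    (preset segment displacement).
    A trailing partial segment is dropped (an assumption of this sketch).
    """
    if segment_length <= 0 or segment_shift <= 0:
        raise ValueError("segment_length and segment_shift must be positive")
    segments = []
    start = 0
    while start + segment_length <= len(samples):
        segments.append(samples[start:start + segment_length])
        start += segment_shift
    return segments


# Placeholder sample values standing in for the two speech signals.
original = list(range(10))
initial_target = [2 * x for x in original]

original_segments = segment_signal(original, 4, 2)
target_segments = segment_signal(initial_target, 4, 2)
```

Using identical parameters for both signals keeps the segments aligned, so each segmented original signal has a segmented target signal at the same position.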


Abstract

The invention discloses a voice signal conversion method and device, equipment and a storage medium. The method comprises the following steps: respectively segmenting an original voice signal and an initial target voice signal obtained by modifying the tone of the original voice signal, and then carrying out Fourier transform to obtain a segmented original frequency domain signal and a segmented target frequency domain signal; filtering the segmented original frequency domain signal through an original segmented window function to obtain a corresponding original formant envelope, and filtering the segmented target frequency domain signal through a target segmented window function to obtain a corresponding target formant envelope; and determining a tone-changing voice signal according to the segmented target frequency domain signal, the original formant envelope and the target formant envelope. According to the technical scheme provided by the embodiment of the invention, the influence of the target formant envelope on tone modification is eliminated, so the same formant envelope exists before and after tone modification, the consistency of sound characteristics in the voice signals before and after tone modification is ensured, and the voice quality of the tone-modified voice signals is improved.
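The envelope steps in the abstract can be sketched for a single segment as follows. This is an illustration under assumptions, not the patented procedure: the envelope is estimated by a normalised weighted average of the magnitude spectrum, and the target spectrum is rescaled by the ratio of original to target envelope so that the shifted segment carries the original envelope. The names `smooth_envelope` and `restore_envelope` and the rescaling rule are hypothetical.

```python
def smooth_envelope(magnitudes, window):
    """Estimate an envelope by filtering a magnitude spectrum with a window.

    Bins beyond the spectrum edges are clamped to the nearest valid bin.
    """
    half = len(window) // 2
    norm = sum(window)
    n = len(magnitudes)
    envelope = []
    for k in range(n):
        acc = 0.0
        for j, weight in enumerate(window):
            idx = min(max(k + j - half, 0), n - 1)
            acc += weight * magnitudes[idx]
        envelope.append(acc / norm)
    return envelope


def restore_envelope(target_spectrum, original_env, target_env, eps=1e-12):
    """Divide out the target envelope and apply the original one, bin by bin.

    Assumed correction rule: Y[k] = X_target[k] * E_orig[k] / E_target[k].
    """
    return [x * (e_orig / max(e_tgt, eps))
            for x, e_orig, e_tgt in zip(target_spectrum, original_env, target_env)]


# Toy spectra with made-up values for one segment.
target_spectrum = [1 + 1j, 2 + 0j]
corrected = restore_envelope(target_spectrum, [2.0, 4.0], [1.0, 2.0])
```

In a full pipeline each corrected segment would be transformed back to the time domain and recombined with its neighbours; that part is omitted here.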

Description

Technical field

[0001] Embodiments of the present invention relate to the technical field of voice recognition, and in particular to a voice signal conversion method, device, equipment, and storage medium.

Background

[0002] With the rapid development of Internet technology, entertainment software that changes the pitch of an original voice through a pitch shift algorithm has become widely used in people's daily life, providing users with a new [...] For example, when the original recording of a singer is edited, the pitch of flawed passages is changed to make the song more polished.

[0003] When the original voice is processed by a pitch-shifting algorithm, the pitch is adjusted as intended, but the speaker's voice characteristics may also change, so that the played voice differs considerably from the speaker's actual voice. When a male voice signal is raised by 4 semitones, it will sound lik...


Application Information


IPC(8): G10L21/013

CPC: G10L21/013, G10L2021/0135, G10L21/034, G10L25/90

Inventor 吴晓婕

Owner BIGO TECH PTE LTD
