Human voice separation method and device, user terminal and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A separation device and user terminal technology, applied in speech analysis, electro-acoustic musical instruments, instruments, etc., can solve the problems of audio quality degradation and audio auditory effect, achieve good auditory effect, save time and labor costs, and accuracy high effect

Inactive Publication Date: 2019-08-23

成都嗨翻屋科技有限公司

View PDF8 Cites 9 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Currently, extracting karaoke music is done during the recording process, which requires a lot of manual work and time

[0003] Most of the existing deep learning technologies for human voice separation improve the separation effect at the cost of reducing the sampling rate and reducing the number of channels. After separation, the audio quality is degraded and the auditory effect of the audio is reduced.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0070] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. The components of the embodiments of the invention generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations.

[0071] Accordingly, the following detailed description of the embodiments of the invention provided in the accompanying drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art wi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a human voice separation method and device, a user terminal and a storage medium, and relates to the technical field of audio processing. The human voice separation method comprises the steps: a sampled to-be-separated audio file sound channel is separated to obtain an initial waveform sequence; the initial waveform sequence is subjected to discrete Fourier transform to obtain an initial two-dimensional array; the initial two-dimensional array is subjected to module obtaining to obtain an initial speech spectrum; the initial two-dimensional array is subjected to phase obtaining to obtain an initial phase image; the initial speech spectrum is guided into a convolutional neural network model to obtain a mask through calculation; the mask and the initial phase image are subjected to first point multiplication operation to obtain a human voice source speech spectrum; the human voice source speech spectrum and the initial phase image are subjected to second point multiplication operation; the result of the second point multiplication operation is subjected to inverse discrete Fourier transform to obtain single human voice source audio waveforms; and the single human voice source audio waveforms are spliced to obtain stereo audio. According to the human voice separation method and device, the user terminal and the storage medium, automatic human voice separation of the audio can be achieved.

Description

technical field [0001] The present invention relates to the technical field of audio processing, in particular to a human voice separation method, device, user terminal and storage medium. Background technique [0002] Usually for popular music, the human voice is the main theme, and the accompaniment is the rhythm of the music. Since the human voice is usually accompanied by background music, vocal separation is a challenging task. A prerequisite for musical instrument classification, and these techniques can be used in applications such as recommendation systems and label classification. One of the commercial applications of the vocal separation system is karaoke, meaning a musical track without the vocals. Karaoke music helps music lovers learn to sing an existing piece or sing it in a concert. Currently, extracting karaoke music is done during the recording process, which requires a lot of manual operations and time. [0003] Most of the existing deep learning technol...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/028G10L25/30G10H1/36

CPCG10H1/361G10L21/028G10L25/30

Inventor 尹学渊江天宇陈洪宇梁超

Owner 成都嗨翻屋科技有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Human voice separation method and device, user terminal and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology