Voice extracting method and system and voice audio playing method and device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An extraction method and technology of human voice, applied in voice analysis, instruments, etc., to simplify the amount of pre-processing data and achieve simple effects

Active Publication Date: 2014-10-01

ZTE CORP

View PDF12 Cites 13 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The present invention provides a human voice extraction method, system, and human voice audio playback method and device to solve the technical problem of how to easily extract human voice from mixed audio

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0041] In order to make the purpose, technical solution and advantages of the present invention more clear, the embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined arbitrarily with each other.

[0042] figure 1 It is a flow chart of the human voice extraction method in this embodiment.

[0043] S101 extracting a sound signal in which human voice and background sound co-occur from the beginning of the original sound signal as a sample;

[0044] For example, you can read a section of sound about 10s from the beginning of the original sound signal, and separate the part where the human voice and the background sound co-occur and the part where only the background sound appears; The next 10s can be read until the human voice is found;

[0045] S102 detects the main pi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice extracting method and system and a voice audio playing method and device. The voice extracting method comprises the steps of extracting sound signals where voice and background sound exist jointly from the beginning position of original acoustical signals as a sample, detecting tonic tone from the sample, using the tonic tone as the reference frequency, and comparing the pitch frequency of sound belonging to the same sound source in the sound portion, except the sample, of the original acoustical signals with the reference frequency to determine whether the sound source is voice. The voice extracting method can conveniently extract voice from mixed voice frequency.

Description

technical field [0001] The invention relates to the field of mixed audio separation and extraction, in particular to a human voice extraction method and system, and a human voice audio playback method and device. Background technique [0002] In order to extract human voice from audio such as binaural stereo and enhance it to achieve the purpose of clearer speech and effective noise reduction, a sound separation technology that can extract single audio from mixed audio is needed. The current technology that can meet this requirement is mainly an audio separation technology based on Computational Auditory Scene Analysis (CASA). [0003] Auditory Scene Analysis (ASA) technology, the auditory system uses various characteristics of sound (time domain, frequency domain, spatial position, etc.) to decompose a mixed sound signal into multiple signals, and each signal belongs to a different physical sound source. Computational Auditory Scene Analysis (CASA) technology uses compute...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/0272G10L25/51G10L25/78

CPCG10L21/028G10L21/0272G10L25/51

Inventor 佘海波王进军刘书昌张欣

Owner ZTE CORP

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice extracting method and system and voice audio playing method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology