Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice extracting method and system and voice audio playing method and device

An extraction method and technology of human voice, applied in voice analysis, instruments, etc., to simplify the amount of pre-processing data and achieve simple effects

Active Publication Date: 2014-10-01
ZTE CORP
View PDF12 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention provides a human voice extraction method, system, and human voice audio playback method and device to solve the technical problem of how to easily extract human voice from mixed audio

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice extracting method and system and voice audio playing method and device
  • Voice extracting method and system and voice audio playing method and device
  • Voice extracting method and system and voice audio playing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the purpose, technical solution and advantages of the present invention more clear, the embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined arbitrarily with each other.

[0042] figure 1 It is a flow chart of the human voice extraction method in this embodiment.

[0043] S101 extracting a sound signal in which human voice and background sound co-occur from the beginning of the original sound signal as a sample;

[0044] For example, you can read a section of sound about 10s from the beginning of the original sound signal, and separate the part where the human voice and the background sound co-occur and the part where only the background sound appears; The next 10s can be read until the human voice is found;

[0045] S102 detects the main pi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice extracting method and system and a voice audio playing method and device. The voice extracting method comprises the steps of extracting sound signals where voice and background sound exist jointly from the beginning position of original acoustical signals as a sample, detecting tonic tone from the sample, using the tonic tone as the reference frequency, and comparing the pitch frequency of sound belonging to the same sound source in the sound portion, except the sample, of the original acoustical signals with the reference frequency to determine whether the sound source is voice. The voice extracting method can conveniently extract voice from mixed voice frequency.

Description

technical field [0001] The invention relates to the field of mixed audio separation and extraction, in particular to a human voice extraction method and system, and a human voice audio playback method and device. Background technique [0002] In order to extract human voice from audio such as binaural stereo and enhance it to achieve the purpose of clearer speech and effective noise reduction, a sound separation technology that can extract single audio from mixed audio is needed. The current technology that can meet this requirement is mainly an audio separation technology based on Computational Auditory Scene Analysis (CASA). [0003] Auditory Scene Analysis (ASA) technology, the auditory system uses various characteristics of sound (time domain, frequency domain, spatial position, etc.) to decompose a mixed sound signal into multiple signals, and each signal belongs to a different physical sound source. Computational Auditory Scene Analysis (CASA) technology uses compute...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L21/0272G10L25/51G10L25/78
CPCG10L21/028G10L21/0272G10L25/51
Inventor 佘海波王进军刘书昌张欣
Owner ZTE CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products