Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Audio characteristic classification method based on variable duration

A classification method and audio feature technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of large detection delay and inconsistent stationary period, so as to avoid the delay problem and realize the effect of real-time classification.

Inactive Publication Date: 2014-01-01
TSINGHUA UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the short-term stationarity of audio signals, long-term features are generally more stable and distinguishable than short-term features, but the disadvantage is that the detection delay is large, which has certain limitations for real-time classification systems.
In addition, the stable periods exhibited by different features may be inconsistent, and it may not be optimal to calculate the corresponding long-term features under the same time length for these features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio characteristic classification method based on variable duration
  • Audio characteristic classification method based on variable duration
  • Audio characteristic classification method based on variable duration

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The preferred embodiments will be described in detail below in conjunction with the accompanying drawings. It should be emphasized that the following description is only exemplary and not intended to limit the scope of the invention and its application.

[0032] The present invention is described by taking speech / music signal classification at a sampling rate of 32kHz as an example. For the classification of other types of audio signals, the present invention is still applicable.

[0033] figure 1 It is a flow chart of the audio feature classification method based on variable duration. figure 1 In , the audio feature classification method based on variable duration includes the following steps:

[0034] Step 1: Use the type-determined and labeled audio sequences as training sequences.

[0035] Step 2: Extract the short-term feature F of the audio signal in the training sequence 1 , F 2 ,...,F K , forming a short-term eigenvector , K is the number of components ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an audio characteristic classification method based on variable duration in a multimedia signal processing and mode identification technology field. The method comprises the following steps: taking a marked audio sequence whose type is determined as a training sequence; extracting short time characteristics of an audio signal in the training sequence so as to form a short time characteristic vector; calculating a statistical parameter of the each short time characteristic in setting duration so as to acquire a statistical characteristic vector corresponding to the short time characteristic vector; calculating a group of the statistical characteristic vectors corresponding to the short time characteristic vector, and forming a long time characteristic vector of the training sequence by the group of the statistical characteristic vectors; using the long time characteristic vector of the training sequence to train a classifier; extracting a short time characteristic of an ist frame audio signal in a test sequence and calculating an ist frame input long time characteristic vector of the test sequence; sending the ist frame input long time characteristic vector into the trained classifier so as to obtain a classification type. By using the method of the invention, a time-delay problem caused by long time characteristic extraction can be avoided and real time classification of the audio characteristic can be realized.

Description

technical field [0001] The invention belongs to the technical field of multimedia signal processing and pattern recognition, and in particular relates to a variable duration-based audio feature classification method. Background technique [0002] With the continuous development of communication technology, digital audio processing has been widely used in many fields such as mobile communication, Internet, broadcasting and personal electronics. From the perspective of audio codec technology, it has gradually expanded from the traditional narrowband speech-based speech coding to multimedia audio coding with higher bandwidth expansion and quality. There are higher requirements for channel adaptability, transmission reliability, and codec quality. Regardless of audio codec or sound effect editing and production, the diversity of the audio signal itself makes it possible to select different processing techniques for different types of audio signals. For example, G.718 and G.729...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/08
Inventor 卢敏窦维蓓
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products