Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Acoustic scene identification method based on data enhancement

An identification method and scene technology, applied in the field of audio signal processing and deep learning, can solve the problems of high cost of manual labeling, high cost of manual labeling data, insufficient labeled training data, etc.

Active Publication Date: 2019-07-05
SOUTH CHINA UNIV OF TECH
View PDF15 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, manual labeling is mainly used to increase the diversity of labeled data, but the cost of manual labeling is very high
In addition, in different classification tasks, data samples need to be relabeled, which is not universal, making the cost of manual labeling even higher
In order to overcome the problems of high cost of manually labeled data and insufficient labeled training data, data enhancement methods are urgently needed to increase the diversity of limited labeled audio data, thereby improving the adaptability and generalization ability of complex classifiers

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Acoustic scene identification method based on data enhancement
  • Acoustic scene identification method based on data enhancement
  • Acoustic scene identification method based on data enhancement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0065] This embodiment discloses a specific implementation process of a sound scene recognition method based on data enhancement, such as figure 1 As shown, the specific steps of the sound scene recognition method are as follows:

[0066] S1. Audio sample preparation: use recording equipment to collect audio samples in different acoustic scenes, and manually mark them, and then divide the above audio samples into training sets and test sets.

[0067] In this embodiment, this step specifically includes the following steps:

[0068] S1.1. Use recording equipment to collect audio data: place recording equipment in different scenes, and record audio samples of corresponding scenes. The sampling frequency is 16kHz, and the number of quantization bits is 16bit.

[0069] S1.2. Divide the dataset: randomly divide the labeled audio samples into disjoint training sets and test sets, where the training set accounts for about 80% and the test set accounts for about 20%.

[0070] S2. Pre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an acoustic scene identification method based on data enhancement. The method comprises the following steps: firstly, collecting and marking audio samples of different sound scenes; then preprocessing is carried out, and pre-emphasis, framing and windowing processing are carried out on the audio samples; data enhancement is then performed, extracting a harmonic source and an impact source of each audio sample to obtain more sufficient audio samples, extracting logarithmic Mel filter bank characteristics from the audio samples and the harmonic sources and the impact sources of the audio samples, stacking the three characteristics into a three-channel high-dimensional characteristic, and constructing more abundant training samples by adopting a hybrid enhancement technology; and finally, inputting the three-channel high-dimensional features into an Xception network for judgment, and identifying the sound scene corresponding to each audio sample. According to the data enhancement method, the generalization capability of the Xception network classifier can be effectively improved, and the training process of the network is stabilized. When the acoustic scene isidentified, the method can obtain a better identification effect.

Description

technical field [0001] The present invention relates to the technical fields of audio signal processing and deep learning, in particular to a sound scene recognition method based on data enhancement. Background technique [0002] Audio signals contain rich information and have the advantages of being non-contact and natural. A sound scene is a high-level representation of an audio signal at the semantic level. The task of acoustic scene recognition is to associate semantic labels with audio streams to identify categories of sound-producing environments. This technology enables smart devices to perceive the surrounding environment based on sound, so as to make appropriate decisions. At present, there is a massive increase in audio data. Due to the time-consuming and labor-intensive manual labeling of data, there are very few audio samples with accurate labels. Unlabeled audio samples cannot be directly used to train a classifier. How to construct more diverse training dat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G10L21/0208G10L25/03G10L25/27G10L25/45
CPCG10L21/0208G10L25/03G10L25/27G10L25/45G06F18/24G06F18/214
Inventor 李艳雄张聿晗王武城刘名乐
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products