Method and device for classifying voice data

An audio data classification technique, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc., that addresses the slow speed, heavy computation, and low accuracy of audio data classification.

Status: Inactive; Publication Date: 2015-03-25
BEIJING QIYI CENTURY SCI & TECH CO LTD
Cites: 2; Cited by: 20

AI Technical Summary

Problems solved by technology

Because the extracted feature vectors usually have 39 or more dimensions, training a GMM model on them requires a large amount of labeled data, and labeling that data is labor-intensive. The labeled data actually available is therefore relatively small, which leads to data sparseness and low audio data classification accuracy.
In addition, the high dimension of the feature vectors makes the training computation large, so training is slow and audio data classification is slow as well.
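One way to see why 39-dimensional features demand so much labeled data is to count the free parameters a GMM has to estimate. The sketch below uses the standard parameter counts for a Gaussian mixture; the component count (64) and the codebook size (256) are illustrative assumptions, not values taken from the patent.

```python
def gmm_free_parameters(dim, components, full_covariance=True):
    """Count the free parameters of a Gaussian mixture model.

    Each component has a mean (dim values) and a covariance matrix:
    dim * (dim + 1) / 2 values if full (symmetric), dim values if diagonal.
    The mixture weights sum to 1, so only components - 1 of them are free.
    """
    mean_params = dim
    cov_params = dim * (dim + 1) // 2 if full_covariance else dim
    weight_params = components - 1
    return components * (mean_params + cov_params) + weight_params


def histogram_free_parameters(bins):
    """A normalised histogram over `bins` codewords has bins - 1 free values."""
    return bins - 1


DIM = 39             # feature dimension quoted in the text
COMPONENTS = 64      # assumption: hypothetical number of mixture components
CODEBOOK_SIZE = 256  # assumption: hypothetical vector-quantization codebook size

print("full-covariance GMM:", gmm_free_parameters(DIM, COMPONENTS, True))
print("diagonal GMM:       ", gmm_free_parameters(DIM, COMPONENTS, False))
print("histogram template: ", histogram_free_parameters(CODEBOOK_SIZE))
```

Under these assumed sizes, a per-category histogram template has far fewer values to estimate than a per-category GMM, which is consistent with the data sparseness concern described above.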

Embodiment Construction

[0084] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0085] Figure 1 shows an implementation flowchart of an audio data classification method according to an embodiment of the present invention. The method includes the following steps:

[0086] Step S101, obtaining the first audio data of the category to be identified;

[0087] To classify the first audio data, the electronic device first obtains the first audio data whose category is to be identified. Wherein, the first audio data of the category to be i...

Abstract

An embodiment of the invention discloses a method and device for classifying voice data. The method comprises: obtaining first voice data whose category is to be recognized; windowing the first voice data along its time axis according to a preset windowing algorithm; extracting an MFCC feature vector from the voice data in each window; vector-quantizing each MFCC feature vector to obtain a one-dimensional first feature value; computing a first histogram of the first voice data from all the first feature values according to a preset histogram algorithm; comparing the first histogram with preset histogram feature templates, one for each voice category, to find the first histogram feature template most similar to the first histogram; and taking the voice category of that template as the voice category of the first voice data. Compared with the prior art, this technical scheme improves both the accuracy and the speed of voice data classification.
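The quantization, histogram, and template-matching steps of the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: it assumes the per-window MFCC vectors have already been extracted, that a codebook is already available, and that histogram intersection is the similarity measure (the abstract only says "similarity calculation"). All function names, the toy 2-dimensional vectors, and the category labels are hypothetical.

```python
import math
from collections import Counter


def quantize(vector, codebook):
    """Vector quantization: map a feature vector to the index of its
    nearest codeword (Euclidean distance), giving a one-dimensional value."""
    return min(range(len(codebook)),
               key=lambda k: math.dist(vector, codebook[k]))


def histogram(indices, n_bins):
    """Normalised histogram of codeword indices (entries sum to 1)."""
    if not indices:
        raise ValueError("need at least one quantized feature value")
    counts = Counter(indices)
    return [counts.get(k, 0) / len(indices) for k in range(n_bins)]


def intersection(h1, h2):
    """Histogram intersection; assumed here as the similarity measure."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def classify(features, codebook, templates):
    """Return the label of the template most similar to the input's histogram."""
    indices = [quantize(v, codebook) for v in features]
    hist = histogram(indices, len(codebook))
    return max(templates, key=lambda label: intersection(hist, templates[label]))


# Toy data: 2-D vectors stand in for per-window MFCC vectors (assumption).
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
templates = {                      # hypothetical category templates
    "speech": [0.7, 0.2, 0.1],
    "music": [0.1, 0.2, 0.7],
}
features = [(0.1, -0.1), (0.2, 0.0), (0.9, 1.1), (4.8, 5.2), (0.0, 0.3)]

print(classify(features, codebook, templates))
```

Because each window collapses to a single codeword index, the comparison step works on short histograms rather than on high-dimensional feature vectors, which is where the claimed speed gain would come from.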

Description

Technical field

[0001] The invention relates to the technical field of multimedia data processing, in particular to an audio data classification method and device.

Background art

[0002] With the rapid development of multimedia technology and network technology, audio data has grown exponentially. Correspondingly, a large amount of audio data has appeared on the Internet; it is widely used in education, entertainment, news, advertising and other fields, and has become an important part of people's daily life. Therefore, how to classify this audio data is an urgent problem to be solved.

[0003] At present, in the prior art, feature vectors are first extracted from the audio data, and the audio data is then classified based on the GMM model. Since the dimension of the extracted feature vector is usually 39 dimensions or more, a large amount of labeled data is required to obtain the GMM model from the feature vector training under the framework of the ...

Application Information

IPC(8): G06F17/30
CPC: G06F16/683
Inventor: 杨晓昊
Owner: BEIJING QIYI CENTURY SCI & TECH CO LTD