Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Audio data structured conversion method based on working site

A job-site, audio data technology, applied in the field of job-site-based audio data structured transformation, can solve the problems of poor analysis accuracy, lack of standards for large-scale applications, and limited use, achieving good analysis accuracy and convenient keyword retrieval. Effect

Pending Publication Date: 2021-10-22
ZHONGSHAN POWER SUPPLY BUREAU OF GUANGDONG POWER GRID
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although some audio-based intelligent analysis products have appeared in recent years as a powerful supplement to manual screening analysis, they lack standards for large-scale application, and the analysis accuracy is poor, so the role is very limited

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio data structured conversion method based on working site
  • Audio data structured conversion method based on working site
  • Audio data structured conversion method based on working site

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The present invention will be further described below with reference to the specific embodiments in the drawings.

[0037] See Figure 1-3 , A structure-based conversion method based on the operating site, including:

[0038] Get the original audio in the field environment of the power job;

[0039] The MFCC feature extracts the original audio by Mel-Scale Frequency Cepstral Coefficients (MFCC) by Mel-Scale Frequency Cepstral Coefficients;

[0040] The original audio after the Dirichlet Process GaussianmixTure Model (DIRICHLET GAUSSIANMIXTURE MODEL, "DPGMM) is used to get the DPGMM after the DPGMM; the DPGMM is high, and the probability of each frame is usually in one-dimensional or Two-dimensional, other dimensions are mostly zero. The MFCC is full of acoustic details in all dimensions, but the DPGMM subtronomance map has the discrimination in dimensions; they add each other in the functional combination, MFCC-DPGMM feature extraction method figure 1 Indicated;

[0041] Con...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an audio data structured conversion method based on a working site, relates to the technical field of audio data, and solves the technical problem of poor precision of an existing audio analysis product. The method comprises the following steps: acquiring an original audio in an electric power working site environment; performing feature extraction on the original audio by using a Mel-frequency cepstrum coefficient (MFCC) to obtain MFCC features; processing the original audio by using a Dirichlet process Gaussian mixture model (DPGMM) to obtain a DPGMM posterior graph; connecting the DPGMM posterior graph with the MFCC feature to serve as an audio structuralized enhancement feature of the original audio, and obtaining a voice text; performing multi-label classification on the voice text by using a Catboost algorithm to obtain multi-label classification information; and storing the multi-label classification information into a database so as to facilitate subsequent keyword retrieval and deeper audio analysis. Feature extraction is carried out on the audio data through the MFCC-DPGMM, audio file multi-label processing is carried out through the Catboost algorithm, structured processing of the audio data is achieved, and the analysis precision is good.

Description

Technical field [0001] The present invention relates to the field of audio data, and more particularly, it relates to an audio data structure-based conversion method based on the job site. Background technique [0002] In the power industry, with the energy internet, smart grid, the construction and development of electricity networks, the various network topologies become more complicated. Especially in the intelligent monitoring of power operations, audio data such as field voice is widely collected, so massive audio data is also produced. However, the audio collection and analysis system at the current power operation site only implements data acquisition, massive audio needs to consume a lot of artificial processing and analysis, and there is an unable to intelligently audio data mining, unable to effectively express and manage and efficiently retrieval. How to extract structured information and content semantics in audio is the key to audio information depth processing, cont...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F16/31G06F16/61G06F16/65G10L25/24
CPCG06F16/355G06F16/313G06F16/65G06F16/61G10L25/24
Inventor 王天师李华刘文韬罗其锋张春梅谭伟谭莹莹包达志魏俊锋黄国柱
Owner ZHONGSHAN POWER SUPPLY BUREAU OF GUANGDONG POWER GRID
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products