Audio detection method based on LSTM network, electronic equipment and storage medium

An audio detection and network technology, applied in fields such as speech analysis and instruments, that addresses problems degrading speech recognition or voiceprint recognition and achieves high verification accuracy and convenient, efficient application.

Pending Publication Date: 2020-06-09
XIAMEN KUAISHANGTONG TECH CORP LTD

AI Technical Summary

Problems solved by technology

Invalid audio segments greatly degrade overall speech recognition and voiceprint recognition performance.

Examples

Embodiment 1

[0034] The present invention provides an audio detection method based on an LSTM network. As shown in Figure 1, the method includes the following steps:

[0035] Step S1: collect a certain number of pieces of audio data, then classify and label each piece.

[0036] Invalid audio data is labeled A and valid audio data is labeled B; in this embodiment, preferably A is 1 and B is 0. The audio data includes a first amount of invalid audio data and a second amount of valid audio data.

[0037] Each piece of audio is T seconds long, where 0.1 ≤ T ≤ 1; in this embodiment, T is preferably 0.5 s.

[0038] The invalid audio contains one or more of the following: noise, telephone ringing, and car sounds.
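The labeling scheme of Step S1 can be illustrated with a minimal Python sketch. It assumes the raw recordings are already available as lists of samples and that a whole recording carries a single label; the names `LabeledClip` and `make_clips`, and the choice to drop a trailing remainder shorter than one clip, are hypothetical and not taken from the patent.

```python
from dataclasses import dataclass
from typing import List, Sequence

INVALID = 1  # label A in the text (noise, telephone ringing, car sounds)
VALID = 0    # label B in the text


@dataclass
class LabeledClip:
    samples: List[float]
    label: int


def make_clips(signal: Sequence[float], sample_rate: int, label: int,
               clip_seconds: float = 0.5) -> List[LabeledClip]:
    """Cut `signal` into non-overlapping clips of T seconds and attach `label`."""
    if not 0.1 <= clip_seconds <= 1.0:
        raise ValueError("clip length T must satisfy 0.1 <= T <= 1")
    if label not in (INVALID, VALID):
        raise ValueError("label must be 1 (invalid) or 0 (valid)")
    clip_len = int(round(clip_seconds * sample_rate))
    clips = []
    # Assumption of this sketch: a trailing remainder shorter than one clip is dropped.
    for start in range(0, len(signal) - clip_len + 1, clip_len):
        clips.append(LabeledClip(list(signal[start:start + clip_len]), label))
    return clips
```

For instance, calling `make_clips` on a recording of telephone ringing with `label=INVALID` would yield 0.5 s clips that all carry label 1.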

[0039] Step S2: construct a binary (two-class) classification model.

[0040] The c...
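Because the model details are truncated in this excerpt, the following is only a generic sketch of what an LSTM-based binary classifier computes, not the patented architecture. It shows the forward pass of a single-layer LSTM followed by a sigmoid output unit, in plain Python. The class name `TinyLSTMClassifier`, the layer sizes, the random untrained weights, the 0.5 decision threshold, and the idea of feeding per-frame feature vectors are all assumptions; training is omitted.

```python
import math
import random
from typing import List


def sigmoid(x: float) -> float:
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


class TinyLSTMClassifier:
    """Single-layer LSTM plus a sigmoid output unit (forward pass only, untrained)."""

    def __init__(self, input_size: int, hidden_size: int, seed: int = 0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        cols = input_size + hidden_size
        # One weight matrix and bias per gate: input (i), forget (f),
        # cell candidate (g), output (o).
        self.W = {g: [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
                      for _ in range(hidden_size)] for g in "ifgo"}
        self.b = {g: [0.0] * hidden_size for g in "ifgo"}
        self.w_out = [rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]
        self.b_out = 0.0

    def _gate(self, name: str, xh: List[float], act) -> List[float]:
        return [act(sum(w * v for w, v in zip(row, xh)) + b)
                for row, b in zip(self.W[name], self.b[name])]

    def predict_proba(self, frames: List[List[float]]) -> float:
        """Return the estimated probability that the clip is invalid (label 1)."""
        h = [0.0] * self.hidden_size
        c = [0.0] * self.hidden_size
        for x in frames:
            xh = list(x) + h  # concatenate current input and previous hidden state
            i = self._gate("i", xh, sigmoid)
            f = self._gate("f", xh, sigmoid)
            g = self._gate("g", xh, math.tanh)
            o = self._gate("o", xh, sigmoid)
            c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
            h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
        return sigmoid(sum(w * v for w, v in zip(self.w_out, h)) + self.b_out)

    def predict(self, frames: List[List[float]]) -> int:
        # Threshold of 0.5 is an assumption: 1 = invalid, 0 = valid.
        return 1 if self.predict_proba(frames) >= 0.5 else 0
```

In practice a deep-learning framework would normally supply both the LSTM layer and the training loop; the hand-written cell here is only meant to make the gate arithmetic visible.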

Embodiment 2

[0072] An embodiment of the present invention provides an electronic device. The electronic device includes at least one processor and a memory communicatively connected to the at least one processor. The memory stores instructions executable by the at least one processor; when executed, the instructions cause the at least one processor to perform the steps of the audio detection method based on the LSTM network. These steps are the same as those in Embodiment 1 and are not repeated here.

Embodiment 3

[0074] An embodiment of the present invention provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the audio detection method based on the LSTM network are implemented. The steps of the LSTM network-based audio detection method in this embodiment are the same as those in Embodiment 1, and will not be repeated in this embodiment.

[0075] It should be noted that if the LSTM network-based audio detection method provided by the present invention is implemented as a software function module and sold or used as an independent product, it can also be stored in a computer-readable storage medium. On this understanding, the technical solution of the embodiment of the present invention, in essence, or the part of it that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage...

Abstract

The invention discloses an audio detection method based on an LSTM network, together with an electronic device and a storage medium. The method comprises the following steps: collecting a certain number of pieces of audio data, and classifying and labeling each piece; constructing a binary classification model; training the classification model with an LSTM network; splitting the audio data to be tested into multiple segments of sub-audio data; inputting each segment of sub-audio data into the trained classification model for classification; and splicing the retained valid sub-audio data to form the valid audio. The method automatically detects audio segments along the time dimension and automatically removes invalid audio when it is detected. Its advantages include more user-friendly configuration, high verification accuracy, and convenient, efficient application.
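The detection stage summarized above (split, classify, keep, splice) can be sketched in a few lines of Python. This is an illustration under assumptions, not the patented implementation: `remove_invalid_audio` is a hypothetical name, the classifier is passed in as any callable returning 1 (invalid) or 0 (valid), and `energy_stub` is a stand-in that flags near-silent segments so the example runs without a trained LSTM model.

```python
from typing import Callable, List, Sequence


def remove_invalid_audio(signal: Sequence[float], sample_rate: int,
                         classify: Callable[[List[float]], int],
                         clip_seconds: float = 0.5) -> List[float]:
    """Split `signal` into T-second segments, drop those classified as
    invalid (1), and splice the valid ones (0) back together in order."""
    clip_len = int(round(clip_seconds * sample_rate))
    if clip_len <= 0:
        raise ValueError("segment length must be at least one sample")
    kept: List[float] = []
    for start in range(0, len(signal), clip_len):
        # The last segment may be shorter than clip_len; this sketch still classifies it.
        clip = list(signal[start:start + clip_len])
        if classify(clip) == 0:
            kept.extend(clip)
    return kept


def energy_stub(clip: List[float]) -> int:
    """Placeholder classifier (NOT the LSTM model): near-silent segments are invalid."""
    energy = sum(s * s for s in clip) / len(clip)
    return 1 if energy < 1e-4 else 0
```

Passing the classifier as a parameter keeps the splitting and splicing logic independent of the model, so a trained LSTM classifier could replace `energy_stub` without changing the rest of the function.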

Description

Technical field

[0001] The invention relates to the field of speech analysis, and in particular to an audio detection method based on an LSTM network, as well as related electronic equipment and storage media.

Background

[0002] Voice-related technologies such as speech recognition and voiceprint recognition have always suffered from interference by invalid audio. For example, the audio may contain excessive noise, car horns, phone ringtones, blank audio segments, and so on. These invalid audio segments greatly degrade overall speech recognition or voiceprint recognition, so they need to be detected and removed.

Summary of the invention

[0003] The purpose of the present invention is to solve the problems of the prior art by automatically detecting audio segments along the time dimension.

[0004] The present invention provides a kind of audio detection method based on LS...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L25/51, G10L25/30, G10L25/24
CPC: G10L25/51, G10L25/30, G10L25/24
Inventor: 白坤, 肖龙源, 李稀敏, 蔡振华, 刘晓葳
Owner: XIAMEN KUAISHANGTONG TECH CORP LTD