Audio detection method based on LSTM network, electronic equipment and storage medium

An audio detection technology based on an LSTM network, applied in fields such as speech analysis and instruments. It addresses invalid audio that degrades speech recognition or voiceprint recognition, and it offers convenient, efficient application and high verification accuracy.

Pending Publication Date: 2020-06-09

XIAMEN KUAISHANGTONG TECH CORP LTD



AI Technical Summary

Problems solved by technology

Invalid audio segments substantially degrade overall speech recognition or voiceprint recognition performance.

Method used

The method collects and labels pieces of audio data, trains a binary classification model with an LSTM network, splits the audio under test into sub-audio segments, classifies each segment, removes the invalid ones, and splices the remaining valid segments into the output audio.


Examples


Embodiment 1

[0034] The present invention provides an audio detection method based on an LSTM network. As shown in Figure 1, the method comprises the following steps:

[0035] Step S1, collecting a number of pieces of audio data, and classifying and labeling each piece of audio data.

[0036] Invalid audio data is labeled A and valid audio data is labeled B; in this embodiment, preferably A is 1 and B is 0. The audio data includes a first amount of invalid audio data and a second amount of valid audio data.

[0037] Each piece of audio is T seconds long, where 0.1 ≤ T ≤ 1; in this embodiment, T is preferably 0.5 s.

[0038] The invalid audio contains one or more of the following: noise, telephone ringing, and car sounds.
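The labeling in Step S1 can be sketched as follows. This is a minimal illustration, assuming each recording is already decoded into a flat list of samples at a known sample rate. The function names `split_into_clips` and `label_clips`, the 16 kHz rate, and the toy recordings are hypothetical; only the label values (A = 1 for invalid, B = 0 for valid) and the clip length T = 0.5 s come from the embodiment.

```python
INVALID = 1          # label A in the embodiment (invalid audio)
VALID = 0            # label B in the embodiment (valid audio)
CLIP_SECONDS = 0.5   # T, which must satisfy 0.1 <= T <= 1


def split_into_clips(samples, sample_rate, clip_seconds=CLIP_SECONDS):
    """Cut a sample sequence into consecutive clips of clip_seconds each.

    A trailing remainder shorter than one clip is dropped. That is an
    assumption; the source does not say how a partial clip is handled.
    """
    if not 0.1 <= clip_seconds <= 1:
        raise ValueError("clip length T must satisfy 0.1 <= T <= 1")
    clip_len = int(round(sample_rate * clip_seconds))
    n_clips = len(samples) // clip_len
    return [samples[i * clip_len:(i + 1) * clip_len] for i in range(n_clips)]


def label_clips(clips, is_invalid):
    """Pair every clip with label A (invalid) or label B (valid)."""
    label = INVALID if is_invalid else VALID
    return [(clip, label) for clip in clips]


# Hypothetical recordings: 1.2 s of "ringing" and 1.0 s of "speech" at 16 kHz.
ringing = [0.0] * 19200
speech = [0.1] * 16000
dataset = (label_clips(split_into_clips(ringing, 16000), is_invalid=True)
           + label_clips(split_into_clips(speech, 16000), is_invalid=False))
```

Each entry of `dataset` is a `(clip, label)` pair, which is the shape a training routine would typically consume.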

[0039] Step S2, constructing a binary classification model;

[0040] The c...
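The text above is truncated before it describes the model, so the following is only a sketch of what an LSTM-based binary classifier looks like in general, not the architecture disclosed in the patent. It assumes each audio clip has already been converted to a sequence of feature frames (the feature type is not stated here), uses one standard LSTM cell followed by a logistic output, and runs the forward pass only with random, untrained weights. All names (`make_params`, `lstm_step`, `predict_invalid_probability`) and sizes are hypothetical.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_params(input_size, hidden_size, seed=0):
    """Random, untrained weights for one LSTM layer plus a logistic output."""
    rng = random.Random(seed)
    width = input_size + hidden_size
    gates = {
        name: {
            "w": [[rng.uniform(-0.1, 0.1) for _ in range(width)]
                  for _ in range(hidden_size)],
            "b": [0.0] * hidden_size,
        }
        for name in ("input", "forget", "cell", "output")
    }
    out_w = [rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]
    return {"gates": gates, "out_w": out_w, "out_b": 0.0}


def affine(gate, vec):
    """Compute W @ vec + b for one gate."""
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(gate["w"], gate["b"])]


def lstm_step(params, x, h, c):
    """One standard LSTM cell update for input frame x."""
    z = list(x) + list(h)  # concatenate input frame and previous hidden state
    g = params["gates"]
    i = [sigmoid(a) for a in affine(g["input"], z)]
    f = [sigmoid(a) for a in affine(g["forget"], z)]
    o = [sigmoid(a) for a in affine(g["output"], z)]
    c_hat = [math.tanh(a) for a in affine(g["cell"], z)]
    c_new = [fi * ci + ii * chi for fi, ci, ii, chi in zip(f, c, i, c_hat)]
    h_new = [oi * math.tanh(ci) for oi, ci in zip(o, c_new)]
    return h_new, c_new


def predict_invalid_probability(params, frames):
    """Run the frames through the LSTM and map the last state to (0, 1)."""
    hidden_size = len(params["out_w"])
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    for frame in frames:
        h, c = lstm_step(params, frame, h, c)
    logit = sum(w * v for w, v in zip(params["out_w"], h)) + params["out_b"]
    return sigmoid(logit)


# Hypothetical clip: two feature frames of three values each.
params = make_params(input_size=3, hidden_size=4)
frames = [[0.2, -0.1, 0.5], [0.0, 0.3, -0.4]]
probability = predict_invalid_probability(params, frames)
label = 1 if probability >= 0.5 else 0  # 1 = invalid (A), 0 = valid (B)
```

Training, which the method performs with labeled clips, would adjust these weights; it is omitted here, and in practice a deep learning framework would normally replace this hand-written cell.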

Embodiment 2

[0072] An embodiment of the present invention provides an electronic device. The electronic device includes at least one processor and a memory communicatively connected to the at least one processor. The memory stores instructions executable by the at least one processor; when executed, the instructions cause the at least one processor to perform the steps of the audio detection method based on the LSTM network. These steps are the same as those in Embodiment 1 and are not repeated here.

Embodiment 3

[0074] An embodiment of the present invention provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the audio detection method based on the LSTM network are implemented. The steps of the LSTM network-based audio detection method in this embodiment are the same as those in Embodiment 1, and will not be repeated in this embodiment.

[0075] It should be noted that if the LSTM network-based audio detection method provided by the present invention is implemented as a software function module and sold or used as an independent product, it can also be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the embodiment of the present invention, in essence or in the part that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage...


Abstract

The invention discloses an audio detection method based on an LSTM network, electronic equipment and a storage medium. The method comprises the following steps: collecting a number of pieces of audio data, and classifying and labeling each piece; constructing a binary classification model; training the classification model using an LSTM network; splitting the audio data under test into multiple segments of sub-audio data; inputting each segment of sub-audio data into the trained classification model for classification; and splicing the retained valid sub-audio data to form the valid audio. The method automatically detects audio segments along the time dimension and automatically removes invalid audio when it is detected. Its advantages include more user-friendly configuration, high verification accuracy, and convenient, efficient application.
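The detection stage in the abstract (split the audio under test, classify each segment, drop the invalid ones, splice the rest) can be sketched as below. This is an illustrative outline under stated assumptions, not the patented implementation: `split_audio`, `remove_invalid_audio`, and `energy_stub` are hypothetical names, and `energy_stub` is a placeholder that stands in for the trained LSTM classifier only so the example runs. A trailing segment shorter than `segment_len` is kept and classified like any other, which is also an assumption.

```python
def split_audio(samples, segment_len):
    """Cut the audio under test into consecutive sub-audio segments."""
    return [samples[i:i + segment_len]
            for i in range(0, len(samples), segment_len)]


def remove_invalid_audio(samples, segment_len, classify):
    """Keep only segments classified as valid (0) and splice them in order."""
    kept = []
    for segment in split_audio(samples, segment_len):
        if classify(segment) == 0:  # 0 = valid, 1 = invalid
            kept.extend(segment)
    return kept


def energy_stub(segment, threshold=0.01):
    """Placeholder classifier: treats near-silent segments as invalid.

    A real system would call the trained LSTM model here instead.
    """
    energy = sum(s * s for s in segment) / len(segment)
    return 1 if energy < threshold else 0


# Toy signal: a loud segment, a blank segment, and another loud segment.
audio = [0.5] * 4 + [0.0] * 4 + [0.3] * 4
cleaned = remove_invalid_audio(audio, segment_len=4, classify=energy_stub)
```

Passing the classifier in as a function keeps the splitting and splicing logic independent of the model, so the stub can be swapped for a trained model without touching the rest.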

Description

Technical field

[0001] The invention relates to the field of speech analysis, and in particular to an audio detection method based on an LSTM network, as well as related electronic equipment and storage media.

Background

[0002] Voice-related technologies such as speech recognition and voiceprint recognition have always suffered from interference by invalid audio. For example, the audio may contain excessive noise, car horns, phone ringtones, or blank segments. Such invalid segments substantially degrade overall speech recognition or voiceprint recognition performance, so they need to be detected and removed.

Summary of the invention

[0003] The purpose of the present invention is to solve the problems of the prior art by automatically detecting audio segments along the time dimension.

[0004] The present invention provides a kind of audio detection method based on LS...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G10L25/51, G10L25/30, G10L25/24

CPC: G10L25/51, G10L25/30, G10L25/24

Inventor: 白坤, 肖龙源, 李稀敏, 蔡振华, 刘晓葳

Owner: XIAMEN KUAISHANGTONG TECH CORP LTD
