Bird sound recognition method based on multi-feature fusion and combination model

A multi-feature fusion and combined model technology, applied in the field of bird sound recognition, can solve the problems of single extraction feature and insufficient bird sound characteristics, and achieve the effect of large feature difference, solving the lack of bird sound features, and improving the difference.

Active Publication Date: 2021-11-30
NANJING UNIV OF INFORMATION SCI & TECH
View PDF3 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the problem that the extracted feature is single in the existing bird voice recognition method and the representative bird voice characteristics are insufficient, the present invention provides a bird voice recognition method based on multi-feature fusion and combination model, which uses fusion features instead of single features , so that the feature differences between different bird voices are greater and easier to be distinguished, and the combination of three neural network models is used for recognition, which improves the accuracy of bird voice recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bird sound recognition method based on multi-feature fusion and combination model
  • Bird sound recognition method based on multi-feature fusion and combination model
  • Bird sound recognition method based on multi-feature fusion and combination model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] Embodiments of the present invention will be described below in conjunction with the accompanying drawings.

[0028] like figure 1 As shown, the present invention relates to a kind of bird's voice recognition method based on multi-feature fusion and combination model, and this method mainly comprises the following steps:

[0029] Step 1. Preprocess the read original bird sound audio, including pre-emphasis and frame-by-frame windowing, as follows:

[0030] First, the original bird sound audio is read at a frequency of 22.5KHz, and a first-order FIR high-pass digital filter is used to pre-emphasize the read original bird sound audio, with a pre-emphasis coefficient of 0.9665. Then the Hamming window is used for frame division and windowing. The frame length is 23ms and the frame shift is 11.5ms. A total of 173 frames of bird voice data can be obtained.

[0031] Step 2. Extract the Mel cepstral coefficient (MFCC) of the bird's voice from the preprocessed original bird's...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a bird sound recognition method based on multi-feature fusion and a combination model. The method comprises the steps: carrying out the preprocessing of a read original bird sound audio, and carrying out the pre-emphasis and framing windowing; extracting four characteristics of the Mel-frequency cepstrum coefficient, the Mel-filtered energy coefficient, the short-time zero-crossing rate and the short-time spectrum centroid of the bird sound, respectively normalizing the four characteristics, and longitudinally splicing the four characteristics to form a fusion characteristic; drawing an STFT speech spectrogram; respectively inputting the fusion characteristic and the drawn STFT speech spectrogram into two constructed CNN models based on an Inception module for training, splicing probability arrays output by the two models to form a characteristic array after training is completed, taking the characteristic array as input of an ANN model for training, and loading optimal parameters of the three models after training is completed; and inputting any to-be-detected bird sound audio into the three models loaded with the optimal parameters to obtain a bird sound recognition and classification result. According to the invention, the characteristic difference between different bird sounds can be improved, and the bird sound recognition accuracy is increased.

Description

technical field [0001] The invention relates to a bird voice recognition method based on multi-feature fusion and combination models, and belongs to the technical field of bird voice classification and recognition. Background technique [0002] Birds are an important part of the natural ecosystem. Because they are very sensitive to changes in the ecological environment in which they live and are easy to be observed and studied, the monitoring and identification of birds is helpful to the monitoring of the ecological environment. The development of protection work is of great significance. Bird monitoring is an important field of research at home and abroad. Traditional bird monitoring mainly depends on the differences in the morphological characteristics of birds. In terms of hearing, birdsong also contains unique features, and has a wide range, Stability, low interference and other advantages, so the research on bird voice recognition is particularly important. [0003] T...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/02G10L17/04G10L17/18G10L17/26G10L25/24G10L25/30
CPCG10L17/02G10L17/04G10L17/18G10L17/26G10L25/24G10L25/30
Inventor 周晓彦欧昀李大鹏刘文强
Owner NANJING UNIV OF INFORMATION SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products