Voiceprint recognition method under channel attention propagation and aggregation

A voiceprint recognition technique based on channel attention, applied in the field of signal processing. It addresses reduced voiceprint recognition performance, insufficiently robust voiceprint encoding, and the lack of attention to frame-level features from intermediate network layers and to the information in each channel.

Active Publication Date: 2021-07-06
CHONGQING UNIV OF POSTS & TELECOMM
6 Cites, 4 Cited by

AI Technical Summary

Problems solved by technology

Existing models extract the attribute features of a speaker's voiceprint only from the frame-level features of the last network layer. They ignore the frame-level features extracted by the other network layers and the rich information contained in each channel. As a result, useful voiceprint information is neither captured nor emphasized, which makes the voiceprint encoding output by the network less robust and reduces recognition performance.
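To make "emphasizing the information in each channel" concrete, the following is a minimal sketch of squeeze-and-excitation style channel attention over frame-level features. It is an illustration under stated assumptions, not the patented network: the function name `channel_attention`, the weight matrices `w1` and `w2`, and the bottleneck layout are hypothetical, and the source does not specify the attention mechanism in this detail.

```python
import math


def sigmoid(x):
    """Logistic function, mapping any real value into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(features, w1, w2):
    """Rescale each channel of a frame-level feature map by a learned gate.

    features: list of C channels, each a list of T frame values.
    w1: bottleneck weights, shape [B][C] (hypothetical parameters).
    w2: expansion weights, shape [C][B] (hypothetical parameters).
    Returns (scaled_features, gates).
    """
    # Squeeze: summarize every channel by its mean over time.
    squeezed = [sum(channel) / len(channel) for channel in features]

    # Excitation: a small bottleneck layer with ReLU ...
    hidden = [
        max(0.0, sum(w * s for w, s in zip(row, squeezed)))
        for row in w1
    ]
    # ... followed by one sigmoid gate per channel.
    gates = [
        sigmoid(sum(w * h for w, h in zip(row, hidden)))
        for row in w2
    ]

    # Scale: multiply every frame of a channel by that channel's gate.
    scaled = [[g * v for v in channel] for g, channel in zip(gates, features)]
    return scaled, gates


# Toy input: 3 channels, 2 frames each, with hand-picked weights.
features = [[1.0, 3.0], [0.0, 0.0], [2.0, 2.0]]
w1 = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]
w2 = [[1.0, 0.0], [0.0, 0.0], [0.0, -1.0]]
scaled, gates = channel_attention(features, w1, w2)
```

Channels with gates close to 1 pass through almost unchanged, while channels with gates close to 0 are suppressed. In a trained network the weights would be learned, and the same gating could be applied at several layers before their outputs are aggregated.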

Embodiment Construction

[0049] Embodiments of the present invention are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other specific embodiments, and the details in this specification can be modified or changed for different viewpoints and applications without departing from the spirit of the present invention. It should be noted that the diagrams provided in the following embodiments only illustrate the basic concept of the present invention schematically, and that the following embodiments and their features can be combined with each other where no conflict arises.

[0050] Wherein, the accompanying drawings are for illustrative purposes only, and represent only schematic diagrams, rather than physical drawings, and should ...

Abstract

The invention relates to a voiceprint recognition method under channel attention propagation and aggregation, belonging to the field of signal processing. The method comprises the following steps: S1, applying a second-order wavelet scattering transform to the original discrete speech signal; S2, mapping the multi-scale features to a voiceprint code; and S3, evaluating the similarity of the voiceprint codes. Multi-scale short-time speech features are obtained through the wavelet scattering transform, and a time delay neural network based on channel attention propagation and aggregation maps these features to voiceprint codes, which improves the accuracy and robustness of voiceprint recognition. The method handles both long and short utterances, provides a new technical means for voiceprint recognition on data containing short utterances, and can also be transferred to other speech processing fields as a method for obtaining voiceprint codes.
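Step S3 can be illustrated with a short sketch. The visible text does not state which similarity measure the method uses, so cosine similarity is an assumption here, chosen because it is a common way to score speaker embeddings. The function names and the `threshold` value of 0.5 are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two voiceprint codes of equal length."""
    if len(a) != len(b):
        raise ValueError("voiceprint codes must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("voiceprint codes must be non-zero")
    return dot / (norm_a * norm_b)


def same_speaker(enrolled, probe, threshold=0.5):
    """Accept the probe if its score reaches the (assumed) threshold."""
    return cosine_similarity(enrolled, probe) >= threshold


# Toy codes standing in for the network's output in step S2.
enrolled_code = [1.0, 0.0]
print(same_speaker(enrolled_code, [1.0, 0.1]))   # similar direction
print(same_speaker(enrolled_code, [-1.0, 0.0]))  # opposite direction
```

In practice the threshold would be tuned on held-out verification trials rather than fixed in advance.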

Description

Technical field

[0001] The invention belongs to the field of signal processing and relates to a voiceprint recognition method under channel attention propagation and aggregation.

Background

[0002] As a biometric technology, voiceprint recognition has the following advantages over face recognition, fingerprint recognition and other technologies: (1) easy to obtain; (2) low cost; (3) high user acceptance; (4) wide applicability. In recent years, research on using the hidden-layer outputs of neural networks to encode voiceprints has made remarkable progress. However, because few voiceprint features can be extracted from data containing short utterances, and because those features are poorly robust, the reliable operation of voiceprint recognition systems still faces major challenges.

[0003] Many research works use data sets such as VoxCeleb or LibriSpeech for modeling and verification. The average audio duration of these data set...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L17/00, G10L17/02, G10L17/18, G10L19/02, G10L19/26, G10L25/51
CPC: G10L17/02, G10L17/18, G10L19/0216, G10L19/26, G10L25/51
Inventor: 李鹏华, 田鹏, 刘行谋, 陈旭赢, 李祖栋, 卢楠, 王宁, 鲁鑫, 高翔
Owner: CHONGQING UNIV OF POSTS & TELECOMM