A voiceprint recognition method and system based on variational information bottleneck
A voiceprint recognition technology based on a variational information bottleneck, applied to voiceprint recognition methods and systems. It addresses the problem of low voiceprint recognition accuracy, with the effects of improving recognition accuracy, improving robustness, and reducing feature redundancy.
Examples
Embodiment 1
[0048] This embodiment of the present invention provides a voiceprint recognition method based on a variational information bottleneck, including:
[0049] S1: Obtain the original voice data;
[0050] S2: Construct a voiceprint recognition model that introduces a variational information bottleneck. The model includes an acoustic feature parameter extraction layer, a frame-level feature extraction network, a feature aggregation layer, a variational information bottleneck layer, and a classifier. The acoustic feature parameter extraction layer converts the input original speech waveform into the acoustic feature parameter FBank. The frame-level feature extraction network extracts multi-scale and multi-frequency frame-level speaker information from the acoustic feature parameter FBank by means of a single aggregation method, yielding frame-level feature vectors. The feature aggregation layer is used to c...
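The description of the variational information bottleneck layer is truncated above, so the following is only a minimal sketch of how such a layer is commonly formulated, not the method claimed here. It assumes a Gaussian encoder with the reparameterization trick and a KL penalty against a standard normal prior. The names `VIBLayer`, `vib_loss`, and the weight `beta` are hypothetical and do not appear in the source.

```python
import numpy as np


class VIBLayer:
    """Hypothetical VIB layer: utterance-level vector -> stochastic embedding z."""

    def __init__(self, in_dim, z_dim, seed=0):
        self.rng = np.random.default_rng(seed)
        scale = 1.0 / np.sqrt(in_dim)
        # Two linear heads: one for the mean, one for the log-variance.
        self.w_mu = self.rng.standard_normal((in_dim, z_dim)) * scale
        self.w_logvar = self.rng.standard_normal((in_dim, z_dim)) * scale
        self.b_mu = np.zeros(z_dim)
        self.b_logvar = np.zeros(z_dim)

    def forward(self, x, sample=True):
        mu = x @ self.w_mu + self.b_mu
        logvar = x @ self.w_logvar + self.b_logvar
        if sample:
            # Reparameterization: z = mu + sigma * eps, with eps ~ N(0, I).
            eps = self.rng.standard_normal(mu.shape)
            z = mu + np.exp(0.5 * logvar) * eps
        else:
            # At test time the mean can serve as a deterministic embedding.
            z = mu
        # KL( N(mu, sigma^2) || N(0, I) ), summed over embedding dimensions.
        kl = 0.5 * np.sum(mu ** 2 + np.exp(logvar) - logvar - 1.0, axis=1)
        return z, kl


def vib_loss(logits, labels, kl, beta=1e-3):
    """Cross-entropy plus beta-weighted KL term (beta is an assumed value)."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    ce = -log_probs[np.arange(len(labels)), labels]
    return float(np.mean(ce + beta * kl))
```

In this sketch, the KL term penalizes information carried by the embedding beyond what the classifier needs, which is one way to express the reduction of feature redundancy mentioned in the summary.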
Embodiment 2
[0118] Based on the same inventive concept, this embodiment provides a voiceprint recognition system based on a variational information bottleneck, including:
[0119] A data acquisition module for acquiring original voice data;
[0120] A model building module for constructing a voiceprint recognition model that introduces a variational information bottleneck. The voiceprint recognition model includes an acoustic feature parameter extraction layer, a frame-level feature extraction network, a feature aggregation layer, a variational information bottleneck layer, and a classifier. The acoustic feature parameter extraction layer converts the input original speech waveform into the acoustic feature parameter FBank. The frame-level feature extraction network extracts multi-scale and multi-frequency frame-level speaker information from the acoustic feature parameter FBank, yielding frame-level feature vectors. The feat...
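The acoustic feature parameter extraction layer named above converts a raw waveform into FBank features. As an illustration only, the sketch below computes log mel filterbank energies with NumPy. The frame length (400 samples), hop (160 samples), FFT size (512), number of mel filters (40), and 16 kHz sample rate are assumed values that the source does not specify, and the function names are hypothetical.

```python
import numpy as np


def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        left, center, right = bins[m - 1], bins[m], bins[m + 1]
        for k in range(left, center):      # rising edge of the triangle
            fb[m - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):     # falling edge of the triangle
            fb[m - 1, k] = (right - k) / max(right - center, 1)
    return fb


def fbank(waveform, sample_rate=16000, frame_len=400, hop=160, n_fft=512, n_mels=40):
    """Return log mel filterbank energies with shape (n_frames, n_mels)."""
    n_frames = 1 + (len(waveform) - frame_len) // hop
    # Index matrix that slices the waveform into overlapping frames.
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = waveform[idx] * np.hamming(frame_len)
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    mel_energy = power @ mel_filterbank(n_mels, n_fft, sample_rate).T
    return np.log(mel_energy + 1e-10)      # small floor avoids log(0)
```

Each row of the result corresponds to one frame, which is the form of input a frame-level feature extraction network would consume.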