Speaker recognition method and system based on multi-source attention network
A technology of speaker recognition and attention, applied in the field of speaker recognition, can solve problems such as improvement of recognition accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0036] This embodiment provides a speaker recognition method based on a multi-source attention network;
[0037] Such as figure 1 As shown, the speaker recognition method based on multi-source attention network, including:
[0038] S101: Extract gender features of the speech segment to be recognized; extract accent features of the speech segment to be recognized;
[0039] S102: Extract the timbre features of the speech segment to be recognized based on the CNN network of the trained multi-source attention network;
[0040] S103: A gender attention network based on the trained multi-source attention network, using gender features and timbre features to construct gender auxiliary features;
[0041] S104: An accent attention network based on the trained multi-source attention network, using accent features and timbre features to construct accent auxiliary features;
[0042] S105: Perform speaker recognition by combining the timbre feature, gender auxiliary feature and accent a...
Embodiment 2
[0154] This embodiment provides a speaker recognition system based on a multi-source attention network;
[0155] Speaker recognition system based on multi-source attention network, including:
[0156] Gender and accent feature extraction module, which is configured to: extract the gender feature of the speech segment to be recognized; extract the accent feature of the speech segment to be recognized;
[0157] The timbre feature extraction module is configured to: extract the timbre features of the speech segment to be recognized based on the CNN network of the trained multi-source attention network;
[0158] A gender auxiliary feature construction module, which is configured to: use gender features and timbre features to construct gender auxiliary features based on the gender attention network of the trained multi-source attention network;
[0159] An accent auxiliary feature construction module, which is configured as: an accent attention network based on the trained multi-s...
Embodiment 3
[0165] This embodiment also provides an electronic device, including: one or more processors, one or more memories, and one or more computer programs; wherein, the processor is connected to the memory, and the one or more computer programs are programmed Stored in the memory, when the electronic device is running, the processor executes one or more computer programs stored in the memory, so that the electronic device executes the method described in Embodiment 1 above.
[0166]It should be understood that in this embodiment, the processor can be a central processing unit CPU, and the processor can also be other general-purpose processors, digital signal processors DSP, application specific integrated circuits ASIC, off-the-shelf programmable gate array FPGA or other programmable logic devices , discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com