End-to-end macaque voiceprint verification method and system based on cyclic frame-level feature fusion
A voiceprint verification and feature fusion technology, applied in the computer field, can solve the problems of inaccessibility, difficulty in implementation, and high vigilance of primates, and achieve the effect of expanding limitations and improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0053] Such as figure 1 As shown, Embodiment 1 of the present invention proposes an end-to-end rhesus monkey voiceprint verification method based on cyclic frame-level feature fusion, which method includes the following steps:
[0054] Step 110: Select speech pairs from the preprocessed macaque corpus according to the rules, and construct a training set and a test set. Among them, the training set data is randomly divided into q groups, and each group contains the speech segments of m macaques.
[0055] In the prior art, feature extraction is usually performed on speech data in the preprocessing stage to obtain MFCC, LPC, or spectrogram, etc., for classification by classification models. The preprocessing of the present invention is to intercept the effective speech segment, that is, to cut out the silent segment in the original speech, rather than feature extraction, and the macaque corpus after preprocessing is still data in the speech format.
[0056] Step 120: Randomly r...
Embodiment 2
[0107] Such as Figure 4 As shown, Embodiment 2 of the present invention proposes a macaque voiceprint verification system, which is implemented based on the above-mentioned macaque voiceprint verification method. In the training and testing phase, the system specifically includes:
[0108] The data processing module 410 is used to select the speech pair by the macaque corpus after the preprocessing according to the rules, constructs the training set and the test set, and the training set and the test set are grouped;
[0109] The backbone network 420 is used to extract the features of the input rhesus monkey speech segment to obtain the frame level feature vector of the rhesus monkey speech segment;
[0110] The feature fusion network 430 is used to perform cyclic frame interception on the frame-level feature vector output by the backbone network, and then perform feature fusion to obtain the fusion frame feature vector of the macaque speech segment;
[0111] The feature com...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com