Speaker verification method and system based on tensor structure and sparse representation

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of speaker confirmation and sparse representation, applied in the field of speaker recognition, can solve problems such as a large amount of memory, slow down the recognition process, limit the number of training samples, etc., to reduce the computational complexity and improve the confirmation efficiency.

Active Publication Date: 2019-07-12

HANGZHOU DIANZI UNIV

View PDF6 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

But due to the over-complete dictionary, the sparse representation of high-dimensional supervectors requires a lot of memory, which limits the number of training samples and may slow down the recognition process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0033] This embodiment provides a speaker confirmation method based on tensor structure and sparse representation, such as figure 1 shown, including steps:

[0034] S1. Construct an auditory feature tensor;

[0035] The present invention processes speech signals by simulating the auditory system of the human ear to obtain its power spectrum. In order to obtain robust features based on tensor structure, this embodiment models the obtained power spectra of different speakers as a third-order tensor quantity.

[0036] Specifically, the human ear can easily perform speaker recognition tasks and is insensitive to noise. In our feature extraction framework, we obtain frequency-selective information by mimicking the processes performed by the human ear in the auditory periphery and pathways.

[0037] First, the present invention extracts features by simulating the process of auditory periphery and pathway occurrence, such as outer ear, middle ear, basilar membrane, inner hair cell...

Embodiment 2

[0099] This embodiment provides a speaker confirmation system based on tensor structure and sparse representation, such as figure 2 shown, including:

[0100] Building blocks for constructing auditory feature tensors;

[0101] The present invention processes speech signals by simulating the auditory system of the human ear to obtain its power spectrum. In order to obtain robust features based on tensor structure, this embodiment models the obtained power spectra of different speakers as a third-order tensor quantity.

[0102] Specifically, the human ear can easily perform speaker recognition tasks and is insensitive to noise. In our feature extraction framework, we capture frequency-selective information by mimicking the processes performed by Ren in the auditory periphery and pathways.

[0103]First, the present invention extracts features by simulating the process of auditory periphery and pathway occurrence, such as outer ear, middle ear, basilar membrane, inner hair ce...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speaker verification method and system based on a tensor structure and sparse representation. The method comprises the following steps of Step 1, building an auditory featuretensor; Step 2, converting the auditory feature tensor into a spare matrix; Step 3, reducing the dimension of the sparse matrix so as to generate a final feature vector; and Step 4, performing speaker verification on the basis of a sparse representation classifier. The speaker verification method and system have the advantages that on the basis of remaining an internal structure of data, the calculation complexity is reduced; and the speaker verification efficiency is improved.

Description

technical field [0001] The present invention relates to the technical field of speaker recognition, in particular to a speaker confirmation method and system based on tensor structure and sparse representation. Background technique [0002] The task of speaker recognition is to identify the speaker. Speaker recognition can be divided into speaker identification and speaker confirmation. For speaker identification, it is to find the correct speaker from multiple speakers. "One-on-one" questions. Speaker verification is the process of extracting personality characteristics from the interlocutor's voice, establishing a recognition model, and verifying the identity of the interlocutor. It is an important branch of speaker recognition research. In speaker confirmation, the extraction of effective feature parameters and the establishment of high-performance recognition models are the key. [0003] Support vector machine (Support Vector Machine, SVM) and Gaussian Mixture Model ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L17/06G10L17/02

CPCG10L17/02G10L17/06

Inventor 简志华郭珊徐剑金易帆

Owner HANGZHOU DIANZI UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Speaker verification method and system based on tensor structure and sparse representation

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology