Speech retrieval method and system based on inverse fast Fourier transform
A technology combining the inverse Fourier transform and speech, applied in speech analysis, speech recognition, digital data information retrieval, and related fields. It addresses the problems of low accuracy and low retrieval efficiency, and achieves efficient extraction together with high retrieval efficiency and retrieval accuracy.
Examples
Embodiment 1
[0067] Figure 1 is a flow chart of the speech retrieval method based on the inverse fast Fourier transform in Embodiment 1 of the present invention.
[0068] Referring to Figure 1, the speech retrieval method based on the inverse fast Fourier transform in this embodiment comprises:
[0069] Step S1: Obtain the voice to be queried. The voice to be queried is submitted by a mobile terminal user.
[0070] Step S2: Perform feature extraction on the speech to be queried using the inverse fast Fourier transform (IFFT) to obtain a feature vector to be queried.
[0071] Step S3: Reduce the dimensionality of the feature vector to be queried using a measurement matrix to obtain the dimensionality-reduced feature vector to be queried.
[0072] Specifically, a partial Hadamard measurement matrix (a subset of the full Hadamard matrix) is used to reduce the dimensionality of the feature vector to be queried.
[0073] Step S4: Construct a hash sequence to be queried according to the feature vector to be queried after dim...
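Steps S2 and S3 can be illustrated with a minimal sketch, using only the Python standard library. Several details here are assumptions, not taken from the patent text: the feature is taken to be the magnitude of the inverse transform of one frame, the "part" of the Hadamard matrix is taken to be its first k rows, and the function names (`inverse_dft`, `extract_feature`, `hadamard`, `reduce_dimension`) are hypothetical. The inverse transform is computed with a naive O(n^2) inverse DFT, which yields the same values an IFFT would.

```python
import cmath


def inverse_dft(frame):
    """Naive O(n^2) inverse DFT; an IFFT returns the same values faster."""
    n = len(frame)
    return [
        sum(frame[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def extract_feature(frame):
    """Assumption: use the magnitude of the inverse transform as the feature."""
    return [abs(c) for c in inverse_dft(frame)]


def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n a power of two)."""
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = [[1]]
    while len(h) < n:
        # Build [[H, H], [H, -H]] from the current matrix H.
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def reduce_dimension(feature, k):
    """Project a length-n feature onto k rows of the Hadamard matrix.

    Assumption: the partial measurement matrix is the first k rows.
    """
    rows = hadamard(len(feature))[:k]
    return [sum(h * x for h, x in zip(row, feature)) for row in rows]
```

For a feature of length n, `reduce_dimension(feature, k)` returns k values with k <= n. How those values are turned into the hash sequence in Step S4 is not reproduced here, since the source text is truncated at that point.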
Embodiment 2
[0103] The speech retrieval method based on the inverse fast Fourier transform in this embodiment includes three processes: system hash index table construction, ciphertext speech library construction, and user speech retrieval.
[0104] Step 1: System hash index table construction
[0105] 1) Perceptual hashing scheme construction
[0106] The efficient speech perceptual hashing scheme proposed in this embodiment is obtained by applying the inverse fast Fourier transform (IFFT) to the original speech and then reducing the dimensionality of the result with a partial Hadamard measurement matrix. The steps of perceptual hash scheme construction are as follows:
[0107] Step 1: Preprocessing. The speech segment s(t) is pre-emphasized to obtain the signal s'(t). Pre-emphasis flattens the frequency spectrum of the signal and facilitates subsequent feature extraction.
[0108] Step 2: Framing and windowing. Divide the speech segment s'(t) into m frames of equal length and non-overlap...
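The preprocessing and framing steps above can be sketched as follows. This is an illustration under stated assumptions rather than the patent's exact procedure: pre-emphasis is written as the common first-order filter y[n] = x[n] - alpha * x[n-1] with alpha = 0.97, the frames are taken to be non-overlapping, and a Hamming window is used because the window type is not given in the text shown here. The function names `pre_emphasize` and `frame_and_window` are hypothetical.

```python
import math


def pre_emphasize(signal, alpha=0.97):
    """First-order pre-emphasis: y[n] = x[n] - alpha * x[n-1].

    Assumption: alpha = 0.97 is a commonly used value, not taken from the source.
    """
    if not signal:
        return []
    return [signal[0]] + [
        signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))
    ]


def frame_and_window(signal, m):
    """Split the signal into m equal-length frames and window each frame.

    Assumptions: frames do not overlap, and a Hamming window is applied.
    Trailing samples that do not fill a whole frame are dropped.
    """
    if m < 1:
        raise ValueError("m must be at least 1")
    length = len(signal) // m
    if length < 2:
        raise ValueError("signal too short for m frames")
    window = [
        0.54 - 0.46 * math.cos(2 * math.pi * i / (length - 1))
        for i in range(length)
    ]
    return [
        [signal[j * length + i] * window[i] for i in range(length)]
        for j in range(m)
    ]
```

A typical call chain would be `frames = frame_and_window(pre_emphasize(samples), m)`, where `samples` is a list of floats and each resulting frame is then passed on to feature extraction.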
Embodiment 3
[0147] Figure 4 is a schematic structural diagram of a speech retrieval system based on the inverse fast Fourier transform in Embodiment 3 of the present invention.
[0148] Referring to Figure 4, the speech retrieval system based on the inverse fast Fourier transform in this embodiment comprises:
[0149] The first voice acquisition module 401 is configured to acquire the voice to be queried.
[0150] The first feature extraction module 402 is configured to perform feature extraction on the speech to be queried by using an inverse fast Fourier transform to obtain a feature vector to be queried.
[0151] The first dimensionality reduction module 403 is configured to perform dimensionality reduction on the feature vector to be queried by using a measurement matrix to obtain a feature vector to be queried after dimensionality reduction.
[0152] The first sequence construction module 404 is configured to construct a hash sequence to be queried according to the feature vector to be queri...