Speech deep hash learning method and system based on CNN
A learning method and deep technology, applied in the field of speech retrieval based on deep learning, can solve the problems of manual feature defects, low query accuracy and efficiency, and achieve the effect of accelerating convergence speed, improving query accuracy and efficiency, and improving robustness.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0096] This embodiment adopts the speech in the Chinese speech database--THCHS-3 released by Tsinghua University Speech and Language Technology Center (CSLT) to evaluate the proposed method, the speech sampling frequency is 16kHz, the sampling size is 16bits, and the speech content is 1000 sentences There are 13,388 speech clips in the database for news clips with different contents, each speech clip is about 10s long, and the total length is about 30 hours. In the experiment of the present invention, 10 sections of voices with different voice content spoken by 17 people were selected, and various voice content maintenance operations including volume adjustment, adding noise, weighting, resampling, MP3, etc. were carried out, and a total of 3060 voices were obtained. Speech training is expected to improve the robustness of the system while increasing the amount of data. In the experimental analysis stage, 1000 voices were randomly selected from the THCHS-30 voice library for e...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com