Tibetan language speech recognition method based on HMM and DNN
A speech recognition and Tibetan language technology, applied in speech recognition, speech analysis, instruments, etc., to achieve the effect of improving efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0043] Based on the HMM-DNN Tibetan speech recognition system, its construction includes the following steps:
[0044] Step 1: Record Tibetan speech data, and label the recorded Tibetan speech data to establish a database.
[0045] Step 2: Perform data preparation, organize several files required for training the model, extract MFCC, and perform cepstral mean variance normalization.
[0046] In a speech recognition system, the first step is feature extraction. Information such as the pitch of a voice can reflect a person's speech characteristics. A person's speech characteristics can be reflected in the shape of the vocal tract. If the shape can be accurately known , then we can accurately describe the generated phonemes. The shape of the vocal tract is displayed in the envelope of the short-term power spectrum of speech. MFCC is a feature that accurately describes this envelope.
[0047] First, pre-emphasize, frame and window the speech; then analyze each short time window, ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com