Unified Chinese-English mixed text generation and speech recognition end-to-end framework
A mixed text and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as data mismatch
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0086] Such as figure 1 The end-to-end framework for unified Chinese-English mixed text generation and speech recognition provided by the embodiment of the present application includes:
[0087] Chinese-English mixed phoneme sequence generation module, speech feature extraction module, acoustic feature sequence convolution downsampling module, acoustic encoder, phoneme embedding module, phoneme encoder, discriminator and decoder; the phoneme encoder and the discriminator Constitute a generation confrontation network, the phoneme coder is used as the generator of the generation confrontation network, the discriminator is the discriminator of the generation confrontation network, and the acoustic encoder is used as the true data input of the generation confrontation network, Using this confrontational generative network to promote the distribution of the phoneme coded representation output by the phoneme encoder close to the acoustic coded representation output by the acoustic c...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com