Mixed voice recognition method and device, storage medium, and electronic device
A technology of mixing speech and recognition methods, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as affecting recognition time, high cost, and no solution found, and achieve low efficiency, reliable quality, and labor cost saving. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0036] The method embodiment provided in Embodiment 1 of the present application may be executed in a server, a computer, a mobile phone, a speech recognition device, a recording pen, or a similar computing device. Take running on mobile phone as an example, figure 1 It is a block diagram of the hardware structure of a mobile phone according to the embodiment of the present invention. Such as figure 1 As shown, the mobile phone can include one or more ( figure 1 Only one is shown in ) processor 102 (processor 102 may include but not limited to processing devices such as microprocessor MCU or programmable logic device FPGA) and memory 104 for storing data. Optionally, the above-mentioned mobile phone can also be A transmission device 106 for communication functions and an input and output device 108 are included. Those of ordinary skill in the art can understand that, figure 1 The structure shown is only for illustration, and it does not limit the structure of the above-men...
Embodiment 2
[0084] In this embodiment, a hybrid speech recognition device is also provided, which is used to realize the above embodiments and preferred implementation modes, and what has been explained will not be repeated. As used below, the term "module" may be a combination of software and / or hardware that realizes a predetermined function. Although the devices described in the following embodiments are preferably implemented in software, implementations in hardware, or a combination of software and hardware are also possible and contemplated.
[0085] Figure 8 is a structural block diagram of a mixed speech recognition device according to an embodiment of the present invention, such as Figure 8 As shown, the device includes: an acquisition module 80, a first extraction module 82, and a first identification module 84, wherein,
[0086] Obtaining module 80, is used for obtaining the mixed speech of waiting phoneme recognition, wherein, described mixed speech comprises Chinese word ...
Embodiment 3
[0097] The embodiment of the present application also provides an electronic device, Figure 9 is a structural diagram of an electronic device according to an embodiment of the present invention, such as Figure 9 As shown, it includes a processor 91, a communication interface 92, a memory 93 and a communication bus 94, wherein the processor 91, the communication interface 92, and the memory 93 complete mutual communication through the communication bus 94, and the memory 93 is used to store computer programs;
[0098] Processor 91, when being used to execute the program stored on the memory 93, realize the following steps: obtain the mixed speech to be recognized by phonemes, wherein, the mixed speech includes Chinese words and English words; extract English non-English words from the mixed speech Abbreviated words; using the first preset grapheme sequence to phoneme sequence G2P model to identify the first phoneme information of the English non-abbreviated word, wherein the ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com