Text-to-speech conversion method and device, electronic device and storage medium
A text and speech technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as inability to separate words, great impact on results, and errors in word segmentation, so as to ensure accuracy, improve accuracy, and improve correctness. rate effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0035] figure 1 It is a flow chart of a text-to-speech method provided by Embodiment 1 of the present invention. The technical solution of this embodiment is applicable to the case of text-to-speech, and the method can be implemented by a text-to-speech device, which can be implemented by hardware and / or software, the method of text-to-speech specifically includes:
[0036] Step 110, acquiring a preset text normalization template matching the text to be processed.
[0037] Among them, the user can input text through an input device, or obtain text through Optical Character Recognition (OCR), and match the obtained text to be processed with a preset text normalization template, and in all preset text normalization templates Search for preset text normalization templates that match the text to be processed.
[0038] Step 120: Perform text normalization processing on the text to be processed according to the matching preset text normalization template to obtain normalized text....
Embodiment 2
[0048] figure 2 It is a flowchart of a text-to-speech method provided by Embodiment 2 of the present invention. The technical solution of this embodiment is further refined on the basis of the above technical solution. The method includes:
[0049] Step 210, storing pre-generated preset text normalization templates in the text normalization template library.
[0050] Among them, the preset text normalization template can be established by the designer, and then the preset text normalization template can be stored in the text normalization template library, and the preset text normalization template stored in the text normalization template library can be added and deleted and update.
[0051] Step 220: Store pre-segmentation templates in the text normalization template library, and establish a correspondence relationship between preset text normalization templates and pre-segmentation templates.
[0052] Among them, the designer can establish a pre-segmentation template for t...
Embodiment 3
[0065] image 3 It is a schematic structural diagram of a text-to-speech device provided in Embodiment 3 of the present invention. The text-to-speech device 300 includes:
[0066] A preset text normalization template acquisition module 310, configured to acquire a preset text normalization template matching the text to be processed;
[0067] A normalized text determination module 320, configured to perform text normalization processing on the text to be processed according to the matched preset text normalization template to obtain the normalized text;
[0068] The pre-segmentation information adding module 330 is used to add pre-segmentation information in the normalized text according to the pre-segmentation template corresponding to the preset text normalization template;
[0069] The word segmentation text determination module 340 is used to carry out word segmentation to the normalized text according to the pre-segmentation information and the word segmentation model, an...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com