Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text phonetic notation method, electronic equipment and storage medium

A text and text technology, applied in text database query, unstructured text data retrieval, electronic digital data processing, etc., can solve problems such as difficult to accurately predict the pronunciation of polyphonic characters, phonetic errors, etc.

Active Publication Date: 2021-05-18
ZHANGYUE TECH CO LTD
View PDF10 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since the pronunciation of polyphonic characters may change with the context and semantics, it is difficult to accurately predict the pronunciation of polyphonic characters in various scenarios based on commonly used words, and phonetic errors often occur

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text phonetic notation method, electronic equipment and storage medium
  • Text phonetic notation method, electronic equipment and storage medium
  • Text phonetic notation method, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0029] figure 1 A flow chart of a text phonetic notation method provided by an embodiment of the present invention is shown. like figure 1 As shown, the method includes the following steps:

[0030] Step S110: Match the acquired text to be phoneticized with the preset polyphone list, and identify the target polyphone included in the text to be phoneticized according to the matching result.

[0031] Wherein, the text to be phoneticized refers to the text that needs to be marked with pinyin. Specifically, the text to be phoneticized may be the original text of the electronic book, or a text obtained after preprocessing the original text of the electronic book, and the source and form of the text to be phoneticized are not limited in the present invention. In addition, the preset list of polyphonic characters is used to store known polyphonic characters. The list of polyphonic characters can be generated based on pre-collected polyphonic characters, or based on feedback from u...

Embodiment 2

[0041] figure 2 A flow chart of a text phonetic notation method provided by another embodiment of the present invention is shown. like figure 2 As shown, the method includes the following steps:

[0042]Step S200: respectively obtaining training sample sets corresponding to each polyphone sample, and training to obtain a polyphone model corresponding to each polyphone sample based on the training sample set corresponding to each polyphone sample, and training the obtained polyphone model. The individual polyphonic models of , are added to the polyphonic model collection.

[0043] Specifically, in order to accurately predict the pronunciation of different polyphonic words, in this embodiment, a corresponding polyphonic word model is trained for each polyphonic word, so as to predict the meaning of the polyphonic word in different contexts and the pronunciation rules. Pinyin of the polyphonic word.

[0044] During the specific implementation, first, text data is obtained, ...

Embodiment 3

[0072] An embodiment of the present application provides a non-volatile computer storage medium, the computer storage medium stores at least one executable instruction, and the computer executable instruction can execute the text phonetic notation method in any of the above method embodiments.

[0073] Specifically, the executable instruction can be used to make the processor perform the following operations:

[0074] Matching the acquired text to be phoneticized with a preset list of polyphonic characters, and identifying the target polyphonic characters contained in the text to be phoneticized according to the matching result;

[0075] Acquiring the context information of the target polyphonic character in the text to be phoneticized, and generating a prediction feature vector corresponding to the target polyphonic character according to the context information;

[0076] Querying the polyphonic character model corresponding to the target polyphonic character from the pre-tra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text phonetic notation method, electronic equipment and a storage medium. The method comprises the steps: carrying out the matching of an obtained to-be-phonetic notation text with a preset polyphone list, and recognizing a target polyphone contained in the to-be-phonetic notation text according to a matching result; acquiring context information of the target polyphone in the text to be subjected to phonetic notation, and generating a prediction feature vector corresponding to the target polyphone according to the context information; querying a polyphone model corresponding to the target polyphone from a polyphone model set obtained by pre-training, and inputting the prediction feature vector into the queried polyphone model; and performing phonetic notation on the target polyphone according to an output result of the polyphone model. According to the method, the pronunciation of the polyphone can be accurately predicted by fully utilizing the context information of the polyphone, and the labeling accuracy is remarkably improved.

Description

technical field [0001] The invention relates to the field of computers, in particular to a text phonetic notation method, electronic equipment and a storage medium. Background technique [0002] At present, with the increasing popularity of audiobooks, more and more users are accustomed to obtaining information by listening to books. In the process of generating audiobooks, it is necessary to mark accurate pinyin for each text, so as to realize text-to-speech conversion processing according to the pinyin. [0003] Since there are polyphonic characters in Chinese characters, and the pronunciation of polyphonic characters varies with different contexts, how to accurately recognize the pronunciation of polyphonic characters and mark the correct pinyin for them has become a technical problem to be solved urgently. In the traditional way, several common words corresponding to different pronunciations are stored for each polyphonic character, and the pronunciation of the polyphon...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/33G06F16/383G06F40/157G06F40/279
CPCG06F16/3343G06F16/383G06F40/157G06F40/279
Inventor 陈梦瑶朱军
Owner ZHANGYUE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products