Method and device for detecting wrongly written characters, computer storage medium and electronic equipment
A typo detection method and related technology, applied in the field of data processing, that addresses the complex process and low efficiency of existing methods for identifying typos.
Examples
Embodiment 1
[0034] Figure 1 shows a schematic flowchart of the typo detection method in the first embodiment of the present application.
[0035] As shown in the figure, the typo detection method includes:
[0036] Step 101, determining the text data to be detected;
[0037] Step 102, converting the text data into pinyin data;
[0038] Step 103, generating an ngram-model-based feature template for the pinyin data;
[0039] Step 104, inputting the feature template of the pinyin data into a pre-built typo detection model, where the typo detection model is obtained by training a conditional random field (CRF) model with ngram-model-based feature templates;
[0040] Step 105, determining, according to the output of the typo detection model, whether the text data to be detected contains a typo.
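Steps 103 to 105 can be illustrated with a small sketch. It assumes the pinyin data is already available as a list of syllables and builds unigram and bigram context features for each position, which is one plausible form of an ngram-based feature template. The window size, the feature names, the `PAD` token and the label names `OK`/`TYPO` are illustrative assumptions, not details taken from the application. The model is treated as any callable that returns one label per syllable; `toy_model` is a hypothetical stand-in that only checks bigrams against a small set, and a trained CRF would take its place.

```python
from typing import Callable, Dict, List

PAD = "<pad>"  # assumed padding token for positions outside the sentence


def ngram_features(pinyin: List[str], i: int) -> Dict[str, str]:
    """Build unigram and bigram context features for position i."""
    def at(k: int) -> str:
        return pinyin[k] if 0 <= k < len(pinyin) else PAD

    return {
        "uni[-1]": at(i - 1),
        "uni[0]": at(i),
        "uni[+1]": at(i + 1),
        "bi[-1,0]": at(i - 1) + "/" + at(i),
        "bi[0,+1]": at(i) + "/" + at(i + 1),
    }


def feature_template(pinyin: List[str]) -> List[Dict[str, str]]:
    """Step 103: one feature dict per pinyin syllable."""
    return [ngram_features(pinyin, i) for i in range(len(pinyin))]


# Any sequence labeler: feature dicts in, one label per position out.
Labeler = Callable[[List[Dict[str, str]]], List[str]]


def has_typo(pinyin: List[str], model: Labeler) -> bool:
    """Steps 104 and 105: run the model and check for any TYPO label."""
    labels = model(feature_template(pinyin))
    return any(label == "TYPO" for label in labels)


# Hypothetical stand-in for a trained CRF: flags unseen left-context bigrams.
KNOWN_BIGRAMS = {
    "<pad>/jin", "jin/tian", "tian/tian", "tian/qi", "qi/hen", "hen/hao",
}


def toy_model(features: List[Dict[str, str]]) -> List[str]:
    return ["OK" if f["bi[-1,0]"] in KNOWN_BIGRAMS else "TYPO" for f in features]
```

Calling `has_typo(["jin", "tian", "tian", "qu", "hen", "hao"], toy_model)` flags the sequence because the bigram `tian/qu` is not in the known set. Keeping the labeler as a plain callable means the feature generation can be reused unchanged once a real CRF is trained.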
[0041] In a specific implementation, the text data to be detected consists of Chinese characters or Chinese text. The conversion of text data into pinyin data can specifi...
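The conversion of Chinese text into pinyin can be sketched as a per-character lookup. The table below is a tiny hypothetical stand-in for a full pronunciation dictionary, and both the toneless output and the pass-through of unknown characters are assumptions made for illustration. Polyphonic characters, which need context to resolve, are not handled here.

```python
from typing import Dict, List

# Hypothetical toy table; a real system would use a full dictionary.
TOY_PINYIN: Dict[str, str] = {
    "今": "jin",
    "天": "tian",
    "气": "qi",
    "很": "hen",
    "好": "hao",
}


def to_pinyin(text: str, table: Dict[str, str] = TOY_PINYIN) -> List[str]:
    """Map each character to a toneless pinyin syllable.

    Characters missing from the table are passed through unchanged.
    """
    return [table.get(ch, ch) for ch in text]
```

For example, `to_pinyin("今天天气很好")` yields one syllable per character, which is the form of input a per-position feature template expects.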
Embodiment 2
[0095] Based on the same inventive concept, this embodiment of the present application provides a typo detection device. The device solves the technical problem on a principle similar to that of the typo detection method, so details already described are not repeated here.
[0096] Figure 2 shows a schematic structural diagram of the typo detection device in Embodiment 2 of the present application.
[0097] As shown in the figure, the typo detection device includes:
[0098] Data determining module 201, for determining the text data to be detected;
[0099] Pinyin conversion module 202, for converting the text data into pinyin data;
[0100] Template generating module 203, for generating an ngram-model-based feature template for the pinyin data;
[0101] Model detection module 204, for inputting the ngram-model-based feature template of the pinyin data into the pre-built typo detection model; the typo detection model is obtained according to the conditional rand...
Embodiment 3
[0126] Based on the same inventive concept, an embodiment of the present application further provides a computer storage medium, which will be described below.
[0127] The computer storage medium stores a computer program thereon, and when the computer program is executed by a processor, the steps of the typo detection method as described in the first embodiment are implemented.
[0128] The computer storage medium provided in this embodiment of the present application converts the text data to be detected into pinyin, generates a feature template of the pinyin data, and inputs it into a pre-built typo detection model to determine whether the text data contains typos. The embodiment applies the CRF model to typo detection and adds a feature template based on the ngram language model, which combines the characteristics of the language model with the scalability of CRF feature functions, making the process of typo detection simple an...