Identification method for on-line handwritten Tibetan characters based on components

A recognition method and technology of component categories, which are applied in character and pattern recognition, computer parts, instruments, etc., can solve the problems of large storage capacity of classifier parameters, large number of similar character categories, and large number of Tibetan character categories, etc. Improve character recognition accuracy, meet storage requirements, and reduce the amount of dictionary storage

Active Publication Date: 2012-11-07
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The advantage of this method is that the model complexity of structural primitives is low, while the disadvantage is that the extraction of substructures is difficult and the accuracy is low
At present, the research on online handwritten Tibetan character recognition is based on statistical methods. The large number of categories of Tibetan characters leads to a large storage capacity of classifier parameters; on the other hand, the large number of similar character categories affects the performance of the classifier. Recognition accuracy; these two main reasons lead to the fact that the recognition performance of Tibetan characters has not yet reached the high demand of handwriting recognition technology for pen-type mobile devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Identification method for on-line handwritten Tibetan characters based on components
  • Identification method for on-line handwritten Tibetan characters based on components
  • Identification method for on-line handwritten Tibetan characters based on components

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In the following, the component-based on-line handwritten Tibetan character recognition method of the present invention will be described in detail with reference to the accompanying drawings.

[0025] This embodiment uses the MRG-OHTC sample database of the Multilingual Processing Research Group of the National Engineering Research Center for Basic Software, Institute of Software, Chinese Academy of Sciences. The database includes samples of Tibetan characters from 130 different writers, and each writer completed the writing of 910 commonly used characters (basic set and extended A set). The experiment selects 562 categories of Tibetan characters for testing, each category has 130 sets of samples, and the samples that cannot correctly mark the segmentation points of the parts are eliminated. Select 105 sets of samples for training, and the remaining 25 sets of samples for testing. In addition, the position marks of the part segmentation points of the characters in the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of minority language information processing, and in particular relates to an identification method for on-line handwritten Tibetan characters based on components. According to the invention, a traditional identification method based on statistics is broken through and components are used as basic identification objects; the identification method comprises the steps of: firstly, performing component division on an input character to obtain sub-structure sequences arranged in a rule; and then, obtaining a correct identification result of component string breakpoints and the component strings from the sub-structure sequences via an integrated identification method based on condition random field; and finally determining the category of the character based on the identification result. The invention is applied in the handwritten identification input of mobile equipment based on pen-type interaction, and the invention has the advantages of small storage quantity of identification method, high identification precision and high demand satisfaction of pen-type mobile equipment.

Description

technical field [0001] The invention belongs to the field of on-line handwritten character recognition for ethnic minority language and text information processing, and relates to a recognition method for Tibetan characters, in particular to a component-based on-line handwritten Tibetan character recognition method. Background technique [0002] Pen-based online handwritten character recognition technology is an easy-to-use and effective real-time tool, which has been widely used in computers and handheld mobile devices (such as mobile phones, PDAs, etc.). The popularity of pen input devices and the expansion of applications have brought new opportunities to the application of handwritten character recognition technology, and at the same time put forward higher requirements for recognition performance. Further improving the recognition accuracy, reducing the amount of calculation and storage space is the next research goal. Tibetan character recognition technology is an imp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/68
Inventor 马龙龙吴健刘汇丹
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products