The invention provides a recognition method for printed Chinese documents containing mathematical formulas, comprising three modules: layout analysis, Chinese character recognition, and mathematical formula recognition;
the layout analysis module binarizes the BMP image as a preprocessing step, segments it into text blocks, image blocks, and table blocks using a projection method combined with a bottom-up layout analysis algorithm, and saves the image blocks and table blocks;
the Chinese character recognition module performs pseudo-merging of rows in the text blocks, selects segmentation parameters, extracts features, recognizes the Chinese characters, records the rejected results, and merges adjacent rejected results in the same row to locate the formula regions;
the mathematical formula recognition module extracts and segments the formula characters in the rejected character regions, merges some of the characters, and recognizes them; finally, the relationships between the characters are obtained through structure analysis of the formula characters, and the final one-dimensional character string is output. Testing shows that the recognition performance of the invention is remarkable.
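
The projection step of the layout analysis can be illustrated with a minimal Python sketch. It is not the implementation of the invention: it covers only binarization and projection-profile splitting, not the bottom-up layout analysis algorithm, and the function names, the fixed threshold of 128, and the `min_gap` parameter are all assumptions made for illustration.

```python
from typing import List, Tuple

Image = List[List[int]]


def binarize(gray: Image, threshold: int = 128) -> Image:
    """Map a grayscale image (rows of 0-255 values) to 1 = ink, 0 = background.

    The fixed threshold is an assumption; the source does not say how it is chosen.
    """
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def projection_runs(profile: List[int], min_gap: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) spans, end exclusive, where the profile is non-zero.

    Spans separated by fewer than min_gap empty positions are merged.
    """
    runs: List[Tuple[int, int]] = []
    start = None
    for i, count in enumerate(profile):
        if count > 0 and start is None:
            start = i
        elif count == 0 and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(profile)))

    merged: List[Tuple[int, int]] = []
    for run in runs:
        if merged and run[0] - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], run[1])
        else:
            merged.append(run)
    return merged


def split_rows(binary: Image, min_gap: int = 1) -> List[Tuple[int, int]]:
    """Horizontal projection: count ink pixels per row, then find ink spans."""
    return projection_runs([sum(row) for row in binary], min_gap)


def split_columns(binary: Image, top: int, bottom: int,
                  min_gap: int = 1) -> List[Tuple[int, int]]:
    """Vertical projection restricted to the rows top..bottom (exclusive)."""
    width = len(binary[0]) if binary else 0
    profile = [sum(binary[y][x] for y in range(top, bottom)) for x in range(width)]
    return projection_runs(profile, min_gap)


if __name__ == "__main__":
    gray = [
        [255, 255, 255, 255, 255, 255],
        [0, 0, 255, 255, 0, 255],
        [0, 0, 255, 255, 0, 255],
        [255, 255, 255, 255, 255, 255],
        [255, 0, 0, 0, 255, 255],
    ]
    binary = binarize(gray)
    for top, bottom in split_rows(binary):
        print((top, bottom), split_columns(binary, top, bottom))
```

Raising `min_gap` makes the split coarser, which is one simple way to separate whole blocks rather than individual characters.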
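
The formula-locating step, in which adjacent rejected results in the same row are merged, might be sketched as follows. This is a hedged illustration only: the `CharResult` record, the use of `None` to mark a rejected result, and the `max_gap` pixel tolerance are hypothetical choices, not details given in the source.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class CharResult:
    row: int               # index of the text row the segment belongs to
    left: int              # left pixel column of the segment
    right: int             # right pixel column, exclusive
    label: Optional[str]   # recognized character, or None if rejected


def locate_formula_regions(results: List[CharResult],
                           max_gap: int = 10) -> List[Tuple[int, int, int]]:
    """Merge runs of rejected segments into (row, left, right) candidate regions.

    A run is closed by an accepted character, by a row change, or by a
    horizontal gap larger than max_gap (an assumed tolerance).
    """
    regions: List[Tuple[int, int, int]] = []
    current: Optional[List[int]] = None

    for r in sorted(results, key=lambda c: (c.row, c.left)):
        if r.label is not None:
            # An accepted character ends any open run of rejections.
            if current is not None:
                regions.append((current[0], current[1], current[2]))
                current = None
            continue
        if (current is not None and current[0] == r.row
                and r.left - current[2] <= max_gap):
            current[2] = max(current[2], r.right)   # extend the open run
        else:
            if current is not None:
                regions.append((current[0], current[1], current[2]))
            current = [r.row, r.left, r.right]      # start a new run

    if current is not None:
        regions.append((current[0], current[1], current[2]))
    return regions


if __name__ == "__main__":
    results = [
        CharResult(0, 0, 10, "设"),
        CharResult(0, 12, 20, None),
        CharResult(0, 22, 30, None),
        CharResult(0, 32, 40, None),
        CharResult(0, 42, 52, "则"),
        CharResult(1, 0, 8, None),
        CharResult(1, 10, 20, "的"),
    ]
    print(locate_formula_regions(results))
```

Each returned region would then be handed to the formula recognition stage for character extraction and structure analysis.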