Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Underlined text image preprocessing method and device

A text image and preprocessing technology, which is applied in the field of optical character recognition, can solve problems that affect the correct recognition of characters, cannot be correctly positioned, reduce the recognition rate of characters and the adaptability of the recognition core, and achieve improved recognition rate and strong adaptability Effect

Active Publication Date: 2012-05-09
HANVON CORP
View PDF3 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] This method is effective for the underline separated from the character, but for the case where the character and the underline are glued together, the straight line may not be correctly positioned, or the glued part of the character and the underline may be deleted, which will affect the correct recognition of the character and reduce the recognition rate of the character and identify core fitness

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Underlined text image preprocessing method and device
  • Underlined text image preprocessing method and device
  • Underlined text image preprocessing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0048] The underlined text image preprocessing method of the present invention will be described in detail below with reference to the accompanying drawings and taking the underlined processing of English characters as an example.

[0049] Such as figure 1 shown, also refer to figure 2 , a specific embodiment of the underlined text image preprocessing method of the present invention, comprising the following steps:

[0050] Step 1: Receive the text line image, a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an underlined text image preprocessing method and device, relating to the field of optical character recognition. The method comprises the following steps: acquiring the position of each text line in a text image; for the position of each text line, searching each text line based on a run-length search method; if the preliminary determination result shows that an underline exists in the text line, locating the position of the initial upper boundary of the underline; by using the initial upper boundary of the underline as an initial pixel line, locating the underline region based on run-length search and connected domain analysis methods; separating out stroke regions of characters from the underline region, thus obtaining a region to be deleted; and setting the foreground information in the region to be deleted into the background, thus obtaining a character region having no underline. By searching each text line based on the run-length search method for the position of each text line, the invention solves the problem that a text having an underline (especially an underline conglutinated with characters) is difficult to recognize, improves the character recognition rate, and enhances the adaptability of the recognition core.

Description

technical field [0001] The invention belongs to the field of optical character recognition (OCR), and relates to an underlined text image preprocessing method and device. Background technique [0002] In printed character recognition, the general processing flow is: first divide the text image into several lines, so that each text line contains only a single line of text; and then further character segmentation and recognition. [0003] If there is an underscore under the character, it will not only affect the normal segmentation of the character, but also cause the character recognition engine to fail to recognize the corresponding character correctly. Therefore, it is usually necessary to remove the underscore below the character before character segmentation and recognition. [0004] In the prior art, a simple straight line detection method (such as Hough transform, etc.) is usually used. If a long straight line is detected under the character image, the image in the lin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/34G06K9/20
Inventor 万鑫刘正珍
Owner HANVON CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products