
OCR picture and text recognition and retrieval method and system through web mode

A retrieval system and image-text technology, applied in the field of image-text recognition, that achieves convenient and efficient retrieval.

Inactive Publication Date: 2009-06-24
JIANGYIN MINGLUN TECH
0 Cites, 35 Cited by

AI Technical Summary

Problems solved by technology

[0004] The problem to be solved by the embodiments of the present invention is to provide a web-based method and system for OCR image-text recognition and retrieval, so as to overcome the difficulty in the prior art of retrieving image-text formats that cannot be freely edited.

Method used

Examples

Embodiment 1

[0024] A web-based method for OCR image-text recognition and retrieval in an embodiment of the present invention is shown in Figure 1 and includes the following steps:

[0025] Step s101, obtaining the image and text to be recognized. In this embodiment, any device with image capture or scanning capability, such as a scanner, a digital camera, an all-in-one machine, or a camera phone, is used to obtain the image and text to be recognized.

[0026] Step s102, performing layout analysis on the image and text.

[0027] Step s103, performing OCR recognition on the image and text, and acquiring text and picture information in the image and text.

[0028] Step s104, storing the text and picture information in the OCR database.

[0029] Step s105, performing a full-text search in the OCR database according to keywords. In this embodiment, the full-text retrieval technology is used to conveniently and efficiently retrieve the required information resources by inputting the te...
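
Steps s104 and s105 can be illustrated with a minimal sketch, assuming the OCR step has already produced plain text for each scanned document. The source does not describe the database or the search algorithm, so the `OcrDatabase` class, the `tokenize` helper, and the inverted-index approach below are hypothetical stand-ins rather than the patented implementation.

```python
import re
from collections import defaultdict
from dataclasses import dataclass, field


def tokenize(text: str) -> list:
    """Split text into lowercase word tokens (an assumed, simplistic tokenizer)."""
    return re.findall(r"\w+", text.lower())


@dataclass
class OcrDatabase:
    """Toy in-memory stand-in for the OCR database (hypothetical, not from the source)."""
    documents: dict = field(default_factory=dict)  # doc_id -> recognized text
    index: dict = field(default_factory=lambda: defaultdict(set))  # term -> doc_ids

    def store(self, doc_id: str, text: str) -> None:
        """Step s104: store the recognized text and index each of its terms.

        Assumes each doc_id is stored once; re-storing an id does not purge old terms.
        """
        self.documents[doc_id] = text
        for term in tokenize(text):
            self.index[term].add(doc_id)

    def search(self, query: str) -> list:
        """Step s105: return sorted ids of documents that contain every query keyword."""
        terms = tokenize(query)
        if not terms:
            return []
        hits = set.intersection(*(self.index.get(t, set()) for t in terms))
        return sorted(hits)


db = OcrDatabase()
db.store("scan-001", "Invoice for optical character recognition software")
db.store("scan-002", "Meeting notes on character encoding")
print(db.search("character recognition"))  # ['scan-001']
```

An inverted index is only one way to support keyword lookup; a production system would more likely rely on the full-text facilities of its database engine.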

Embodiment 2

[0031] A web-based method for OCR image-text recognition and retrieval in an embodiment of the present invention is shown in Figure 2 and includes the following steps:

[0032] Step s201, acquiring the image and text to be recognized. In this embodiment, any device with image capture or scanning capability, such as a scanner, a digital camera, an all-in-one machine, or a camera phone, is used to obtain the image and text to be recognized.

[0033] Step s202, performing layout analysis on the image and text.

[0034] Step s203, performing OCR recognition on the image and text, and obtaining text and picture information in the image and text.

[0035] Step s204, proofreading the text information. In this embodiment, complex layouts are analyzed automatically, text in various mixed formats is analyzed intelligently, and comprehensive horizontal and vertical corrections are applied to the recognized documents without excessive manual intervention.

[0036...

Embodiment 3

[0039] A web-based method for OCR image-text recognition and retrieval in an embodiment of the present invention is shown in Figure 3 and includes the following steps:

[0040] Step s301, obtaining the image and text to be recognized. In this embodiment, any device with image capture or scanning capability, such as a scanner, a digital camera, an all-in-one machine, or a camera phone, is used to obtain the image and text to be recognized.

[0041] Step s302, performing layout analysis on the image and text.

[0042] Step s303, performing OCR recognition on the image and text, and obtaining text and picture information in the image and text.

[0043] Step s304, exporting the text and picture information to a text format file that can be edited, copied, or quoted. In this embodiment, the text format files include Word, RTF, and other text format files that can be edited, copied, and quoted.

[0044] Step s305, storing the text information in the OCR database.

[0045...
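
The export in step s304 can be sketched for the RTF case as follows. This is a minimal illustration that handles text only, not pictures or layout; the `to_rtf` and `export_rtf` helper names are hypothetical, and the source does not say how the export is actually performed.

```python
import struct
from pathlib import Path


def _unicode_escape(ch: str) -> str:
    """Encode one non-ASCII character as RTF \\uN? escapes (signed 16-bit units)."""
    data = ch.encode("utf-16-le")
    units = struct.unpack("<%dh" % (len(data) // 2), data)
    return "".join("\\u%d?" % u for u in units)


def to_rtf(text: str) -> str:
    """Wrap recognized plain text in a minimal RTF document."""
    parts = []
    for ch in text:
        if ch in "\\{}":
            parts.append("\\" + ch)       # escape RTF special characters
        elif ch == "\n":
            parts.append("\\par\n")       # newline becomes a paragraph break
        elif ord(ch) < 128:
            parts.append(ch)              # plain ASCII passes through
        else:
            parts.append(_unicode_escape(ch))
    return "{\\rtf1\\ansi " + "".join(parts) + "}"


def export_rtf(text: str, path: str) -> None:
    """Write the RTF document to disk; the escaped output is pure ASCII."""
    Path(path).write_text(to_rtf(text), encoding="ascii")


print(to_rtf("a{b}\nc"))
```

Because every non-ASCII character is escaped, the resulting file opens in common word processors regardless of the system code page. Exporting to Word formats would need a dedicated library and is not attempted here.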

Abstract

The invention discloses a web-based method for OCR image-text recognition and retrieval. The method comprises the following steps: acquiring text information and picture information from the image and text to be recognized; storing the text information and the picture information in an OCR database; and performing a full-text search in the OCR database. The invention further discloses a system for OCR image-text recognition and retrieval, which comprises an image-text information acquisition unit, an OCR database, and a retrieval unit. By using OCR image-text recognition, the invention makes recognition more efficient and allows editable text formats to be exported; by using full-text retrieval, the required information resources can be retrieved conveniently and effectively by entering characters that appear in the picture information.
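
As a rough illustration of how a retrieval unit might be exposed over the web, the sketch below defines a small WSGI application that answers keyword queries with JSON. The source gives no details of the web interface, so the `q` query parameter, the JSON response shape, the sample `OCR_TEXTS` data, and the substring-matching `search` helper are all assumptions made for this example.

```python
import json
from urllib.parse import parse_qs

# Hypothetical stand-in for the OCR database: document id -> recognized text.
OCR_TEXTS = {
    "scan-001": "Invoice for optical character recognition software",
    "scan-002": "Meeting notes on character encoding",
}


def search(keyword: str) -> list:
    """Return sorted ids of documents whose text contains the keyword (case-insensitive)."""
    needle = keyword.lower()
    return sorted(doc_id for doc_id, text in OCR_TEXTS.items() if needle in text.lower())


def _respond(start_response, status: str, payload: dict) -> list:
    """Serialize the payload as JSON and send the status line and headers."""
    body = json.dumps(payload).encode("utf-8")
    start_response(status, [
        ("Content-Type", "application/json"),
        ("Content-Length", str(len(body))),
    ])
    return [body]


def app(environ, start_response):
    """WSGI application: a request with ?q=keyword returns matching document ids."""
    params = parse_qs(environ.get("QUERY_STRING", ""))
    keyword = params.get("q", [""])[0].strip()
    if not keyword:
        return _respond(start_response, "400 Bad Request",
                        {"error": "missing query parameter q"})
    return _respond(start_response, "200 OK",
                    {"query": keyword, "hits": search(keyword)})
```

For local experimentation, the application could be served with the standard library's `wsgiref.simple_server.make_server("localhost", 8000, app).serve_forever()` and queried at `http://localhost:8000/?q=invoice`.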

Description

Technical field

[0001] The present invention relates to the technical field of image-text recognition, and in particular to an OCR (Optical Character Recognition) image-text recognition and retrieval method and system.

Background

[0002] Retrieval refers to the process and technology of organizing information in a certain way and finding relevant information according to the needs of information users; that is, the process of finding the required information in an information collection.

[0003] Because the text in image files cannot be recognized well, image-text formats that cannot be freely edited are very difficult to retrieve. Faced with image files of varied content, a managing organization must spend considerable manpower and material resources to manually rearrange, enter, and classify them before they can be unified into a text format for retrieval.

Contents of the invention

[00...

Claims

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06K9/20
Inventor: 凌辉, 黄惠良
Owner: JIANGYIN MINGLUN TECH