
Certificate image character extraction method

A certificate text extraction technology, applied in neural learning methods, character recognition, computer components, etc., that addresses the problems of messy extracted text, increased text recognition computation, and failure to extract structured information from the data, thereby improving accuracy and noise immunity.

Pending Publication Date: 2021-03-19
SHENZHEN TAIJI SOFTWARE CO LTD

AI Technical Summary

Problems solved by technology

[0003] In the prior art, a document text recognition method and device (CN108549881A) cannot extract structured information from the data. Not all of the text on a certificate is useful, and recognizing certificate information indiscriminately increases the amount of computation needed for text recognition and leaves the extracted text messy.



Examples


Detailed Description of the Embodiments

[0050] To make the purpose, features and advantages of the present invention clearer and easier to understand, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The embodiments described below are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments without creative effort fall within the protection scope of the present invention.

[0051] As shown in Figures 1-6, a certificate image text extraction method comprises the following steps:

[0052] S1. Input the certificate image and adjust its resolution while keeping its aspect ratio unchanged; the purpose of adjusting the re...
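The resolution adjustment in S1 can be illustrated with a minimal sketch. The source does not state a target resolution, so the `target_long_side` parameter and the function name `scaled_size` are assumptions made for illustration only; the sketch shows just the aspect-ratio-preserving size computation, not the patent's actual resizing procedure.

```python
from typing import Tuple


def scaled_size(width: int, height: int, target_long_side: int) -> Tuple[int, int]:
    """Return a new (width, height) whose longer side equals target_long_side.

    Both sides are multiplied by the same factor, so the aspect ratio is
    preserved up to rounding. target_long_side is a hypothetical parameter;
    the source does not specify the resolution that is used.
    """
    if width <= 0 or height <= 0 or target_long_side <= 0:
        raise ValueError("all dimensions must be positive")
    scale = target_long_side / max(width, height)
    # Round to whole pixels and never let a side collapse to zero.
    return max(1, round(width * scale)), max(1, round(height * scale))
```

The returned size could then be passed to whatever image library performs the actual resampling.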



Abstract

The invention discloses a certificate image text extraction method. The method comprises the following steps: S1, inputting a certificate image; S2, detecting the text positions in the certificate image with a text detection model and marking each text position with a marking box; S3, collecting statistics on the position distribution of the marking boxes in the certificate image, judging the orientation of the image, and adjusting it; S4, establishing a plane coordinate system, merging and sorting the marking boxes in the same row according to the Y axis, and obtaining an information box for each row of text; S5, aligning a standard template with the information boxes, outputting the intersection of each information box and the standard template, and cropping and outputting a text picture; and S6, recognizing the text picture with a text recognition model and extracting the text content.
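Steps S4 and S5 can be sketched roughly as follows. The abstract does not say how boxes are judged to be in the same row, so the rule used here (a box joins the current row when its vertical center falls inside that row's Y range) is an assumption, as are the names `Box`, `merge_rows` and `intersect`. Template alignment is not shown; the template region is assumed to be already expressed in image coordinates.

```python
from typing import List, Optional, Tuple

# (x_min, y_min, x_max, y_max) in image coordinates, origin at the top-left.
Box = Tuple[int, int, int, int]


def merge_rows(boxes: List[Box]) -> List[Box]:
    """Merge marking boxes that share a row into one information box per row.

    Assumed grouping rule: boxes are visited in order of vertical center, and
    a box joins the current row if its center lies within the row's Y range.
    Rows are returned from top to bottom.
    """
    rows: List[Box] = []
    for x1, y1, x2, y2 in sorted(boxes, key=lambda b: (b[1] + b[3]) / 2):
        center_y = (y1 + y2) / 2
        if rows and rows[-1][1] <= center_y <= rows[-1][3]:
            rx1, ry1, rx2, ry2 = rows[-1]
            # Grow the row's information box to cover the new marking box.
            rows[-1] = (min(rx1, x1), min(ry1, y1), max(rx2, x2), max(ry2, y2))
        else:
            rows.append((x1, y1, x2, y2))
    return rows


def intersect(info_box: Box, template_box: Box) -> Optional[Box]:
    """Return the overlap of an information box and a template region, or None."""
    x1, y1 = max(info_box[0], template_box[0]), max(info_box[1], template_box[1])
    x2, y2 = min(info_box[2], template_box[2]), min(info_box[3], template_box[3])
    return (x1, y1, x2, y2) if x1 < x2 and y1 < y2 else None
```

The rectangle returned by `intersect` is the region that would be cropped from the image and passed to the recognition model in S6.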

Description

Technical field

[0001] The invention relates to the fields of image processing and text recognition, and in particular to a method for extracting text from a certificate image.

Background

[0002] Recognizing the text in a certificate image is common and important in many scenarios. For example, in financial scenarios such as remote account opening, online lending and payment verification, the name, address, ID number and other information on the user's ID card must be recognized to check whether the person matches the certificate. In law enforcement, the industrial and commercial department often needs to recognize the business name, legal representative and unified social credit code on a business license, and to check whether this information is consistent with the records in the database of the department's system ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/00, G06K9/32, G06N3/04, G06N3/08
CPC: G06N3/08, G06N3/049, G06V30/413, G06V10/243, G06V20/62, G06V30/10, G06N3/045
Inventor: 吴志雄, 白丹, 周兴杰, 冯智辉
Owner: SHENZHEN TAIJI SOFTWARE CO LTD