Text processing method and system, data processing equipment and storage medium

A text processing and text technology, applied in the field of computer vision, can solve problems such as low efficiency, repetitive text, and poor text readability, and achieve the effects of saving time and cost, avoiding sorting problems, and improving processing efficiency

Pending Publication Date: 2020-04-17
SHANGHAI XIAOI ROBOT TECH CO LTD
View PDF5 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, when recognizing the typesetting text in the image, the OCR technology often ignores the typesetting, so that there are sorting problems in the recognized text such as repeated, missing and misplaced text, and the recognized text is poor in readability.
At this time, only manual calibration and adjustment can be performed, which increases time and cost, and is inefficient
[0004] Therefore, the existing OCR technology cannot accurately and completely process the typesetting text in the image

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text processing method and system, data processing equipment and storage medium
  • Text processing method and system, data processing equipment and storage medium
  • Text processing method and system, data processing equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] As mentioned above, OCR technology has been widely used in various fields for text recognition to obtain required information data, which are often output in the form of plain text. Therefore, when recognizing typesetting text in an image, OCR technology often ignores the typesetting, and does not coordinate and integrate the extracted text information, so that the recognized text has repeated, missing, and misplaced characters. Problem, the recognized text is poorly readable. At this time, only manual calibration and adjustment can be performed, which increases time and cost, and is inefficient.

[0064] In view of the above problems, the embodiment of this specification provides a text processing scheme, which can first identify the corner points of the image containing the typesetting text, obtain the position information of the corner points in the image, and then based on the position information of the corner points in the image Information, connect the corner po...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text processing method and system, data processing equipment and a storage medium, and the method comprises the steps: carrying out corner recognition of an image containinga typesetting text, and obtaining the position information of a corner in the image; based on the position information of the corner points in the image, connecting the corner points in the image according to a preset connection rule to obtain a corresponding corner point connection graph; determining position information of each first connected domain unit in the corner connection graph; and based on the position information of each first connected domain unit, matching the characters obtained by identifying the corresponding positions in the image to obtain corresponding text data. By adopting the scheme, the readability of the text can be improved.

Description

technical field [0001] The embodiments of this specification relate to the technical field of computer vision, and in particular to a text processing method and system, a data processing device, and a storage medium. Background technique [0002] At present, computer vision technology has been widely used, and an optical character recognition (Optical Character Recognition, OCR) technology is usually used for image recognition. OCR technology is good at identifying plain text without typesetting in images. [0003] However, when recognizing the typesetting text in the image, the OCR technology often ignores the typesetting, so that there are sorting problems in the recognized text such as repeated, missing and misplaced text, and the recognized text is poor in readability. At this time, only manual calibration and adjustment can be performed, which increases time and cost, and is inefficient. [0004] Therefore, the existing OCR technology cannot accurately and completely ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/46
CPCG06V30/414G06V10/44G06V30/10
Inventor 张波王晓珂
Owner SHANGHAI XIAOI ROBOT TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products