Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A natural scene text detection method based on full convolution neural network

A convolutional neural network, text detection technology, applied in the field of computer vision, can solve the problems of affecting the accurate detection of text objects, loss of accurate location information, etc., and achieve excellent natural scene text detection performance, high accuracy, and strong representation ability. Effect

Active Publication Date: 2019-02-01
NANJING UNIV
View PDF11 Cites 54 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These models generally contain multiple convolutional layers and pooling layers. The extraction of features at different levels is achieved through continuous convolution and pooling operations. On the one hand, the ability to express feature semantics is improved, but on the other hand, precise location information is also lost. , which affects the accurate detection of text objects to a certain extent

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A natural scene text detection method based on full convolution neural network
  • A natural scene text detection method based on full convolution neural network
  • A natural scene text detection method based on full convolution neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The following describes the technical solutions in the embodiments of the present invention clearly and completely with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0025] First, explain and explain the terms in the present invention:

[0026] ResNet-50: ResNet is a network model architecture for target detection proposed by He Kaiming and others. It is named ResNet-34, ResNet-50, ResNet-152, etc. according to the number of network layers used. ResNet generally consists of 5 parts. The first part consists of a convolutional layer with a 7*7 convolution kernel, and then passes through a pooling layer wit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a natural scene text detection method based on full convolution neural network, This method uses CNN network to extract the feature representation of text, and adjusts the feature representation by feature fusion module, at the same time fuses the semantic feature of high-level feature map and the position information of low-level feature map, so that the extracted featurehas stronger representation ability, and combines with text prediction module to directly predict the candidate text object. This method adopts end-to-end training and prediction process, The processing flow is simple, and no multi-step hierarchical processing is needed. Finally, the final detection result is obtained by simple NMS operation, which has high accuracy and strong robustness. The multi-directional and multi-size text object in the natural scene image with complex background can be detected well, and the detection performance of the natural scene text is excellent.

Description

Technical field [0001] The invention belongs to the technical field of computer vision, and specifically relates to a natural scene text detection method based on a full convolutional neural network. Background technique [0002] Natural scene text usually carries rich semantic information about the scene and image content, and plays a very important role in many application fields such as image retrieval, annotation, and content analysis. Compared with the text in the scanned document, the appearance attributes such as font, size, direction and color of natural scene text and factors such as image background and lighting are more complex and changeable. At the same time, blur and resolution may occur when natural scene image is collected. Too low conditions make natural scene text detection a challenging task. [0003] Traditional natural scene text detection methods can be divided into two categories, namely text detection methods based on connected components and text detection...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06N3/04
CPCG06N3/045
Inventor 汪洋苏丰
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products