Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Deep learning algorithm based on multi-task and nearby information fusion for object detection

A technology of target detection and deep learning, applied in the direction of neural learning methods, computing, computer components, etc., can solve the problems of reducing recognition speed, fast recognition speed, and accelerating the application of convolutional neural network, so as to increase the speed and reduce convolution Calculate, shorten the effect of the process

Active Publication Date: 2018-12-28
FOSHAN SHUNDE SUN YAT SEN UNIV RES INST +2
View PDF8 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Starting from RCNN, namely Regions with CNN features, the convolutional neural network was introduced into the field of target detection, which greatly improved the effect of target detection; subsequently, SppNET, Fast-RCNN and Faster-RCNN were proposed to further accelerate the convolutional neural network in target detection. field of application, but at the same time there is a contradiction between accuracy and recognition speed, the recognition speed is reduced due to repeated feature extraction and calculation, and a large storage space is required
In addition, there is also a structure of YOLO, which is You only look once, which has a fast recognition speed, but at the expense of a certain accuracy rate.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning algorithm based on multi-task and nearby information fusion for object detection
  • Deep learning algorithm based on multi-task and nearby information fusion for object detection
  • Deep learning algorithm based on multi-task and nearby information fusion for object detection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] refer to figure 1 and figure 2 , the deep learning algorithm of the multi-task based on target detection of the present invention and adjacent information fusion comprises the following steps:

[0022] Step S1: Input an initialized picture with real frames, use the pre-trained convolutional neural network to extract image features, and generate a small number of obvious target candidate frames;

[0023] Step S2: Using the image features obtained in step S1, pass the picture through the region candidate network to extract a large number of target prediction frames;

[0024] Step S3: The target prediction frame obtained in step S2 is subjected to feature extraction through the convolutional layer and feature pooling through the pooling layer, and then through the first fully connected layer to perform preliminary border regression and the direction between the target prediction frame and the real frame Prediction, preliminary target detection and classification, and ob...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-task and proximity information fusion depth learning algorithm based on object detection, which comprises the steps: pictures are input, image features by using a convolution neural network are extracted, and target candidate frames are generated; using the image features, the image is passed through the region candidate network to extract the target prediction frame; feature extraction and feature pooling of the target prediction frame, then the border regression, direction prediction, target detection and classification are performed to get the preliminary detection results; the preliminary detection result is fused with the target candidate frame and enters the ROI pooling layer and passes through the second full connection layer to obtain the final detection result; the classification of target detection is to redefine the confidence score of a target prediction box by using the information relationship between the target prediction box and its neighboring other target prediction boxes. The algorithm adopts multi-task output mode. The invention not only improves the speed of the target detection, but also ensures the accuracy of the target detection and achieves the requirement of the real-time target detection.

Description

technical field [0001] The invention relates to the field of image information processing, in particular to a multi-task based target detection and a deep learning algorithm for fusion of adjacent information. Background technique [0002] At present, target detection has always been a basic problem in the application of visual computing, which is applied in traffic monitoring, intelligent driving and other fields. In real conditions, on the one hand, due to the diversity of targets such as vehicles, pedestrians, numbers, railings, etc. on the road, there are many subcategories of targets such as buses, cars, trucks, bicycles, etc. on the other hand. The target has multiple angles, different occlusion situations and the local size of the target display, which brings great difficulty to target detection. Target detection is still a very challenging field, and in order to achieve target detection, recognition, and tracking in real time, there are quite high requirements for d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V20/54G06N3/045G06F18/24
Inventor 胡建国杨焕
Owner FOSHAN SHUNDE SUN YAT SEN UNIV RES INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products