Pedestrian detection method based on feature pyramid network
A feature pyramid and pedestrian detection technology, applied in the fields of image processing and pattern recognition, can solve problems such as poor generalization performance, difficult to apply real-time industrial application scenarios, and loss of precision, and achieve a wide range of applications
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0024] see figure 1 , the pedestrian detection method based on the feature pyramid network, including the following steps:
[0025] S1. Establish a FOSFPN network;
[0026] S2. Read the video frame and perform preprocessing;
[0027] S3. Propagate the frame image forward to obtain the detection frame;
[0028] S4. Perform non-maximum value suppression on the detection results to filter redundant detection frames;
[0029] S5. Standardize the detection frame beyond the boundary to obtain the final detection result.
[0030] The present invention designs a new network structure: the Fast One Stage Feature Pyramid Network [Fast One Stage Feature Pyramid Network (FOSFPN)], using a lightweight deep neural network (MobileNet) as the basic network structure to construct a feature pyramid, And the classification loss function and regression loss function are respectively added on the feature maps of different scales for network training.
[0031] The feature pyramid structure com...
Embodiment 2
[0038] The difference from the above embodiment is that the preprocessing of reading the video frame is to subtract the average value of the image from the image sent to the FOSFPN network and normalize to (0,1).
[0039] The reading of the video frame described in this embodiment is to perform mean value processing on the image. Subtract the corresponding mean value for each channel, and then divide by 255 to get the normalized input data;
[0040] The forward propagation of the frame image is to send the frame image into the trained FOSFPN network for forward propagation operation, and the output of each layer of pyramid network is used to calculate the classification results and regression results respectively.
[0041] The classification result represents the probability that the frame image anchor target is a pedestrian or the background; the regression result represents the deviation of the prediction result relative to the frame image anchor point; the anchor point whos...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com