Pedestrian re-identification method integrated with position awareness attention
A pedestrian re-recognition and attention technology, applied in the field of computer vision, can solve the problems of excessive parameters and ignoring useless information, so as to achieve good recognition effect and improve expression ability.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0042] see figure 1 , a pedestrian re-identification method incorporating position-aware attention provided by the present invention mainly includes inputting the output feature map obtained from the original image through the first two layers of the ResNet50 network into the position-aware attention module for processing and converting the position-aware attention The module is integrated into the ResNet50 network for training and testing.
[0043] The output feature map obtained from the original image through the first two layers of the ResNet50 network is input into the position-aware attention module for processing, including:
[0044] S1: Obtain the input feature map, extract three different feature maps through the convolution filter, perform the pooling operation on two of the feature maps to obtain the feature maps φ and g, and the feature map θ remains unchanged; then the above three-dimensional feature Figures θ, φ, and g are flattened and straightened into a two-d...
Embodiment 2
[0051] The inventor found that in the ResNet50 network person re-identification method, the model considers that the importance of each sub-feature in the feature map is the same, and all features need to be considered, resulting in slow training speed and inability to efficiently extract useful tasks. key features. In order to solve the above problems, in a preferred embodiment of the present invention, a non-local attention module is provided, the basic structure of which is as follows figure 2 shown, the specific steps are as follows:
[0052] S1.1 will input feature map X ∈ R b×c×h×w Through three different weight coefficients and the number of output channels is the number of input channels The 1 × 1 convolution filter of the Width and channel number reduction factor;
[0053] S1.2 Select feature maps φ and g from three different feature maps for pooling operation to obtain feature maps and The feature map without pooling operation is denoted as
[0054] S1.3...
Embodiment 3
[0079] The present invention also provides an example for showing a specific experimental process of the method provided by the present invention.
[0080] In this embodiment, three data sets of Market1501, DukeMTMC-ReID, and CUHK03 are used for training and testing. Market1501 was collected from the campus of Tsinghua University in the summer of 2015, including 1,501 pedestrian IDs, and a total of 32,668 images were collected by 6 cameras. The training set contains 751 pedestrian IDs and a total of 12,936 images. The test set contains the remaining 750 IDs, 3368 There are 15,913 images to be retrieved; DukeMTMC-reID was collected from the Duke University campus in the winter of 2015, including 1,812 pedestrian IDs, and a total of 36,411 images. The training set contains 702 pedestrian IDs and a total of 16,522 images. The set contains the remaining 702 pedestrian ID pictures. The CUHK03 dataset contains 14,096 images labeled by humans, and 14,097 images labeled by detection,...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com