Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cross-modal pedestrian re-identification method and system based on heterogeneous hierarchical attention mechanism

A pedestrian re-identification and attention mechanism technology, applied in the field of cross-modal pedestrian re-identification methods and systems, can solve the problems of difficulty in measuring sample similarity across modalities and difficult to identify.

Active Publication Date: 2021-02-19
中科人工智能创新技术研究院(青岛)有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The inventor found in the research that the difficulty of pedestrian re-identification technology is: the heterogeneity between samples of different modalities brings great difficulties to measure the similarity of samples across modalities; at the same time, because all pictures belong to the same Pedestrian categories, and the corresponding descriptions of different pedestrians are relatively similar, it is difficult to accurately identify

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-modal pedestrian re-identification method and system based on heterogeneous hierarchical attention mechanism
  • Cross-modal pedestrian re-identification method and system based on heterogeneous hierarchical attention mechanism
  • Cross-modal pedestrian re-identification method and system based on heterogeneous hierarchical attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] It should be noted that the following detailed description is exemplary and intended to provide further explanation of the present disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs.

[0069] It should be noted that the terminology used herein is only for describing specific embodiments, and is not intended to limit the exemplary embodiments according to the present disclosure. As used herein, unless the context clearly dictates otherwise, the singular is intended to include the plural, and it should also be understood that when the terms "comprising" and / or "comprising" are used in this specification, they mean There are features, steps, operations, means, components and / or combinations thereof.

[0070] In a typical implementation of the present disclosure, such as figure 1 As shown, a cross-modal pedestrian re-identific...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

This disclosure proposes a cross-modal pedestrian re-identification method and system based on a heterogeneous hierarchical attention mechanism, including: extracting pedestrian image features and text description features, and using them as the initial global features of the pedestrian image channel and text description channel respectively; A structured hierarchical attention model, which uses a two-way cross-modal fine-grained matching attention mechanism and a context-guided local feature aggregation attention mechanism, while enhancing pedestrian image features and text description features; using a two-stage training method In the first stage, the pedestrian category supervision information is used for preliminary training, and on this basis, the cross-modal sample matching pedestrian category supervision information is used for the second stage of training, and the trained model is used for training. Pedestrian re-identification. The disclosure can improve the accuracy of pedestrian re-identification.

Description

technical field [0001] The present disclosure relates to the technical fields of computer vision, pattern recognition and multimodal computing, in particular to a cross-modal pedestrian re-identification method and system based on a heterogeneous hierarchical attention mechanism. Background technique [0002] Pedestrian re-identification is an important and challenging classic computer vision task, which has a wide range of applications in security monitoring, intelligent video analysis, personnel search and rescue retrieval and other fields. [0003] The cross-modal person re-identification method based on text description has the characteristics of easy generation of description and the ability to provide relatively rich information for retrieval. [0004] The inventor found in the research that the difficulty of pedestrian re-identification technology is: the heterogeneity between samples of different modalities brings great difficulties to measure the similarity of sampl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/00G06K9/46G06K9/62G06F40/211G06F40/289G06N3/04G06N3/08
Inventor 王亮黄岩牛凯王海滨李凯
Owner 中科人工智能创新技术研究院(青岛)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products