Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-modal face recognition method based on attention mechanism

A face recognition and attention technology, applied in character and pattern recognition, neural learning methods, computer parts and other directions, can solve the problems of low accuracy and low recognition accuracy, and achieve high recognition accuracy, good robustness, Robustness-enhancing effect

Pending Publication Date: 2021-05-14
SOUTH CHINA UNIV OF TECH
View PDF10 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Most of the existing 3D face recognition uses data fusion, feature fusion or fractional fusion to improve the performance of the 3D face recognition model, such as: Chinese patent" 201911397767.1" disclosed "a 3D face recognition method based on the fusion of multiple data types", "201410343015.8" disclosed "3D face recognition method based on feature points" and "202011000990.0" disclosed "a Low-precision 3D face recognition method based on depth map quality enhancement", etc., the recognition accuracy of these existing technologies is low, which cannot meet the existing needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-modal face recognition method based on attention mechanism
  • Multi-modal face recognition method based on attention mechanism
  • Multi-modal face recognition method based on attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] In order to make the object, technical solution and advantages of the present invention clearer, the specific implementation of the present invention will be described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0058] see Figure 1-5 , the present embodiment discloses a multimodal face recognition method based on an attention mechanism, and the algorithm includes the following steps:

[0059] Step 1: Establish an RGBD face database. The database includes multimodal face images of 89 people (50 males and 39 females), totaling 17,622 images.

[0060] The database is obtained through the Realsense D435I depth camera. The color image and the depth image obtained by the Realsense D435I are respectively imaged by different sensors on the device, which will cause the imaging areas of the two ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention is applied to the field of pattern recognition, and provides a multi-modal face recognition method based on an attention mechanism, a network model afr_net constructs the attention mechanism by adopting a CBAM and an SAVM. A space and channel attention module CBAM is added in each block of ResNet18. ResNet18 combined with an attention mechanism is utilized to establish RGB, depth and branches of fusion modes of the RGB and the depth, so that features of three modes are obtained, then the features of the three modes are fused and input into a sharing layer. The feature vectors are obtained through a vectorization module SAVM based on the space attention mechanism and a full connection layer. The method not only overcomes the defects of a traditional two-dimensional face recognition method, but also effectively fuses RGB and depth modals, and enhances the RGB-D face recognition capability.

Description

technical field [0001] The invention relates to the fields of machine deep learning and image processing and recognition, in particular to a multimodal face recognition method based on an attention mechanism. Background technique [0002] The technical research of face recognition began in the 1960s, and gradually became a hot topic in the field of computer vision. Recently, with the rapid progress of deep learning technology and the open source of a large number of two-dimensional face data sets, two-dimensional face recognition It was a huge success. Since AlexNet was proposed and cited in 2012, most face recognition models have adopted a deep learning strategy with CNN as the backbone. In 2015, Google proposed the FaceNet model, which achieved an accuracy of 99.63% on the LFW benchmark, and its performance surpassed that of humans. Most of these two-dimensional face recognition progresses use two-dimensional images (RGB), and two-dimensional RGB images contain limited f...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V40/172G06V40/168G06N3/048G06N3/045G06F18/214G06F18/24
Inventor 姜立标张俊伟
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products