Multi-modal face recognition method based on attention mechanism

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A face recognition and attention technology, applied in character and pattern recognition, neural learning methods, computer parts and other directions, can solve the problems of low accuracy and low recognition accuracy, and achieve high recognition accuracy, good robustness, Robustness-enhancing effect

Pending Publication Date: 2021-05-14

SOUTH CHINA UNIV OF TECH

View PDF10 Cites 6 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Most of the existing 3D face recognition uses data fusion, feature fusion or fractional fusion to improve the performance of the 3D face recognition model, such as: Chinese patent" 201911397767.1" disclosed "a 3D face recognition method based on the fusion of multiple data types", "201410343015.8" disclosed "3D face recognition method based on feature points" and "202011000990.0" disclosed "a Low-precision 3D face recognition method based on depth map quality enhancement", etc., the recognition accuracy of these existing technologies is low, which cannot meet the existing needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0057] In order to make the object, technical solution and advantages of the present invention clearer, the specific implementation of the present invention will be described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0058] see Figure 1-5 , the present embodiment discloses a multimodal face recognition method based on an attention mechanism, and the algorithm includes the following steps:

[0059] Step 1: Establish an RGBD face database. The database includes multimodal face images of 89 people (50 males and 39 females), totaling 17,622 images.

[0060] The database is obtained through the Realsense D435I depth camera. The color image and the depth image obtained by the Realsense D435I are respectively imaged by different sensors on the device, which will cause the imaging areas of the two ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention is applied to the field of pattern recognition, and provides a multi-modal face recognition method based on an attention mechanism, a network model afr_net constructs the attention mechanism by adopting a CBAM and an SAVM. A space and channel attention module CBAM is added in each block of ResNet18. ResNet18 combined with an attention mechanism is utilized to establish RGB, depth and branches of fusion modes of the RGB and the depth, so that features of three modes are obtained, then the features of the three modes are fused and input into a sharing layer. The feature vectors are obtained through a vectorization module SAVM based on the space attention mechanism and a full connection layer. The method not only overcomes the defects of a traditional two-dimensional face recognition method, but also effectively fuses RGB and depth modals, and enhances the RGB-D face recognition capability.

Description

technical field [0001] The invention relates to the fields of machine deep learning and image processing and recognition, in particular to a multimodal face recognition method based on an attention mechanism. Background technique [0002] The technical research of face recognition began in the 1960s, and gradually became a hot topic in the field of computer vision. Recently, with the rapid progress of deep learning technology and the open source of a large number of two-dimensional face data sets, two-dimensional face recognition It was a huge success. Since AlexNet was proposed and cited in 2012, most face recognition models have adopted a deep learning strategy with CNN as the backbone. In 2015, Google proposed the FaceNet model, which achieved an accuracy of 99.63% on the LFW benchmark, and its performance surpassed that of humans. Most of these two-dimensional face recognition progresses use two-dimensional images (RGB), and two-dimensional RGB images contain limited f...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/00G06K9/62G06N3/04G06N3/08

CPCG06N3/08G06V40/172G06V40/168G06N3/048G06N3/045G06F18/214G06F18/24

Inventor 姜立标张俊伟

Owner SOUTH CHINA UNIV OF TECH

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Multi-modal face recognition method based on attention mechanism

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology