Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Lip language recognition method combining graph neural network and multi-feature fusion

A multi-feature fusion and neural network technology, which is applied in the field of lip language recognition combined with graph neural network and multi-feature fusion, can solve the problems of increasing lip language recognition accuracy and limited information extraction ability, and achieve high accuracy and high recognition results. Accuracy and strong extraction ability

Active Publication Date: 2021-05-28
HEBEI UNIV OF TECH
View PDF10 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of these methods do not take into account the influence of lip deflection angle, light intensity, light angle and speaker identity information on the lip language recognition task, and the ability of traditional methods to extract information about lip sequence changes is also very limited. The accuracy of language recognition is in a rising bottleneck period

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Lip language recognition method combining graph neural network and multi-feature fusion
  • Lip language recognition method combining graph neural network and multi-feature fusion
  • Lip language recognition method combining graph neural network and multi-feature fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Specific examples of the present invention are given below. The specific embodiments are only used to further describe the present invention in detail, and do not limit the protection scope of the claims of the present application.

[0030] The invention provides a lip language recognition method (method for short) in combination with a graph neural network and multi-feature fusion, which is characterized in that the method comprises the following steps:

[0031] S1. Create a recognition network dataset: select samples from the public lip recognition dataset ouluvs2, use FaceGen software for 3D face reconstruction, and export the face change sequence and save it as an RGB video as a recognition network dataset;

[0032] Preferably, in S1, since face reconstruction requires images of a frontal face and two side faces, the currently used lip language recognition dataset ouluvs2 provides 0°, 30°, 45°, 60°, High-definition face images of 5 angles at 90°, so this method use...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a lip language recognition method combining a graph neural network and multi-feature fusion. The method comprises the following steps: firstly, extracting and constructing a face change sequence, marking face feature points, correcting a lip deflection angle, performing pre-processing through a trained lip semantic segmentation network, training a lip language recognition network through a graph structure of a single-frame feature point relationship and a graph structure of an adjacent-frame feature point relationship, and finally, generating a lip language recognition result through the trained lip language recognition network. CNN lip features and lip region feature points obtained after CNN extraction and feature fusion are performed on an identification network data set and a lip semantic segmentation network data set are subjected to the extraction and fusion by the GNN lip features obtained after GNN extraction and fusion and then input into BiGRU for identification, and the problems that time sequence feature extraction is difficult and lip feature extraction is affected by external factors are solved; the method effectively extracts the static features of the lip and the dynamic features of the lip change, and has the characteristics of high lip change feature extraction capability, high recognition result accuracy and the like.

Description

technical field [0001] The invention belongs to the field of artificial intelligence and deep learning, and specifically relates to a lip language recognition method combined with a graph neural network and multi-feature fusion. Background technique [0002] With the development of science and technology and the improvement of hardware manufacturing level, artificial intelligence technology based on deep learning has attracted more and more attention from researchers. The field of deep learning includes many subfields, such as machine vision, natural language processing, etc. Lip recognition, which combines machine vision and natural language processing, has gained more and more attention. Lip language recognition has a very wide range of application scenarios, such as liveness detection based on lip features, communication assistance for the hearing-impaired, and voice recovery from traffic cameras. [0003] There are many difficulties in lip language recognition. For exa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V40/171G06V40/172G06N3/047G06N3/045G06F18/2415
Inventor 张成伟赵昊天张满囤刘川申冲
Owner HEBEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products