Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Chinese lip language recognition method and device based on hybrid convolutional neural network

A technology of convolutional neural network and recognition method, which is applied in the field of Chinese lip recognition method and device, can solve problems such as the inability to apply Chinese lip recognition, and achieve the effects of fast lip recognition, saving manpower, and improving robustness

Pending Publication Date: 2020-07-10
NORTHEASTERN UNIV
View PDF5 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these mature and excellent network architectures can only recognize English lip language. Due to the differences between graphic language characters such as Chinese and alphabetic languages ​​​​such as English, the above network architecture cannot be applied to Chinese lip language recognition.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese lip language recognition method and device based on hybrid convolutional neural network
  • Chinese lip language recognition method and device based on hybrid convolutional neural network
  • Chinese lip language recognition method and device based on hybrid convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] In order to enable those skilled in the art to better understand the solution of the present invention, the following will give a clear and complete description of the present invention in conjunction with the accompanying drawings in the implementation of the present invention.

[0036] figure 1 It is a schematic flow chart of a Chinese lip recognition method based on a hybrid convolutional neural network of the present invention; a Chinese lip recognition method based on a hybrid convolutional neural network comprises the following steps:

[0037] S1: Obtain the facial image information of the speaker through the camera;

[0038] Use a USB camera to fix in front of the speaker, 45cm away from the speaker, start from receiving the voice signal, and obtain each frame of the real-time video captured by the camera;

[0039] S2: Use the face detection modeler to obtain the face area, extract the position of the fixed point of the lip of the face detection model, and then ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Chinese lip language recognition method and device based on a hybrid convolutional neural network, and belongs to the field of machine vision and deep learning. The method comprises the following steps: acquiring the facial image information of a speaker through a camera; detecting and cutting a lip image sequence from the face image information by utilizing a face detector; carrying out the lip feature extraction on the lip image sequence by using a hybrid convolutional neural network; inputting the lip features into a Bi-GRU model, obtaining an identification probability result of the phoneme unit; inputting an identification probability result of the phoneme unit into a connection time sequence classifier CTC; obtaining a phoneme unit classification result; processing the classification result of the phoneme units by adopting a decoding method of introducing an attention mechanism. According to the method, the problem that an existing network framework cannot recognize graphic language characters such as Chinese is solved, possibility is provided for application of a lip language recognition technology in an actual scene, and the method can be widely popularized in the field of computer vision.

Description

technical field [0001] The present invention relates to the field of machine vision and deep learning, in particular to a Chinese lip language recognition method and device based on a hybrid convolutional neural network. Background technique [0002] With the development of artificial intelligence technology and the improvement of security awareness, voice interaction and identity recognition have become a widely used technology. However, these technologies still have some disadvantages. For example, voice interaction is easily affected by the environment, and noise interference is prone to occur, resulting in inaccurate voice recognition. Static identification technology is easy to be copied and imitated, resulting in the leakage of personal information and the theft of identity authentication information. In order to improve inaccurate speech recognition and enhance dynamic identity authentication technology, lip language recognition technology has emerged. [0003] Lip ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/46G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V40/166G06V40/171G06V40/20G06V20/41G06V20/49G06V10/464G06N3/045G06F18/2415
Inventor 李晶皎聂雅昆闫爱云王爱侠
Owner NORTHEASTERN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products