Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Processing method and device of lip language recognition model, computer equipment and storage medium

A recognition model and lip language technology, applied in the computer field, can solve problems such as affecting the effect of knowledge distillation, lack of flexibility, and inability to adjust teaching content, and achieve the effect of improving learning effect and recognition performance.

Pending Publication Date: 2021-12-21
SOUTH CHINA UNIV OF TECH +1
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current teacher model is usually a pre-trained model, without considering the current ability of the student model to perform lip recognition tasks. Due to the neglect of the needs of the student model, the teacher model often lacks flexibility when adjusting teaching knowledge. It is impossible to dynamically adjust the teaching content according to the development of the student model, thus affecting the effect of knowledge distillation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Processing method and device of lip language recognition model, computer equipment and storage medium
  • Processing method and device of lip language recognition model, computer equipment and storage medium
  • Processing method and device of lip language recognition model, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0090] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0091] The processing method of the lip recognition model provided by this application realizes the training of the lip recognition model and also realizes the lip recognition by using the computer vision technology and the machine learning technology in the artificial intelligence technology (Artificial Intelligence, AI).

[0092] Computer Vision Technology (Computer Vision, CV) Computer vision is a science that studies how to make machines "see". More specifically, it refers to machine vision that uses cameras and computers instead of human eyes to identify, track and mea...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a processing method and device of a lip language recognition model, computer equipment and a storage medium. The method relates to a computer vision technology of artificial intelligence, the whole distillation process is divided into a student training stage and a master training stage which are alternately trained, and in the master training stage, a temporary training sample is utilized to update a student model updated by the previous alternate training again, the obtained temporary student model feeds back a current learning state to the master model through a verification sample, and the master model is guided to adaptively adjust teaching knowledge according to the current feedback; and, in addition, the master model is supervised by master training samples, and teaching content is adjusted through master identification loss determined by the master training samples. Then, the student model in a student training stage is trained, and repeated iteration is performed for multiple times to obtain a lip language recognition model according to the student model. According to the scheme, the teaching content can be flexibly adjusted while the knowledge accuracy of master model teaching is improved, and the knowledge distillation effect is improved.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a processing method, device, computer equipment and storage medium of a lip recognition model. Background technique [0002] Lip language recognition aims to predict speech content from silent lip videos or face videos. This visual task usually uses knowledge distillation to allow the student model to learn the ability of lip language recognition from the trained teacher model. [0003] Knowledge distillation can transfer knowledge from the teacher model to the student model. However, the current teacher model is usually a pre-trained model, without considering the current ability of the student model to perform lip recognition tasks. Due to the neglect of the needs of the student model, the teacher model often lacks flexibility when adjusting teaching knowledge. It is impossible to dynamically adjust the teaching content according to the development of the student m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62
CPCG06F18/214
Inventor 何盛烽任苏成孙子荀邓大付王巨宏刘婷婷
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products