Monosyllabic language lip-reading recognition system based on vision character

A technology for visual features and recognition systems, applied in the field of lip-reading recognition systems, to achieve the effects of strong practicability, improved recognition accuracy, and diverse samples

Inactive Publication Date: 2010-12-01
HUAZHONG UNIV OF SCI & TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The present invention provides a monosyllabic language lip-reading recognition system based on visual features, with the purpose of solving the problem of lip-reading recognition in monosyllabic languages ​​such as Chinese by using only video information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Monosyllabic language lip-reading recognition system based on vision character
  • Monosyllabic language lip-reading recognition system based on vision character
  • Monosyllabic language lip-reading recognition system based on vision character

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

This system reads the lip movement of the video creature to recognize the speaking content. Its aim is to use the video info only to recognize the lip language of the single syllable word (SSW), e.g. in Chinese language. This invention includes the video demodulating module, the lip allocating module. The lip movement dividing module, the feature drawing module, the language material warehouse (LMW), the model establishing module and the lip language recognizing module. This LMW possesses rich contents and is easy to expand. This invention processes only video images and need not the audio data to help. It can process video files, e.g. avi, wmv, rmvb and mpg to meet the requirement of recognizing the talking content under soundless condition. The lip movement part in this invention aims SSW to handle intelligently dividing. Comparing with the solid length time dividing or the handwork dividing, this method is more practical and greatly raises the recognition accuracy.

Description

A lip-reading recognition system for monosyllabic languages ​​based on visual features technical field The invention belongs to computer intelligent recognition technology, and in particular relates to a monosyllable language-oriented lip-reading recognition system based on visual features, which recognizes speech content according to lip movement changes of characters in a video when they speak. Background technique Since its birth in 1946, the computer has gone through the keyboard operation mode and the mouse operation mode, and entered the stage of natural human-computer interaction mode. In this context, speech recognition technology has developed rapidly in recent years, and human-computer interaction through speech is undoubtedly the most effective and fast way of interaction. "Speech recognition in noisy environments: a review" (Y.Cong.Speechrecognitioninnoisyenvironments: asurvey[J].SpeechCommunication, 1995, 16: 261-291) analyzed the ViaVoice speech recognition s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/24G06K9/00G10L15/25
Inventor 王天江刘芳周慧华龚立宇陈刚
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products