Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Video behavior identification method and system based on space enhancement module

A recognition method and space technology, applied in character and pattern recognition, neural learning methods, biological neural network models, etc., can solve problems such as weak classification accuracy and attenuation of extraction ability

Active Publication Date: 2021-04-23
SOUTH CHINA UNIV OF TECH +1
View PDF8 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, its joint learning of spatio-temporal features leads to a significant attenuation in the ability to extract spatial features alone compared with 2D convolution, so the classification accuracy on data sets with strong spatial features and weak timing is weaker than that of 2D convolution.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video behavior identification method and system based on space enhancement module
  • Video behavior identification method and system based on space enhancement module
  • Video behavior identification method and system based on space enhancement module

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0059] The data set used in this embodiment is the Kinetics-400 data set. Kinetics is a very large video classification dataset. The videos in it are all from the Youtube video website. It contains a total of about 230,000 training set videos and about 20,000 verification set videos. The duration of all videos is 3-10s, and the resolution is uniform. It is 340 × 256 pixels or 256 × 340 pixels; the embodiment runs on the Linux system, the system version is Ubuntu16.04, mainly based on the Caffe framework, the dependent library OpenCV version of Caffe is 3.0, and the python version is 2.7; the graphi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a video behavior recognition method and system based on a space enhancement module, and the method comprises the following steps: decoding a to-be-detected video into a frame sequence, and storing the decoded frame sequence in an image form; dividing a video into a plurality of video clips by adopting a sparse sampling strategy, extracting a frame from each video clip, and combining to form a stacked frame sequence; calculating a mean value of three channels of all training video frames in the behavior recognition data set, and subtracting the calculated mean value from the sampled frame image; using a residual neural network 3D-ResNet-18 as a backbone network, and enabling a spatial enhancement module to construct a behavior recognition classification network; setting training parameters, inputting the training set into a behavior recognition classification network for training, and storing trained network parameters; in the model deployment stage, fusing linear operators of the space enhancement module; and inputting a to-be-detected video into the behavior recognition classification network, and outputting a final classification result. According to the invention, the behavior recognition effect is improved, and both effectiveness and universality are achieved.

Description

technical field [0001] The invention relates to the technical field of video behavior recognition of computer vision, in particular to a method and system for video behavior recognition based on a spatial enhancement module. Background technique [0002] Human behavior recognition has always been a key research issue in the field of computer vision. Through the study of human body posture and behavior, abnormal behaviors in public places can be detected, such as: running fast, falling, hitting people, etc. Applying deep learning theory to computer vision applications can design high-precision and high-efficiency behavior recognition algorithms. A highly practical behavior recognition algorithm has played an important role in guaranteeing public safety, and it also enables the computer to automatically complete the detection of abnormal human behaviors, saving the time for managers to search manually. The purpose of video behavior recognition is to identify the specific cat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06N3/04G06N3/08
CPCG06N3/08G06V20/41G06N3/045
Inventor 胡永健蔡德利刘琲贝王宇飞
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products