Method and device for detecting human-object interaction relationship in video

A method for detecting human-object interaction relationships in video, applied in the fields of reasoning methods, neural learning methods, biological neural network models, etc.

Pending Publication Date: 2021-03-09
NANJING UNIV

AI Technical Summary

Problems solved by technology

[0029] The problem to be solved by the present invention is to capture the high-level semantic information of a scene from a complex video visual scene, and to discover, locate and classify the human-object pairs in the video and the interaction relationships between them.



Embodiment Construction

[0041] The present invention proposes a human-object interaction detection method based on the spatio-temporal domain. Its purpose is to capture the high-level semantic information of a scene from a complex video visual scene, and to discover, locate and classify the human-object pairs in the video and the interaction relationships between them. As shown in Figure 2, the present invention extracts the spatio-temporal trajectories of the subject and object in the video through target trajectory detection, and then identifies the human-object interaction (HOI) relationship through interactive joint reasoning based on the result of target trajectory detection. Target trajectory detection includes video target detection and visual tracking: frame-level target detection is performed on the video segment, and the target spatio-temporal trajectories are generated. The interactive joint reasoning is based on the target spatio-temporal track, the human body track and the object trac...
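The step from frame-level detections to spatio-temporal trajectories can be illustrated with a minimal sketch. The text here does not say which tracker the invention uses, so the greedy IoU linking below is an assumed stand-in, and the names `iou`, `Track`, `link_detections` and the `iou_threshold` value are hypothetical.

```python
from dataclasses import dataclass, field

# A box is (x1, y1, x2, y2) in pixel coordinates.
Box = tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


@dataclass
class Track:
    label: str
    boxes: dict[int, Box] = field(default_factory=dict)  # frame index -> box


def link_detections(frames, iou_threshold=0.5):
    """Greedily link per-frame detections into tracks.

    frames: list indexed by frame, each item a list of (label, box).
    A detection extends the same-label track whose box in the previous
    frame overlaps it most (IoU >= iou_threshold); otherwise it starts
    a new track. This is an illustrative assumption, not the tracker
    described in the patent.
    """
    tracks: list[Track] = []
    for t, detections in enumerate(frames):
        used = set()  # tracks already extended or created in this frame
        for label, box in detections:
            best, best_iou = None, iou_threshold
            for idx, track in enumerate(tracks):
                if idx in used or track.label != label:
                    continue
                if (t - 1) not in track.boxes:
                    continue
                score = iou(track.boxes[t - 1], box)
                if score >= best_iou:
                    best, best_iou = idx, score
            if best is None:
                tracks.append(Track(label, {t: box}))
                used.add(len(tracks) - 1)
            else:
                tracks[best].boxes[t] = box
                used.add(best)
    return tracks
```

Each resulting `Track` holds one target's boxes across frames, which is the kind of human or object trajectory the joint reasoning stage would consume.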


Abstract

The invention discloses a method and a device for detecting human-object interaction relationships in a video. The method extracts the spatio-temporal trajectories of the subject and object in the video through target trajectory detection, and then recognizes the human-object interaction (HOI) relationship through interactive joint reasoning based on the result of target trajectory detection. The interactive joint reasoning extracts multimodal features from the target spatio-temporal trajectories, performs joint reasoning by multi-feature fusion over the fused human-object semantic features, the visual description features of human behavior and the spatio-temporal relative motion features of human and object, and predicts the human-object interaction action on a video segment to obtain a predicted interaction category label, namely the human-object interaction relationship. The invention provides a spatio-temporal-domain human-object interaction detection method that discovers, locates and classifies the human-object pairs in a video and the interaction relationships between them by capturing high-level semantic information of the scene from a complex video visual scene.
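As a rough illustration of the multi-feature fusion described above, the sketch below derives a simple relative motion feature from a human track and an object track, concatenates it with semantic and visual feature vectors, and scores interaction labels with a linear layer followed by softmax. The abstract does not specify the feature definitions or the fusion operator, so concatenation, the four-value motion descriptor, the function names and the `weights` matrix are all assumptions made for illustration.

```python
import math


def relative_motion(human_boxes, object_boxes):
    """Hypothetical relative-motion descriptor for aligned box sequences.

    Boxes are (x1, y1, x2, y2). Returns the object's centre offset from the
    human centre in the first frame (normalised by human width and height),
    followed by how that offset changes between the first and last frame.
    """
    rel = []
    for (hx1, hy1, hx2, hy2), (ox1, oy1, ox2, oy2) in zip(human_boxes, object_boxes):
        hw, hh = hx2 - hx1, hy2 - hy1
        dx = ((ox1 + ox2) - (hx1 + hx2)) / (2 * hw)
        dy = ((oy1 + oy2) - (hy1 + hy2)) / (2 * hh)
        rel.append((dx, dy))
    first, last = rel[0], rel[-1]
    return [first[0], first[1], last[0] - first[0], last[1] - first[1]]


def fuse_and_classify(semantic, visual, motion, weights, labels):
    """Concatenate the three feature vectors and score each label.

    weights has one row per label, each as long as the fused vector.
    In a real system these would be learned; here they are supplied by
    the caller purely for illustration.
    """
    fused = list(semantic) + list(visual) + list(motion)
    logits = [sum(w * x for w, x in zip(row, fused)) for row in weights]
    peak = max(logits)  # subtract the max for a numerically stable softmax
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    best = max(range(len(labels)), key=probs.__getitem__)
    return labels[best], probs
```

Concatenation is only one possible fusion choice; it is used here because it keeps each feature stream's contribution visible in the weight matrix.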

Description

Technical field

[0001] The invention belongs to video information retrieval in the field of computer technology and relates to detecting relationships between objects in video. It is used to discover, locate and classify the human-object pairs in a video and the interaction relationships between them, and is a method and device for detecting human-object interaction relationships in video.

Background technique

[0002] Human-object interaction (HOI) (Ref. [1]) is crucial for understanding human-centric video content and benefits many multimedia applications, such as video captioning (Refs. [2,3]), multimodal dialogue (Ref. [4]) and visual question answering (Refs. [5,6]). Many effective methods have been proposed for human-object interaction detection on images, ImgHOID (Refs. [7,8]), but these methods cannot mine the temporal information and dynamic visual cues in video, so they are not effective enough for video human-object interaction detection, HOID. [000...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/00, G06K9/62, G06N5/04, G06N3/04, G06N3/08
CPC: G06N5/04, G06N3/08, G06V20/44, G06V20/49, G06V20/41, G06V20/46, G06V2201/07, G06N3/045, G06F18/2453, G06F18/253
Inventor: 任桐炜, 武港山, 贺云青, 孙旭
Owner: NANJING UNIV