Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Target pose estimation method and system based on attention mechanism and Hough voting

A technology of pose estimation and attention, applied in computing, computer components, character and pattern recognition, etc., can solve problems such as poor robustness, easily affected by background or occlusion, cumbersome process, etc., to speed up network convergence and improve Effects on Presentation and Reasoning Abilities

Active Publication Date: 2021-07-02
HUNAN UNIV
View PDF7 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the need to artificially calculate templates or feature points for specific objects, the robustness is poor and the process is cumbersome, and such methods are also easily affected by background or occlusion, and the accuracy is low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Target pose estimation method and system based on attention mechanism and Hough voting

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] In order to make the technical problems, technical solutions and advantages to be solved by the present invention clearer, the following will be described in detail in conjunction with the accompanying drawings and specific embodiments:

[0060] Aiming at the problems of existing object pose estimation methods, the present invention provides a target pose estimation method based on attention mechanism and Hough voting. The specific network structure is as follows figure 1 shown, including the following steps:

[0061] Step S1: Obtain the color and depth image of the scene containing the target object;

[0062] Step S2: Obtain the category and segmentation mask of each object from the color image through the existing state-of-the-art object segmentation method;

[0063] Step S3: Use the object segmentation masks obtained in step S2 to cut out the color and depth images of the corresponding objects from the input image, and perform channel splicing to obtain 4-channel im...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a target pose estimation method and system based on an attention mechanism and Hough voting. The method comprises the following steps: acquiring a color image and a depth image; segmenting and cutting the color image to obtain color and depth image blocks of each target object; using two strategies to estimate the six-dimensional pose of a target object, for a three-dimensional rotation matrix, based on a feature extraction network of bidirectional space attention, performing robust feature extraction by utilizing target surface two-dimensional feature constraint, and then using a multi-layer sensing network to regress the three-dimensional rotation matrix of the target; and for a three-dimensional translation vector, reconstructing a target object point cloud and normalizing point cloud data, estimating a point cloud three-dimensional translation direction vector point by point by adopting a Hough voting network, finally establishing a translation center straight line set, and solving a space nearest point to obtain a target three-dimensional translation vector. According to the method, the rotation matrix and the translation vector are estimated respectively, the execution speed is high, and precision is high.

Description

technical field [0001] The invention relates to the fields of robot visual perception and computer vision, in particular to a target pose estimation method and system based on attention mechanism and Hough voting. Background technique [0002] Object pose estimation refers to identifying known objects in the current scene from the perspective of the camera, and estimating their 3-axis orientation and 3-axis position in the camera's 3-dimensional space coordinate system, more specifically, referring to the object The rigid body transformation matrix T of the 3D model transformed from its own coordinate system to the camera coordinate system consists of a 3D rotation matrix R and a 3D translation vector t, which constitute the 6D pose P of the object. Object pose estimation is a key content in robot scene understanding. Using computer vision technology has achieved a series of results in the fields of robot grasping, human-computer interaction and augmented reality, and has be...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/32G06K9/34G06T7/73G06N3/04
CPCG06T7/73G06V10/25G06V10/267G06N3/045
Inventor 王耀南刘学兵朱青袁小芳毛建旭冯明涛周显恩谭浩然
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products