Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Two-stage multi-modal three-dimensional instance segmentation method

A multi-modal, three-dimensional technology, applied in the fields of computer vision and deep learning, can solve problems such as difficult segmentation, difficult acquisition of three-dimensional information, and difficulty in distinguishing boundaries by recognition methods, and achieves real-time and precision requirements and accurate three-dimensional instance segmentation Effect

Inactive Publication Date: 2022-05-13
CHENGDU UNIVERSITY OF TECHNOLOGY
View PDF1 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] (1) Due to the difficulty in collecting 3D information in the smart workshop scene and the high cost of 3D label labeling, it is an unavoidable problem to achieve accurate 3D instance segmentation of the smart workshop scene target in the absence of 3D instance segmentation labels;
[0004] (2) There are a large number of identical machine tools on the production line in the smart workshop scene. They have similar color features and shape features, which makes it difficult for existing recognition methods to distinguish their boundaries. In this case, accurately segmenting each instance is another problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Two-stage multi-modal three-dimensional instance segmentation method
  • Two-stage multi-modal three-dimensional instance segmentation method
  • Two-stage multi-modal three-dimensional instance segmentation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0049] A two-stage multimodal 3D instance segmentation method, taking a digital workshop as an example, specifically includes the following steps:

[0050] S1. Establish an instance segmentation data set for intelligent workshop objects: the number of instance segmentation datasets for intelligent workshop objects will greatly affect the accuracy of the segmentation network. Workshop target instance segmentation dataset. The target types of the dataset include seven categories: people, CNC lathes, ordinary lathes, CNC milling machines, ordinary milling machines, pedals, and mobile robots. The dataset is shot with an Intel RealSense D435 camera, labeled with Labelme, and image type Contains color map, depth map and corresponding instance segmentation image data labels, as attached figure 1 shown.

[0051] S2. Establish a three-dimensional ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a two-stage multi-mode three-dimensional instance segmentation method. The method comprises two parts of two-dimensional prior information acquisition and three-dimensional instance segmentation. The two-dimensional prior information acquisition part adopts an RGBD multi-mode fusion two-dimensional instance segmentation network, a depth feature correction module and a data enhancement strategy for a depth map are designed for the problem of low quality of the depth map, and a feature alignment module is added in a feature pyramid module. And the three-dimensional instance segmentation part is used for generating a point cloud with instance segmentation information by using a coordinate transformation method in combination with a depth map according to the acquired two-dimensional prior information, and then removing noise points in the point cloud by using direct filtering and statistical filtering to realize three-dimensional instance segmentation of a target. The two-stage multi-modal three-dimensional instance segmentation method has the beneficial effects that the two-stage multi-modal three-dimensional instance segmentation method can perform accurate three-dimensional instance segmentation on a scene target in an application scene with a small amount of two-dimensional annotation data and without three-dimensional annotation data, and has the advantage of high real-time performance.

Description

technical field [0001] The invention belongs to the fields of computer vision and deep learning, and in particular relates to a two-stage multimodal three-dimensional instance segmentation method. Background technique [0002] With the rise of the industrial Internet, the manufacturing industry is gradually developing in the direction of intelligence. The smart workshop is an important part of the manufacturing process, and the degree of scene awareness is an important indicator of its intelligence level. As an effective scene-aware method, the 3D instance segmentation method can generate 3D object boundaries to obtain object category information and object spatial location information that cannot be obtained by 2D instance segmentation methods. Furthermore, 3D instance segmentation has been applied in areas such as autonomous driving, robot navigation, and virtual / augmented reality. However, most applications in smart workshops are only based on 2D scene perception. Ther...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06T7/10G06V10/82G06T5/00G06T5/20G06V10/80G06N3/04
CPCG06T7/10G06T5/20G06T2207/10024G06T2207/10028G06T2207/20081G06T2207/20084G06T2207/20016G06T2207/20024G06N3/045G06F18/253G06T5/70
Inventor 陈光柱唐在作韩银贺
Owner CHENGDU UNIVERSITY OF TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products