Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A semantic segmentation method of weakly supervised image based on spatial pyramid concealment pooling

A space pyramid and semantic segmentation technology, applied in the field of computer vision, to achieve the effect of more robust target size and posture, rich local features, and perfect regional feature mining

Active Publication Date: 2019-01-15
成都图必优科技有限公司
View PDF5 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On the other hand, although the pyramid model has the characteristics of multi-scale information and local information fusion, the problem of how to better mine the sub-regional semantic information on the basis of mastering the global information has not been completely solved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A semantic segmentation method of weakly supervised image based on spatial pyramid concealment pooling
  • A semantic segmentation method of weakly supervised image based on spatial pyramid concealment pooling
  • A semantic segmentation method of weakly supervised image based on spatial pyramid concealment pooling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] Refer to attached Figure 1-4 , the embodiments of the present invention will be described in detail.

[0058] A weakly supervised image semantic segmentation method based on spatial pyramid mask pooling, comprising the following steps:

[0059] Step 1: Select a convolutional neural network H, and process the input image X through the convolutional neural network H to obtain a classification feature map;

[0060] Step 2: Establish a spatial pyramid pooling module based on the classification feature map, and then perform spatial pyramid masking to obtain an output feature map;

[0061] Step 3: Calculate the category activation vector and category probability vector according to the output feature map, and then establish a competitive spatial pyramid masking pooling loss function;

[0062] Step 4: Train the convolutional neural network H according to the competitive spatial pyramid masking pooling loss function and extract segmentation feature maps.

[0063] Further, t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a weak supervised image semantic segmentation method based on spatial pyramid concealment pooling, which comprises the following steps: selecting a convolution neural network H, processing the input image X through the convolution neural network H to obtain a classification characteristic map; the spatial pyramid pooling module is established according to the classificationcharacteristic map, and then the spatial pyramid is concealed to obtain the output characteristic map. According to the output characteristic graph, the category activation vector and the category probability vector are calculated, and then the competitive spatial pyramid masking pooling loss function is established. The convolutional neural network H is trained according to the pooling loss function of competitive spatial pyramid concealment and the segmentation feature map is extracted. The invention realizes a weak supervised image semantic segmentation model with richer local features, more perfect region feature mining and more robust target size and posture, improves the extraction ability of local semantic information and strengthens the recognition ability of local targets or parts in the weak supervised semantic segmentation.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and in particular relates to a method for semantic segmentation of weakly supervised images based on spatial pyramid masking pooling. Background technique [0002] Image semantic segmentation is a basic computer vision task, and its goal is to classify all pixels in the image. Since it can fully understand images at the pixel level, it is useful for other vision tasks such as image classification and object recognition. However, because the production of pixel-level label data requires a lot of energy, fully supervised image semantic segmentation is difficult to quickly achieve large-scale expansion, so weakly supervised image semantic segmentation methods that rely on image category labels have been widely studied. [0003] In past computer vision research, the pyramid model has been used a lot. The well-known SIFT algorithm extracts key point description features through the Laplacian...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06T7/11G06K9/62G06N3/04
CPCG06T7/11G06T2207/20016G06N3/045G06F18/2415G06F18/214
Inventor 朱策段昶文宏雕徐榕健
Owner 成都图必优科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products