Image data set construction method and system and computer readable storage medium

An image data set and construction method technology, applied in the field of computer readable storage devices and image data set construction, can solve problems such as training out performance, waste of data labeling cost, etc.

Active Publication Date: 2018-03-13
STATE GRID CHONGQING ELECTRIC POWER CO ELECTRIC POWER RES INST +2
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, even if a large amount of data is used to train the machine learning model, because it contains a large amount of the same and similar data, it not only wastes the cost of data labeling, but more importantly, it is difficult to train a machine learning model with good performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image data set construction method and system and computer readable storage medium
  • Image data set construction method and system and computer readable storage medium
  • Image data set construction method and system and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0046] The embodiment of the present invention discloses a method for constructing an image data set, such as figure 1 shown, including:

[0047] Step S11: dividing the pre-obtained first target hash value set to obtain corresponding hash value subsets; wherein, the hash value subset has M hash values, and M is an integer greater than or equal to 1;

[0048] Wherein, the process of obtaining the first target hash value set includes: obtaining the original ima...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an image data set construction method and system and a computer readable storage medium. The method comprises the steps of dividing a first target hash value set which is obtained in advance to obtain corresponding hash value subsets having m hash values, with m being an integer larger than or equal to 1; extracting N hash values from any hash value subset respectively, andgenerating a first target hash value subset, wherein n is a positive integer smaller than or equal to m; calculating a union set of all the first target hash value subsets to obtain a second target hash value set, using a second target hash value set, and obtaining a corresponding image in the original image so as to construct a target image data set. The process of obtaining the first target hash value set includes obtaining an original image data set to obtain a corresponding original image, calculating a hash value of the original image, removing the repeated hash value according to the hash value obtained through calculation to obtain the first target hash value set. An image data set which is differentiated is constructed.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to an image data set construction method, system and computer-readable storage device. Background technique [0002] Data, algorithms, and computing power are the three pillars of machine learning. Data has a huge impact on the performance of machine learning models, and sufficient data is the basis for training machine learning models with good performance. The adequacy of data is not only reflected in the amount of data, but also in the diversity of data. Differentiated data is a more comprehensive description of the problem, and a large number of identical or similar data is just a repeated description of a certain aspect of the problem. For example, UAV inspections of power transmission lines usually collect a large number of images, and many factors lead to many identical and similar images: (1) For the inspection of multiple circuits on the same tower, the exact sa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N99/00G06K9/62
CPCG06N20/00G06F18/214
Inventor 钱基业伏进何国军宋伟周小龙赵恒军张海兵肖前波吴国照张盈黄江晨彭姝迪
Owner STATE GRID CHONGQING ELECTRIC POWER CO ELECTRIC POWER RES INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products