Data labeling method based on cross validation and related equipment

Cross-validation technology, applied to data labeling, addresses problems such as the difficulty of verifying the full set of samples.

Pending Publication Date: 2021-04-20
PINGAN PUHUI ENTERPRISE MANAGEMENT CO LTD

AI Technical Summary

Problems solved by technology

[0003] The purpose of the embodiments of the present application is to propose a cross-validation-based data labeling method, device, computer equipment, and storage medium that solve the technical problem that the full set of samples is difficult to check efficiently.




Embodiment Construction

[0033] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by those skilled in the technical field of the application. The terms used in the description of the application serve only to describe specific embodiments and are not intended to limit the present application. The terms "comprising" and "having", and any variations thereof, in the specification and claims of the present application and in the description of the above drawings are intended to cover non-exclusive inclusion. The terms "first", "second", and the like in the description and claims of the present application or in the above drawings are used to distinguish different objects, not to describe a specific order.

[0034] Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present application. The occurrenc...



Abstract

The embodiment of the invention belongs to the technical field of artificial intelligence and relates to a data labeling method based on cross-validation and to related equipment. The method comprises the steps of: obtaining a small-sample initial labeled data set; inputting the initial labeled data set into a classification model for cross-validation to obtain an initial labeling model; obtaining a large-sample data set, inputting it into the initial labeling model for pre-labeling, and determining a correction data set according to the pre-labeling result; inputting the correction data set into the initial labeling model for cross-validation to obtain a final labeling model; and labeling received data to be labeled with the final labeling model. The invention also involves blockchain technology: the data to be labeled can be stored in a blockchain. The method substantially improves labeling efficiency, uses cross-validation to detect mislabeled data during the labeling process, and avoids repeated annotation of most of the data.
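The steps in the abstract can be sketched in a few lines of Python. This is only an illustrative reading, not the patented implementation: the nearest-centroid classifier, the synthetic 2-D data, the margin threshold used to select the correction set, and all function names (`train_centroids`, `predict`, `cross_validate`, `run_pipeline`) are assumptions introduced here, and the "reviewer" who corrects labels is simulated by the known true label.

```python
import random
from collections import defaultdict


def make_points(rng, n):
    """Synthetic 2-D points (assumption): label 0 near (-1.5, -1.5), label 1 near (1.5, 1.5)."""
    pts = []
    for i in range(n):
        y = i % 2
        c = 1.5 if y else -1.5
        pts.append(([rng.gauss(c, 1.0), rng.gauss(c, 1.0)], y))
    return pts


def train_centroids(samples):
    """Fit a nearest-centroid classifier: one mean vector per label."""
    sums, counts = {}, defaultdict(int)
    for x, y in samples:
        if y not in sums:
            sums[y] = [0.0] * len(x)
        sums[y] = [s + v for s, v in zip(sums[y], x)]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in sums}


def predict(model, x):
    """Return (label, margin); margin is the distance gap between the two nearest centroids."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(x, c)) ** 0.5, y) for y, c in model.items()
    )
    margin = dists[1][0] - dists[0][0] if len(dists) > 1 else float("inf")
    return dists[0][1], margin


def cross_validate(samples, k=5):
    """k-fold cross-validation; returns (accuracy, indices whose held-out prediction disagrees)."""
    suspects, correct = [], 0
    for fold in range(k):
        model = train_centroids([s for i, s in enumerate(samples) if i % k != fold])
        for i, (x, y) in enumerate(samples):
            if i % k == fold:
                if predict(model, x)[0] == y:
                    correct += 1
                else:
                    suspects.append(i)  # candidate mislabeled sample
    return correct / len(samples), suspects


def run_pipeline(seed=0, margin_threshold=1.0):
    rng = random.Random(seed)
    small = make_points(rng, 40)    # small, manually labeled initial set
    large = make_points(rng, 400)   # large set; true labels are hidden from the model

    # Cross-validate on the small set, then fit the initial labeling model.
    initial_cv_accuracy, _ = cross_validate(small)
    initial_model = train_centroids(small)

    # Pre-label the large set; low-margin samples form the correction set (assumed rule).
    auto_labeled, correction = [], []
    for x, truth in large:
        label, margin = predict(initial_model, x)
        if margin < margin_threshold:
            correction.append((x, truth))  # simulated reviewer supplies the true label
        else:
            auto_labeled.append((x, label))

    # Cross-validate and retrain with the corrected samples to get the final model.
    refined = small + correction
    final_cv_accuracy, suspects = cross_validate(refined)
    final_model = train_centroids(refined)

    # Label newly received data with the final model.
    new_data = make_points(rng, 200)
    hits = sum(predict(final_model, x)[0] == y for x, y in new_data)
    return {
        "initial_cv_accuracy": initial_cv_accuracy,
        "final_cv_accuracy": final_cv_accuracy,
        "correction_size": len(correction),
        "auto_size": len(auto_labeled),
        "suspect_count": len(suspects),
        "new_data_accuracy": hits / len(new_data),
    }


if __name__ == "__main__":
    print(run_pipeline())
```

In this sketch only the low-margin samples are sent for review, which mirrors the abstract's claim that most of the data does not need repeated annotation; the indices returned by `cross_validate` play the role of suspected mislabeled samples.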

Description

Technical field

[0001] The present application relates to the technical field of artificial intelligence, in particular to a cross-validation-based data labeling method and related equipment.

Background

[0002] In recent years, computer recognition technology based on deep learning has been widely used in various industries. An excellent deep learning model requires a large amount of high-quality labeled data to support it, and at present almost all of this high-quality labeled data is labeled manually. Manual data labeling is very inefficient, and the accuracy of the labeling results depends largely on the skill of the labelers, so the quality of manually labeled data cannot be effectively guaranteed. Therefore, the current data labeling process generally suffers from labeling quality problems. Among the two existing solutions, redundant data labeling schemes are generally used. The same data set is repeatedly marked by mu...


Application Information

IPC(8): G06K9/62
Inventor 魏万顺
Owner PINGAN PUHUI ENTERPRISE MANAGEMENT CO LTD