An automatic selection method and system for labeling data, a device and a storage medium

A data labeling and automatic technology, applied in the field of data processing, can solve labor-consuming, time-consuming and other problems, and achieve the effects of avoiding misoperation, improving quality and improving efficiency

Active Publication Date: 2018-12-18
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF6 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to overcome the time-consuming and labor-intensive defect of manually reviewing...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An automatic selection method and system for labeling data, a device and a storage medium
  • An automatic selection method and system for labeling data, a device and a storage medium
  • An automatic selection method and system for labeling data, a device and a storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0059] This embodiment provides an automatic selection method for labeling data, which is used to automatically select labeling data meeting audit requirements from manually labeling labeling data as the final target labeling data. figure 1 shows the flow chart of this embodiment, see figure 1 , the automatic selection method of this embodiment includes:

[0060] S1. Acquiring the data to be labeled;

[0061] Specifically, the data to be labeled may include but not limited to text data, image data, voice data, and video data.

[0062] S2. Determine whether the data to be marked is structured data or unstructured data;

[0063] If it is structured data, go to step S3; if it is unstructured data, go to step S5;

[0064] Specifically, structured data refers to data that can be expressed and stored using a relational database, and is represented in a two-dimensional form. For example, if the structured data to be labeled can be attributes such as object categories. Unstructured...

Embodiment 2

[0084] This embodiment provides an electronic device, which can be expressed in the form of a computing device (for example, it can be a server device), including a memory, a processor, and a computer program stored on the memory and operable on the processor, wherein the processor The method for automatically selecting label data provided in Embodiment 1 can be implemented when the computer program is executed.

[0085] image 3 A schematic diagram of the hardware structure of this embodiment is shown, as image 3 As shown, the electronic device 9 specifically includes:

[0086] At least one processor 91, at least one memory 92, and a bus 93 for connecting different system components, including the processor 91 and the memory 92, wherein:

[0087] The bus 93 includes a data bus, an address bus, and a control bus.

[0088] The memory 92 includes a volatile memory, such as a random access memory (RAM) 921 and / or a cache memory 922 , and may further include a read only memory...

Embodiment 3

[0094] This embodiment provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, the steps of the method for automatically selecting label data provided in Embodiment 1 are implemented.

[0095] Wherein, the readable storage medium may more specifically include but not limited to: portable disk, hard disk, random access memory, read-only memory, erasable programmable read-only memory, optical storage device, magnetic storage device or any of the above-mentioned the right combination.

[0096] In a possible implementation manner, the present invention can also be implemented in the form of a program product, which includes program code, and when the program product runs on the terminal device, the program code is used to make the terminal device execute The steps of the method for automatically selecting labeled data in Embodiment 1.

[0097] Wherein, the program code for executing the present invention...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an automatic selection method and system for labeling data, a device and a storage medium. The automatic selection method comprises the following steps: obtaining the data to be labeled; judging whether the data to be annotated is structured data or unstructured data; if the data is structured, obtaining a plurality of labeled data after the data to be labeled is labeled for many times; selecting the marked data with the largest number of repetitions among the marked data as the target marked data; if the data is unstructured, obtaining the annotated data after the datato be annotated; judging whether the annotated data has passed the audit according to the reference annotation database, and storing a plurality of reference annotation data in the reference annotation database; if approved, selecting the annotated data as the target annotation data. The invention adopts different ways to automatically select the labeled data conforming to the preset rules as thetarget labeling data according to the structured and unstructured data to be labeled, thus saving the cost and improving the efficiency and the quality.

Description

technical field [0001] The present invention relates to the technical field of data processing, in particular to an automatic selection method, system, equipment and storage medium of marked data. Background technique [0002] At present, for the review and selection of labeled data, the mainstream method is to submit the original data after manual labeling, and the personnel of the data receiver will review the data quality one by one. Specifically, the above-mentioned mainstream approach includes the following steps: prepare the labeling tool; log in to the labeling tool to start manual labeling; submit the labeling data for review after the labeling is completed; and manually review item by item by the data receiver. [0003] This method of selecting and labeling data for review consumes a lot of labor costs in the face of simple review scenarios, and the method of manual review one by one; It is difficult to ensure the quality of labeling data through personnel review. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06K9/62
CPCG06F18/22
Inventor 王科郭鹏
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products