Annotation data quality evaluation method and device, computer device and storage medium
A technology for labeling data and quality evaluation, applied in the field of data processing, can solve problems such as omissions and judge the quality of labeling, and achieve the effects of saving costs, improving evaluation accuracy, and solving low efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0026] figure 1 It is a flow chart of a method for evaluating the quality of tagged data in Embodiment 1 of the present invention. This embodiment is applicable to the situation where the tagged text in the tagged sample is evaluated for tagging quality. This method can be provided by the embodiment of the present invention Annotate data quality evaluation means to implement, the means can be implemented in the form of software and / or hardware, and can generally be integrated into computer equipment, such as terminal equipment or servers. Such as figure 1 As shown, the method of this embodiment specifically includes:
[0027] S110. Acquire at least one labeled sample to be processed.
[0028] Specifically, the marked sample is used as a carrier of marked text, where the marked sample may be text, document, image text recognized by image or audio text recognized by audio, and the like.
[0029] Usually, a specific field is marked in a piece of text, and the text marked with ...
Embodiment 2
[0069] Figure 2aIt is a flow chart of a method for evaluating the quality of labeled data in Embodiment 2 of the present invention. This embodiment is embodied on the basis of the above-mentioned embodiments, and the analysis of the labeling accuracy of the at least one labeled sample is embodied. To: obtain the original text matched by the labeled sample; wherein, the original text does not include any labeled data; use a pre-trained model to label the original text to obtain predicted labeled data; the labeled sample includes The labeled data to be evaluated is compared with the predicted labeled data to obtain an accuracy analysis result of the labeled samples. Concretely analyzing the annotation consistency of the at least one labeled sample as: classifying the labeled data to be evaluated in the at least one labeled sample to form at least one class, each class including at least one initial labeled text; Carry out consistency analysis to the initial labeling text of ea...
Embodiment 3
[0122] image 3 It is a schematic diagram of a labeled data quality evaluation device in Embodiment 3 of the present invention. Embodiment 3 is a corresponding device for implementing the method for evaluating the quality of labeled data provided by the above embodiments of the present invention. The device can be implemented in the form of software and / or hardware, and can generally be integrated into computer equipment.
[0123] Correspondingly, the device of this embodiment may include:
[0124] Annotated sample acquisition module 310, configured to acquire at least one labeled sample to be processed;
[0125] An annotation accuracy analysis module 320, configured to perform an annotation accuracy analysis on the at least one annotated sample;
[0126] An annotation consistency analysis module 330, configured to perform an annotation consistency analysis on the at least one annotation sample;
[0127] An annotation quality evaluation result determining module 340, config...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com