
Data label recommendation method based on machine learning

A data labeling and recommendation technology, applied in the field of information identification, that addresses the lack of an accurate and efficient data label recommendation method and achieves effective label recommendation with improved accuracy.

Pending Publication Date: 2021-11-02
闪捷信息科技有限公司

AI Technical Summary

Problems solved by technology

There is currently no accurate and efficient method for recommending data labels.



Embodiment Construction

[0035] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0036] A data label recommendation method based on machine learning, comprising the steps of:

[0037] S1. Receive the data to be predicted. The data to be predicted includes only structured data, that is, database data. A group of data to be predicted generally includes table names, table descriptions, field names, field descriptions, examples, empirical knowledge, etc. The table name and field name are present in every group of data to be predicted and generally consist of simple English or Pinyin; the other items may consist of simple English or Pinyin, or may contain complex Chinese text.

[0038] S2. Preprocess the data to be predicted, including Chinese word segmentation, keyword extraction, and word vector conversion;

[0039] After obtaining the data to be predicted,...
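The S1 input and the S2 preprocessing chain described above can be sketched in Python. This is only an illustration under stated assumptions, not the method of the invention: the source does not name a word segmenter, a keyword extraction algorithm, or a word vector model. Here, overlapping character bigrams stand in for Chinese word segmentation, token frequency stands in for keyword extraction, and a bag-of-words count stands in for word vector conversion. The names `FieldRecord`, `tokenize`, `extract_keywords`, `to_vector`, and `preprocess` are hypothetical.

```python
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class FieldRecord:
    """One group of data to be predicted (S1); attribute names are hypothetical."""
    table_name: str
    field_name: str
    table_description: str = ""
    field_description: str = ""


def tokenize(text: str) -> list[str]:
    """Split identifiers into lowercase words and Chinese runs into bigrams."""
    # Insert a space at camelCase boundaries so "userName" becomes "user Name".
    text = re.sub(r"([a-z0-9])([A-Z])", r"\1 \2", text)
    tokens: list[str] = []
    for run in re.findall(r"[A-Za-z]+|[0-9]+|[\u4e00-\u9fff]+", text):
        if "\u4e00" <= run[0] <= "\u9fff":
            # ASSUMPTION: overlapping character bigrams stand in for a real
            # Chinese word segmenter, which the source does not name.
            if len(run) == 1:
                tokens.append(run)
            else:
                tokens.extend(run[i:i + 2] for i in range(len(run) - 1))
        else:
            tokens.append(run.lower())
    return tokens


def extract_keywords(tokens: list[str], top_k: int = 10) -> list[str]:
    """Keep the top_k most frequent tokens (a simple frequency heuristic)."""
    return [token for token, _ in Counter(tokens).most_common(top_k)]


def to_vector(tokens: list[str], vocabulary: list[str]) -> list[int]:
    """Count how often each vocabulary entry occurs in the tokens."""
    counts = Counter(tokens)
    return [counts[word] for word in vocabulary]


def preprocess(record: FieldRecord, vocabulary: list[str]) -> list[int]:
    """Run segmentation, keyword extraction, and vector conversion in order."""
    text = " ".join([record.table_name, record.table_description,
                     record.field_name, record.field_description])
    keywords = extract_keywords(tokenize(text))
    return to_vector(keywords, vocabulary)
```

For example, a record with table name `t_customer`, field name `mobilePhone`, and field description `客户手机号` yields the tokens `t`, `customer`, `mobile`, `phone`, followed by the bigrams of the Chinese description. A production system would likely swap in a dedicated segmenter and trained word embeddings at the marked assumption points.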


Abstract

The invention discloses a data label recommendation method based on machine learning, and belongs to the technical field of information identification. The method comprises the following steps: S1, receiving to-be-predicted data; S2, preprocessing the to-be-predicted data, wherein the preprocessing comprises Chinese word segmentation, keyword extraction and word vector conversion; S3, calculating a similarity score between the to-be-predicted data and the data corresponding to each label; and S4, recommending the labels with the highest similarity score. According to the invention, accurate labels can be efficiently recommended for the data.
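Steps S3 and S4 can be illustrated with a short Python sketch. The abstract does not state which similarity measure is used, so cosine similarity over sparse term-weight vectors is an assumption made here for illustration. The function names and the label names in the usage note (`phone_number`, `person_name`) are hypothetical.

```python
import math


def cosine_similarity(a: dict[str, float], b: dict[str, float]) -> float:
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # An empty vector is treated as similar to nothing.
    return dot / (norm_a * norm_b)


def recommend_labels(
    query: dict[str, float],
    label_vectors: dict[str, dict[str, float]],
    top_n: int = 1,
) -> list[tuple[str, float]]:
    """S3: score the query against every label; S4: return the best top_n."""
    # ASSUMPTION: each label is represented by one vector built from the
    # data already associated with that label.
    scored = [
        (label, cosine_similarity(query, vector))
        for label, vector in label_vectors.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]
```

Given label vectors `{"phone_number": {"mobile": 1, "phone": 1}, "person_name": {"user": 1, "name": 1}}` and a query `{"mobile": 1, "phone": 1, "customer": 1}`, the query shares two terms with `phone_number` and none with `person_name`, so `phone_number` is recommended.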

Description

Technical field

[0001] The invention relates to the technical field of information identification, and in particular to a data label recommendation method based on machine learning.

Background

[0002] In the information age, and especially with the rapid development of computer and network technology, information systems are becoming increasingly widespread. As an important carrier in which enterprises store important and sensitive information, databases support more and more key business systems and have become the most strategically important assets of enterprises. However, in the complex real-world environments of customers, the scale of data assets is often huge. If these assets cannot be sorted out and classified reasonably, there is no basis for building security. It is therefore necessary to sort out data assets and classify data in combination with data labeling, so that users can focus on protecting data assets according to their different needs. However, there i...


Application Information

IPC(8): G06F40/289, G06K9/62, G06F40/216, G06F16/35, G06N20/00
CPC: G06F40/289, G06F40/216, G06F16/35, G06N20/00, G06F18/22
Inventor: 张黎孟婷婷苏伟华谢委员
Owner: 闪捷信息科技有限公司