Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data set creation with crowd-based reinforcement

a data set and crowd-based technology, applied in the field of data set creation with crowd-based reinforcement, can solve problems such as lateral understanding of varian

Inactive Publication Date: 2021-03-25
QOMPLX INC
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

The patent describes a system and method for creating and expanding high-quality data set collections for training machine learning algorithms via crowdsourced curation. The system receives a data set and uses a machine learning algorithm to score data entries based on multiple metrics, with the scores combined to form an overall reputation score. The system also flags any erroneous data entries and assigns them to a human curator for resolution. Additionally, the system can use a generative adversarial network or other machine learning frameworks to generate synthetic data sets based on a reputable data set collection. The technical effects of this system and method include improved data quality, reduced human error, and improved efficiency in training machine learning algorithms.

Problems solved by technology

Additionally, real-world data offers only one historical view and provides no lateral understanding of variances that could have happened in the market.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data set creation with crowd-based reinforcement
  • Data set creation with crowd-based reinforcement
  • Data set creation with crowd-based reinforcement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023]The inventor has conceived, and reduced to practice, a system and method for creation and expansion of high quality data set collections for training of machine learning algorithms via crowdsourced curation. One embodiment according to the inventor employs a data marketplace which incentivizes data gatherers, publishers, and users to contribute to an expeditiously growing and vast resource of reliable data sets. To accomplish this, data is automatically ingested from disparate sources and autonomously checked for quality, provenance, and security and subsequently given a data reputation score. The score is compared against a numerical threshold and determines whether the data is sufficiently reputable and if so, stores the data in the marketplace. A queue holds data that falls below the reputation threshold where qualified individuals known as data stewards, receive compensation to manually curate the data. Once curated the now reputable data is stored for consumption in the m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A system and method for creation and expansion of high quality data set collections for training of machine learning algorithms via crowdsourced curation that utilizes a data marketplace which incentivizes data gatherers, publishers, and users to contribute to the creation of a vast resource of reliable data set collections. Data is automatically ingested from disparate sources and autonomously checked for data quality, provenance, and cyber-risks and subsequently given a reputation score. Data stewards curate a queue of low scoring real data as well as synthetically generated data. All reputable data is stored for user consumption and further iterative data generation.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS[0001]ApplicationNo.Date FiledTitleCurrentHerewithCYBERSECURITY PROFILING ANDapplicationRATING USING ACTIVE AND PASSIVEEXTERNAL RECONNAISSANCEIs a continuation-in-part of:15 / 931,534May 13, 2020SECURE POLICY-CONTROLLEDPROCESSING AND AUDITING ONREGULATED DATA SETSwhich is a continuation-in-part of:16 / 777,270Jan. 30, 2020CYBERSECURITY PROFILING ANDRATING USING ACTIVE AND PASSIVEEXTERNAL RECONNAISSANCEwhich is a continuation-in-part of:16 / 720,383Dec. 19, 2019RATING ORGANIZATIONCYBERSECURITY USING ACTIVE ANDPASSIVE EXTERNAL RECONNAISSANCEwhich is a continuation of:15 / 823,363Nov. 27, 2017RATING ORGANIZATIONPatentIssue DateCYBERSECURITY USING ACTIVE AND10,560,483Feb. 11, 2020PASSIVE EXTERNAL RECONNAISSANCEwhich is a continuation-in-part of:15 / 725,274Oct. 4, 2017APPLICATION OF ADVANCEDPatentIssue DateCYBERSECURITY THREAT MITIGATION10,609,079Mar. 31, 2020TO ROGUE DEVICES, PRIVILEGEESCALATION, AND RISK-BASEDVULNERABILITY AND PATCHMANAGEMENTwhich is a con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): H04L29/06G06F16/951G06F16/2458
CPCH04L63/20H04L63/1425H04L63/1441G06F16/2477G06F16/951G06N3/045
Inventor CRABTREE, JASONSELLERS, ANDREW
Owner QOMPLX INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products