Data mining system and method

A data mining and data cleaning technology, applied in the fields of electrical digital data processing, special data processing applications, and digital data information retrieval. It addresses inaccurate data sources and the low efficiency of the analysis and mining processes, and improves mining efficiency.

Status: Inactive
Publication Date: 2021-03-26
杭州洛邑科技有限公司

AI Technical Summary

Problems solved by technology

[0006] Existing data mining technology mostly obtains data through crawlers and other data collection techniques, then analyzes and mines that data to uncover the patterns hidden in it and to guide real-world activity and production. This approach, however, leads to inaccurate data sources and to low efficiency and accuracy in the analysis and mining processes.

Examples

Embodiment 1

[0032] A data mining system comprising a data import unit, a data cleaning unit, a mining model training unit, a mining model management unit, a mining model display unit, a mining model evaluation unit, and a mining model publishing unit. The data import unit is in signal connection with the data cleaning unit; the data cleaning unit is in signal connection with the mining model management unit; the mining model management unit is in signal connection with the mining model evaluation unit; and the mining model evaluation unit is in signal connection with the mining model publishing unit. The data import unit imports the original data information. The data cleaning unit cleans the imported original data information. The mining model management unit assists with data mining model training, model evaluation, model publishing, and scoring applications of the model. The mining model training unit is used to select a business problem, select a training data ...
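As an illustration only, the chain of signal connections listed in this embodiment can be written down as a small lookup table and walked from the data import unit onward. The identifiers below (CONNECTIONS, signal_path, and the unit names as strings) are hypothetical and do not come from the patent. The training and display units are left out because the visible text does not state how they are connected.

```python
from typing import Dict, List

# Signal connections as listed in Embodiment 1 (upstream unit -> downstream unit).
# The string identifiers are hypothetical names chosen for this sketch.
CONNECTIONS: Dict[str, str] = {
    "data_import": "data_cleaning",
    "data_cleaning": "model_management",
    "model_management": "model_evaluation",
    "model_evaluation": "model_publishing",
}


def signal_path(start: str) -> List[str]:
    """Follow the signal connections from `start` until a unit has no successor."""
    path = [start]
    while path[-1] in CONNECTIONS:
        path.append(CONNECTIONS[path[-1]])
    return path


if __name__ == "__main__":
    print(" -> ".join(signal_path("data_import")))
```

Keeping the wiring in a table rather than in code makes it easy to see that data always passes through cleaning before it reaches model management.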

Embodiment 2

[0038] Further, the data cleaning unit includes: a data rule subunit for configuring data cleaning rule files; a data cleaning code generation subunit for generating data cleaning code from the data table to be cleaned and its corresponding data cleaning rules; an execution subunit for executing the data cleaning code to label the data to be cleaned; and an analysis subunit for analyzing the labels and cleaning out the dirty data. The execution subunit in turn includes: a to-be-cleaned data reading unit, which reads the data to be cleaned one by one from the data table to be cleaned; an initial label setting unit, which sets an initial label on each item of data read; a data cleaning rule matching unit, which matches the data cleaning rules one by one; and a label reset unit, which resets the label of the data to be cleaned according to the matching result and increases its label value every time the data to be cleaned triggers a data cleaning r...
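The labeling loop described above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the patent's implementation: rules are modeled as plain predicates instead of code generated from rule files, the label is an integer that starts at 0 and grows by 1 per triggered rule, and the analysis step drops any record whose label exceeds a threshold. The source text is cut off before it explains how labels are analyzed, so the threshold and every name here (CleaningRule, label_records, drop_dirty) are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Record = Dict[str, object]


@dataclass
class CleaningRule:
    """Hypothetical cleaning rule: a name plus a predicate that flags a record."""
    name: str
    is_triggered: Callable[[Record], bool]


def label_records(
    records: List[Record],
    rules: List[CleaningRule],
    initial_label: int = 0,
) -> List[Tuple[Record, int]]:
    """Read records one by one, set an initial label, and raise it per triggered rule."""
    labeled = []
    for record in records:            # data reading unit
        label = initial_label         # initial label setting unit
        for rule in rules:            # rule matching unit, one rule at a time
            if rule.is_triggered(record):
                label += 1            # label reset unit (increment of 1 is assumed)
        labeled.append((record, label))
    return labeled


def drop_dirty(labeled: List[Tuple[Record, int]], threshold: int = 0) -> List[Record]:
    """Assumed analysis step: keep only records whose label is at most `threshold`."""
    return [record for record, label in labeled if label <= threshold]


if __name__ == "__main__":
    rules = [
        CleaningRule("missing_age", lambda r: r.get("age") is None),
        CleaningRule("negative_age", lambda r: isinstance(r.get("age"), int) and r["age"] < 0),
    ]
    rows = [{"age": 30}, {"age": None}, {"age": -5}]
    print(drop_dirty(label_records(rows, rules)))
```

Separating labeling from removal mirrors the split between the execution subunit and the analysis subunit: the labels can be inspected before any data is discarded.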

Embodiment 3

[0048] Further, the mining model training unit includes an algorithm selection subunit, a parameter setting subunit, and an algorithm execution subunit. The algorithm selection subunit selects a training data source according to the selected business problem and, referring to the modeling variable set of the business problem template, selects an algorithm for that training data source. The parameter setting subunit sets the parameters. The algorithm execution subunit executes the selected algorithm with the set parameters to train the model.
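The three subunits can be pictured as three steps of one training call: pick the data source and algorithm from a business problem template, merge the parameter settings, then run the algorithm. The sketch below is illustrative only. ProblemTemplate, ALGORITHMS, train, and the toy mean_baseline algorithm are all hypothetical names introduced here; the patent does not specify any particular algorithm or data structure.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

Row = Dict[str, float]
Model = Callable[[Row], float]
Trainer = Callable[[List[Row], List[str], Dict[str, int]], Model]


@dataclass
class ProblemTemplate:
    """Hypothetical business problem template."""
    data_source: str
    modeling_variables: List[str]
    algorithm: str
    default_params: Dict[str, int] = field(default_factory=dict)


def train_mean_baseline(rows: List[Row], variables: List[str], params: Dict[str, int]) -> Model:
    """Toy algorithm: always predict the rounded mean of the last modeling variable."""
    target = variables[-1]
    mean = sum(row[target] for row in rows) / len(rows)
    prediction = round(mean, params.get("round_to", 2))
    return lambda row: prediction


# Hypothetical registry of available algorithms.
ALGORITHMS: Dict[str, Trainer] = {"mean_baseline": train_mean_baseline}


def train(
    template: ProblemTemplate,
    sources: Dict[str, List[Row]],
    overrides: Optional[Dict[str, int]] = None,
) -> Model:
    # Algorithm selection: data source and algorithm both come from the template.
    rows = sources[template.data_source]
    algorithm = ALGORITHMS[template.algorithm]
    # Parameter setting: explicit overrides win over template defaults.
    params = {**template.default_params, **(overrides or {})}
    # Algorithm execution: run the chosen algorithm with the chosen parameters.
    return algorithm(rows, template.modeling_variables, params)


if __name__ == "__main__":
    template = ProblemTemplate("orders", ["spend"], "mean_baseline")
    sources = {"orders": [{"spend": 10.0}, {"spend": 20.0}, {"spend": 25.0}]}
    model = train(template, sources, {"round_to": 1})
    print(model({"spend": 0.0}))
```

Driving selection from the template keeps the business problem, its variables, and its algorithm in one place, which is the role the text assigns to the business problem template.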

Abstract

The invention discloses a data mining system and method, and relates to the technical field of data mining. The system comprises a data import unit, a data cleaning unit, a mining model training unit, a mining model management unit, a mining model display unit, a mining model evaluation unit, and a mining model publishing unit. The data import unit is in signal connection with the data cleaning unit; the data cleaning unit is in signal connection with the mining model management unit; the mining model management unit is in signal connection with the mining model evaluation unit; and the mining model evaluation unit is in signal connection with the mining model publishing unit. The data import unit is used for importing original data information; the data cleaning unit is used for cleaning the imported original data information; and the mining model management unit is used for assisting data mining model training, model evaluation, model publishing, and model scoring applications. The system and the method have the advantages of high mining efficiency and high accuracy of data mining results.

Description

Technical field

[0001] The invention relates to the technical field of data mining, in particular to a data mining system and method.

Background technique

[0002] Transform this data into useful information and knowledge. The acquired information and knowledge can be widely used in various applications, including business management, production control, market analysis, engineering design, and scientific exploration.

[0003] Data mining is a hot topic in artificial intelligence and database research. Data mining refers to the non-trivial process of revealing hidden, previously unknown, and potentially valuable information from a large amount of data in a database. It is a decision support process that draws mainly on artificial intelligence, machine learning, pattern recognition, statistics, databases, and visualization technology to analyze enterprise data in a highly automated manner, make inductive inferences, and dig out ...

Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F16/215, G06F16/2455, G06F16/2458
CPC: G06F16/215, G06F16/24564, G06F16/2465
Inventor: 周维东
Owner: 杭州洛邑科技有限公司