Data mining system and method
A data and data cleaning technology, applied in the fields of electrical digital data processing, special data processing applications, digital data information retrieval, etc., can solve the problems of inaccurate data sources, low efficiency of analysis process and mining process, etc., and achieve the effect of improving efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0032]A data mining system, the system comprising: a data import unit, a data cleaning unit, a mining model training unit, a mining model management unit, a mining model display unit, a mining model evaluation unit, and a mining model publishing unit; the data importing unit signals connected to the data cleaning unit; the data cleaning unit signal is connected to the mining model management unit; the mining model management unit signal is connected to the mining model evaluation unit; the mining model evaluation unit signal is connected to the mining model release unit; the data The import unit is used to import original data information; the data cleaning unit is used to perform data cleaning on the imported original data information; the mining model management unit is used to assist data mining model training, model evaluation, and model release and scoring application of the model; the mining model training unit is used to select a business problem, select a training data ...
Embodiment 2
[0038] Further, the data cleaning unit includes: a data rule subunit for configuring data cleaning rule files; a data cleaning code generation subunit for generating data cleaning codes according to the data table to be cleaned and its corresponding data cleaning rules; The execution subunit is used to execute the data cleaning code to label the data to be cleaned; and the analysis subunit is used to analyze the label and clean the dirty data; the execution subunit also includes: a data reading unit to be cleaned, It is used to read the data to be cleaned one by one from the data table to be cleaned; the initial label setting unit is used to set the initial label for the read data to be cleaned; the data cleaning rule matching unit is used to match the data cleaning rules one by one; label reset The unit is used to reset the label of the data to be cleaned according to the matching result, and increase its label value every time the data to be cleaned triggers a data cleaning r...
Embodiment 3
[0048] Further, the mining model training unit includes: an algorithm selection subunit, a parameter setting subunit, and an algorithm execution subunit; the algorithm selection subunit is used to select a training data source according to the selected business problem, referring to the business problem The modeling variable set of the template selects the algorithm for the training data source; the parameter setting subunit is used for parameter setting; the algorithm execution subunit is used for executing the algorithm and performing the model according to the selected algorithm and the set parameters. train.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com