Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for classifying input data arriving one by one in time

A technology for inputting data and time, applied in text database clustering/classification, relational database, database model, etc., can solve problems such as limited classification accuracy and concept drift, and achieve high classification accuracy

Inactive Publication Date: 2017-08-22
FUJITSU LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These issues lead to limited classification accuracy of existing online learning methods and systems
[0007] It can be seen that due to concept drift, existing online learning methods cannot achieve data classification well.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for classifying input data arriving one by one in time
  • Method and system for classifying input data arriving one by one in time
  • Method and system for classifying input data arriving one by one in time

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Exemplary embodiments of the present invention will be described below with reference to the accompanying drawings. In the interest of clarity and conciseness, not all features of an actual implementation are described in this specification. It should be understood, however, that in developing any such practical embodiment, many implementation-specific decisions must be made in order to achieve the developer's specific goals, such as meeting those constraints related to the system and business, and those Restrictions may vary from implementation to implementation. Moreover, it should also be understood that development work, while potentially complex and time-consuming, would at least be a routine undertaking for those skilled in the art having the benefit of this disclosure.

[0025] Here, it should also be noted that, in order to avoid obscuring the present invention due to unnecessary details, only the device structure and / or processing steps closely related to the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and a system for classifying input data arriving one by one in time. The method comprises the steps of a) training a predetermined quantity of a group of classifiers by utilizing recent input data, of which the quantity increases from new to old in time and the real types are obtained, as learning samples; b) based on a recent classification result of the group of the classifiers, selecting the classifier with the highest classification precision for the recent input data from the group of the classifiers; and c) classifying the current input data by utilizing the selected classifier. According to the method and the system, concept drift does not need to be specially detected and can be automatically processed, and very high classification precision can be realized.

Description

technical field [0001] The present invention relates to a classification method and system, in particular to a method and system for classifying input data arriving one by one in time. Background technique [0002] Online learning is a machine learning method that continuously learns new data and updates existing models. It has a wide range of applications, such as streaming data mining. [0003] Concept drift is a problem unique to online learning. It refers to the conflict between data concepts before and after time, which cannot be described by a machine learning model. Constant change in the real world is at the root of concept drift. For example, in a spam classification application, emails about New Year's promotions will be treated as spam from February to October, and normal emails from November to December. [0004] see figure 1 , figure 1 A schematic diagram of a typical existing online learning method 100 is shown. In the method 100, whenever new data 110 is ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N20/00
CPCG06F16/285G06F16/35G06N20/00
Inventor 徐卓然侯翠琴夏迎炬孙俊
Owner FUJITSU LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products