Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Enhancing knowledge discovery from multiple data sets using multiple support vector machines

A support vector machine and knowledge discovery technology, applied in the field of optimization of input and output data, can solve problems affecting the quality of approximation methods, fuzzy optimal solutions, limitations, etc.

Inactive Publication Date: 2002-07-10
BARNHILL SCI & TECH CORP
View PDF0 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, a significant disadvantage of backpropagation neural networks is that the empirical hazard function can have many local minima, which can easily be obscured by this technique from finding the best solution
Specifically, the skill of the practitioner implementing the neural network determines the ultimate benefit, but even factors as benign as an apparently random choice of initial weights can lead to poor results
Furthermore, gradient-based methods used in neural network learning are historically slow
Another disadvantage: the inverse function has a scaling factor which affects the quality of the approximation
[0010] Additionally, the ability of SVMs to discover knowledge from data is limited by the choice of kernel

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Enhancing knowledge discovery from multiple data sets using multiple support vector machines
  • Enhancing knowledge discovery from multiple data sets using multiple support vector machines
  • Enhancing knowledge discovery from multiple data sets using multiple support vector machines

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention provides an improved method for discovering knowledge from data using learning machines. While some examples of learning machines exist and advances in this field are expected, the exemplary embodiments of the present invention focus on support vector machines. As is known in the art, learning machines include algorithms that can be trained for normalization using data of known outcome. The trained learning machine algorithm can then be applied in the case of unknown outcomes for prediction. For example, a learning machine can be trained to recognize patterns in data, estimate regressions in data, or estimate probability densities within data. Learning machines can be trained to solve a wide variety of problems known to those of ordinary skill in the art. Optionally, any trained learning machine can be tested on the test data to ensure that its output is confirmed to be within an acceptable error bound. Once a learning machine is trained and tes...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A system and method for enhancing knowledge discovery from data using multiple learning machines in general and multiple support vector machines in particular. Training data for a learning machine is pre-processed in order to add meaning thereto. Pre-processing data may involve transforming the data points and / or expanding the data points. By adding meaning to the data, the learning machine is provided with a greater amount of information for processing. With regard to support vector machines in particular, the greater the amount of information that is processed, the better generalizations about the data that may be derived. Multiple support vector machines, each comprising distinct kernels, are trained with the pre-processed training data and are tested with test data that is pre-processed in the same manner. The test outputs from multiple support vector machines are compared in order to determine which of the test outputs if any represents a optimal solution. Selection of one or more kernels may be adjusted and one or more support vector machines may be retrained and retested. Optimal solutions based on distinct input data sets may be combined to form a new input data set to be input into one or more additional support vector machine.

Description

technical field [0001] The present invention relates to the use of learning machines to discover knowledge from data. More particularly, the present invention relates to the optimization of learning machines and associated input and output data in order to improve knowledge discovered from multiple data sets. Background of the invention [0002] Knowledge discovery is the ideal end product of data collection. Recent advances in database technology have led to an explosion of systems and methods for generating, collecting, and storing enormous amounts of data. While database technology has enabled the efficient acquisition and storage of large data sets, the task of facilitating human understanding of the information in such data has grown even more difficult. With numerous existing technologies, this problem has become intractable. Therefore, there exists a need for a new generation of automatic knowledge discovery tools. [0003] As a concrete example, the Human Genome ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/00G06F9/44G06K9/62G06N5/04G06N20/10
CPCG06N99/005G06K9/6269G06K9/6256G06N20/00G06N20/10G06F18/2411G06F18/214G06N3/09
Inventor 斯蒂芬·D·巴恩希尔
Owner BARNHILL SCI & TECH CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products