The present invention relates to a modeling method for a
support vector machine based on
data compression. The modeling method has the technical characteristics that the method comprises the following steps: sampling modeling data through an
equidistant sampling method; compressing the modeling data; calculating the boundary of each cluster of data under leaf nodes of a clustering feature tree, and choosing a boundary point most possibly becoming a support vector as the modeling data of the
support vector machine; and establishing a model of the
support vector machine: establishing a model of the support vector
machine according to the modeling data through a support vector
machine method. In the modeling method of the present invention, the modeling sample quantity of the support vector
machine is greatly reduced under the condition of ensuring the accuracy rate of the
algorithm to the greatest extent through a pre-sampling strategy, a
data compression technology, an increment sampling strategy and the like, so as to greatly improve the modeling speed of the support vector machine and lower the memory consumption, so that the support vector machine technology can be applied to a
big data analysis scene, thereby remedying the defect that a neural
network method, a Bayes method and the like in the
big data analysis have low prediction accuracy.