Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Decision level text automatic classified fusion method

A technology of automatic classification and fusion method, applied in special data processing applications, instruments, electrical digital data processing and other directions, can solve the problem of low accuracy, achieve the effect of ensuring efficiency and accuracy, improving performance, and improving classification accuracy

Inactive Publication Date: 2009-12-16
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF0 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0018] The present invention aims at the disadvantages of low accuracy in existing automatic text classification methods, and proposes a decision-level text automatic classification fusion method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Decision level text automatic classified fusion method
  • Decision level text automatic classified fusion method
  • Decision level text automatic classified fusion method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] According to the above technical solutions, the present invention will be described in detail below in conjunction with the examples.

[0037] The present invention uses the actual project of automatic text classification in my laboratory as an experimental platform to verify the effectiveness of the method proposed by the present invention. The system based on the method of the present invention adopts a JAVA development platform and an Oracle database.

[0038] In this experiment, the method of the present invention is used to classify 10,000 corpus, of which 7000 are training corpus and 3000 are test corpus, which are divided into 15 categories.

[0039] The steps of classifying by the method of the present invention are as follows:

[0040] Step 1: Perform word segmentation, feature extraction, weight calculation and other preprocessing on the 3000 documents to be divided;

[0041] Step 2: On the basis of Step 1, send the preprocessed results to SVM, KNN and Bayes...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a decision level text automatic classified fusion method, belongs to the field of data mining, and is applied to digital libraries, network content supervision, junk mail filtration and the like. The method takes information integration as a theoretical basis, takes a text automatic classification algorithm with high classification precision as a study object, and establishes a decision level text automatic classified fusion model, namely adopts a multi-layer fusion structure to perform text automatic classified processing in a serial-parallel hybrid mode so as to obtain classification results with higher precision.

Description

technical field [0001] The invention relates to a decision-level text automatic classification and fusion method, which belongs to the field of data mining and is suitable for digital libraries, network content supervision, spam filtering and the like. Background technique [0002] Automatic text classification is a relatively hot research issue in the field of data mining. Its purpose is to train a classification function or classifier that can map the documents to be divided into the corresponding categories given. Its goal is to develop faster and more accurate methods of managing textual information. How to improve the accuracy of classification is a hot issue in current research. [0003] The decision-level fusion model is a more classic fusion model in the field of information fusion. Its structure has two modes of series and parallel, and the final decision is made by using the feature level and the decision level. [0004] The level of information fusion refers to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 张晓丹牛振东张正施曹玉鹃徐小梅
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products