Method for automatically classifying text documents by utilizing body
A text document and automatic classification technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as tedious, no consideration of semantic relationship between words, and difficulty in improving classification accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0038]The present invention will be further described now in conjunction with accompanying drawing:
[0039] According to the method for classifying text documents using ontology proposed by the present invention, we have implemented it using Java and Perl languages, and the specific implementation process is as follows:
[0040] The text document classification method using ontology is divided into the following four steps:
[0041] Step 1: Construction of the keyword set of the text document. Here, the keyword extraction algorithm KEA algorithm is used to extract the weighted keyword set of each text document in the text document collection to be classified, specifically: for the text document collection D={d 1 , d 2 ,...,d |D|} (|D| indicates the number of text documents in the text document collection D) in each text document d i , first, using Naive Bayesian estimation, by considering the frequency tf×idf of words (existing words) appearing in text documents, the aver...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com