Method for automatically classifying documents based on the K-nearest-neighbor algorithm in a power cloud environment
An automatic document classification method using nearest-neighbor technology, applied in computing, electrical digital data processing, instruments, etc. It addresses problems such as degraded classification performance and achieves the effect of improving execution efficiency.
Embodiment Construction
[0014] The present invention is an automatic document classification method based on the K-nearest-neighbor algorithm in a power cloud environment. The method improves on the MapReduce programming framework of cloud computing: the Map function computes document similarity, and the Reduce function selects the K samples with the highest similarity, tallies the weight of each category to which those nearest neighbors belong, and outputs the category with the largest weight. The specific content includes:
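The division of work between the Map and Reduce functions can be illustrated with a minimal single-process sketch. It is not the patented implementation: cosine similarity over term-weight dictionaries, the similarity-sum category weight, and the names `cosine`, `map_phase`, `reduce_phase` and `classify` are all assumptions made for illustration, since the text above does not specify them.

```python
import heapq
import math
from collections import defaultdict


def cosine(a, b):
    """Cosine similarity of two sparse term-weight dicts (assumed measure)."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def map_phase(query_id, query_vec, training_split):
    """Map: emit (query_id, (similarity, label)) for each training document."""
    for label, vec in training_split:
        yield query_id, (cosine(query_vec, vec), label)


def reduce_phase(query_id, values, k):
    """Reduce: keep the K most similar samples, sum a weight per category
    (here the similarity itself, an assumption), return the heaviest one."""
    top_k = heapq.nlargest(k, values, key=lambda v: v[0])
    weights = defaultdict(float)
    for similarity, label in top_k:
        weights[label] += similarity
    return query_id, max(weights, key=weights.get)


def classify(query_vec, splits, k=3):
    """Hypothetical driver that simulates the shuffle step in one process."""
    emitted = []
    for split in splits:  # each split would be handled by one mapper
        emitted.extend(value for _, value in map_phase("q", query_vec, split))
    return reduce_phase("q", emitted, k)[1]


# Toy training data split across two mappers (illustrative values only).
splits = [
    [("transformer", {"transformer": 1.0, "oil": 1.0}),
     ("relay", {"relay": 1.0, "trip": 1.0})],
    [("transformer", {"transformer": 1.0, "winding": 1.0}),
     ("relay", {"relay": 1.0, "breaker": 1.0})],
]
query = {"transformer": 1.0, "oil": 1.0, "winding": 1.0}
print(classify(query, splits, k=3))
```

In a real cluster the shuffle step would group the emitted pairs by `query_id` before the reducer runs; the driver above only imitates that grouping for a single query.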
[0015] The metadata in the power system information base is used to construct a feature word dictionary, a stopword set, and a concept set specific to the power system industry. The training-set documents are then converted into a structured representation and a model is built: uninformative and generic stopwords are removed according to the stopword set; each document is segmented according to the feature word dictionary; the same concept with different expressio...
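The stopword removal and dictionary-based segmentation steps can be sketched as follows. This is a simplified illustration only: forward maximum matching is an assumed segmentation strategy (the text does not name one), the input is assumed to be unsegmented text such as Chinese, the function names `segment` and `preprocess` and the sample dictionary entries are hypothetical, and the concept-merging step is omitted because its description is truncated in the source.

```python
def segment(text, vocabulary):
    """Forward maximum matching: at each position take the longest
    vocabulary entry, falling back to a single character."""
    max_len = max((len(word) for word in vocabulary), default=1)
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocabulary:
                tokens.append(piece)
                i += size
                break
    return tokens


def preprocess(text, feature_words, stopwords):
    """Segment with the feature word dictionary plus the stopword set,
    then drop stopwords and keep only feature words (an assumption)."""
    tokens = segment(text, feature_words | stopwords)
    return [t for t in tokens if t in feature_words and t not in stopwords]


# Illustrative entries, not taken from the patent.
feature_words = {"变压器", "油温", "过高"}
stopwords = {"的"}
tokens = preprocess("变压器的油温过高", feature_words, stopwords)
print(tokens)
```

Here stopwords are filtered after segmentation rather than before it, which is a simplification; for unsegmented text the two steps are usually interleaved in some way, and the source does not say how.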