A knn text classification method with optimized training sample set
A technology for training sample sets and text classification, which is applied in text database clustering/classification, unstructured text data retrieval, special data processing applications, etc., and can solve problems such as low efficiency and accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0083] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.
[0084] see figure 1 with figure 2 , a text classification method based on the optimized sample set KNN algorithm, firstly preprocess the text of the training set, then represent the preprocessed text in a vector space model, and then perform feature extraction on the representation result, and then perform a text classification model Calculation, after text preprocessing, text representation, and feature extraction are performed on the text dataset to be classified, the model is applied to the text dataset to be classified, and finally the result is obtained.
[0085] A kind of KNN text classification method that optimizes training sample set, concrete steps are as follows:
[0086] (1) The total number of predefined text categories is n, and n represents the number of categories of known category samples, that is, the number of categories o...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com