Webpage text classification method based on feature selection
A feature selection and text classification technique, applied in fields such as special data processing applications, instruments, and electronic digital data processing, that addresses the problems of slow classification speed and low accuracy and achieves higher accuracy, higher recall, and shorter execution time.
Embodiment Construction
[0036] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0037] When calculating the weight of a feature word, the classification method of the present invention combines the position of the feature word with its inter-class and intra-class distribution. This avoids the deficiency of assigning a large weight to feature words that do not contribute to classification, and ultimately improves classification accuracy.
[0038] Relevant definitions in the present invention are as follows:
[0039] Definition 1 (Term Frequency). Term frequency (TF) is the number of times the feature word $t_k$ occurs in document $d_i$, denoted $tf_{ik}(d_i)$. Provided that stop words and individual high-frequency words are excluded, the more times the feature word $t_k$ appears in document $d_i$, the more it represen...
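The term-frequency count in Definition 1 can be illustrated with a minimal Python sketch. It is not the patent's implementation: the whitespace tokenization, the lowercase normalization, the `STOP_WORDS` set, and the function name `term_frequencies` are all assumptions introduced here for illustration.

```python
from collections import Counter

# Hypothetical stop-word list, for illustration only; the source does not specify one.
STOP_WORDS = {"the", "a", "of", "and", "in", "is"}


def term_frequencies(document, stop_words=STOP_WORDS):
    """Return a Counter mapping each feature word t_k to its count tf_ik(d_i).

    Assumes whitespace tokenization and lowercasing; stop words are
    dropped before counting, as Definition 1 presupposes.
    """
    tokens = document.lower().split()
    return Counter(tok for tok in tokens if tok not in stop_words)


doc = "the classifier weights the feature words and the feature positions"
tf = term_frequencies(doc)
print(tf["feature"])  # number of occurrences of "feature" in doc
```

A `Counter` returns 0 for words that never occur or were filtered out, so looking up an excluded stop word does not raise an error.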