Rubbish blog detecting method
A detection method and blog technology, applied in the field of blogs, can solve the problems of insufficient feature selection of spam blogs, low accuracy rate of distinguishing spam blogs from normal blogs, etc., and achieve high accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0036] There are three key points in the implementation of the present invention: blog text content feature extraction, blog page link feature extraction and blog text time distribution feature extraction. After obtaining the blog page data, the present invention obtains the feature vector through text content analysis, blog page link analysis and blog text time attribute analysis, and uses an automatic text classification algorithm to realize accurate classification of spam blogs.
[0037] 1. Feature extraction of blog text content:
[0038] As far as a single article is concerned, a blog article (including the article title) is used as the object, and the feature item is represented by the binary method. Binary representation, that is, one of {0, 1} is selected, and the keyword that appears is represented by 1, and the keyword that does not appear is represented by 0. In the standardized word frequency representation method, it is necessary to make appropriate improvements ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com