A microblog information priority collection method based on multiple strategies
A technology with information priority and collection method, applied in digital data information retrieval, special data processing applications, website content management, etc., can solve the problems of wasting time, a large number of microblogs, and the inability to collect hot bloggers in time.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0072] The specific implementation of the present invention will be further described in detail below in conjunction with the diagrams and examples. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.
[0073] The method that the present invention proposes is to realize by following steps successively:
[0074] Step (1) Spam blogger detection
[0075] Through the blogger collector, 782632 bloggers to be collected are obtained, and the collection of bloggers to be collected is recorded as U={u 1 , u 2 ,...,u n}, where n is 782632.
[0076] Step (1.1) Construct spam microblog detection model
[0077] Step (1.1.1) constructs the training data set, as follows:
[0078] Use crawlers to crawl and manually label a set of Weibo blog post data: G=[(x 1 ,y 1 ),(x 2 ,y 2 ),...,(x n ,y n )], where n represents the total number of microblogs, x i Represents the i-th microblog, where y i = 0 mea...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com