Anti-spam method based on micro-content similarity
A similarity-based anti-spam technology, applied in the anti-spam field of Internet micro-content. It addresses the problems of increased server burden, service quality that cannot be guaranteed, and spam-comment identification needs that are not fully met. Its effect is to improve efficiency and reduce the number of comparisons.
Embodiment Construction
[0031] The present invention defines the concepts underlying comment similarity as follows:
[0032] Word: an indivisible semantic unit;
[0033] High-frequency words: words such as "的" and "ah" that carry no semantic meaning and need to be filtered out;
[0034] Comment: a finite set of words, obtained by segmenting the original comment into words and then filtering out the high-frequency words;
[0035] The number of words in a comment: the cardinality of the comment's word set, that is, the number of elements the set contains;
[0036] "Intersection" of comments: the intersection operation on their word sets;
[0037] "Union" of comments: the union operation on their word sets;
[0038] Define the similarity sim(a, b) between comment a and comment b:
[0039] The number of words in the intersection of a and b / the number of words in the union of a and b, that is
[0040] \mathrm{sim}(a, b) = \frac{|a \cap b|}{|a \cup b|} = \frac{|a \cap b|}{|a| + |b| - |a \cap b|}
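The definitions above can be illustrated with a minimal Python sketch. It assumes the comments have already been segmented into words (the segmenter itself is not shown here), and the names HIGH_FREQUENCY_WORDS, comment_word_set and the sample comments are hypothetical, chosen only for illustration. The ratio of intersection size to union size is the measure commonly known as the Jaccard index; returning 0.0 for two empty comments is an assumption, since the text does not cover that case.

```python
# Hypothetical stop-word list: the text names only "的" and "ah" as examples.
HIGH_FREQUENCY_WORDS = {"的", "ah"}


def comment_word_set(words):
    """Turn already-segmented words into a comment: a set without high-frequency words."""
    return {w for w in words if w not in HIGH_FREQUENCY_WORDS}


def sim(a, b):
    """Similarity of two comments: |a ∩ b| / (|a| + |b| - |a ∩ b|)."""
    intersection = len(a & b)
    union = len(a) + len(b) - intersection
    if union == 0:
        # Assumption: two empty comments are treated as not similar.
        return 0.0
    return intersection / union


# Illustrative, made-up comments (already segmented into words).
a = comment_word_set(["cheap", "watches", "ah", "click", "here"])
b = comment_word_set(["cheap", "watches", "click", "now"])
print(sim(a, b))  # 3 shared words out of 5 distinct words
```

Because each comment is a set, repeated words count once, which matches the definition of a comment as a set of words rather than a sequence.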
[0041] Combining the above co...