Content type detection method and device
A detection method and content category technology, applied in the field of classification and identification, can solve problems that affect user browsing experience, consume a lot of manpower and material resources, and cannot deal with bad content, so as to shorten the detection time, reduce manpower and material resources, and reduce detection costs. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0022] figure 1 It is a schematic flowchart of a content category detection method provided in Embodiment 1 of the present invention. This embodiment is applicable to the situation where the category detection of the content to be detected is performed. The method can be executed by a category detection device, and the device is composed of software and / or or hardware implementation. see figure 1 The content category detection method provided in this embodiment specifically includes the following operations:
[0023] Operation 110, performing feature extraction on the content to be detected.
[0024] In this embodiment, the content to be detected may be pre-stored locally, or obtained in real time from other devices in text and / or picture format. For example, the content to be detected is web page content including text and / or image format obtained by parsing an HTML (HyperText Mark-up Language) page obtained from a server in the Internet.
[0025] For content in text form...
Embodiment 2
[0036] figure 2 It is a schematic flowchart of a content category detection method provided by Embodiment 2 of the present invention. On the basis of Embodiment 1 above, this embodiment adds the operation of obtaining the content to be detected, and further optimizes the above operation 110 based on this operation . see figure 2 The content category detection method provided in this embodiment specifically includes the following operations:
[0037] Operation 210. Obtain the web page content according to the uniform resource locator as the content to be detected;
[0038] Operation 220, if the webpage content contains text content, perform feature extraction on the text content based on the text feature extraction algorithm, and add the feature extraction result to the feature set of the webpage content;
[0039] Operation 230, if the webpage content includes picture content, perform target feature recognition on the picture content, establish a feature vector of the pict...
Embodiment 3
[0052] image 3 It is a schematic flowchart of a content category detection method provided by Embodiment 3 of the present invention. On the basis of the above-mentioned embodiments, this embodiment "determines the content corresponding to the content of the webpage according to the category detection results obtained by at least two classifiers." The operation of "final category detection results" is further optimized, and the operation of optimizing classifiers and their voting weights is added accordingly. see image 3 The content category detection method provided in this embodiment specifically includes the following operations:
[0053] Operation 310, performing feature extraction on the content to be detected;
[0054] Operation 320. According to the feature extraction result, use at least two classifiers suitable for the content to be detected to detect the category of the content to be detected;
[0055] Operation 330: Determine the final category detection result ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com