A sample training system based on an IDC harmful information monitoring system

A sample training technology for harmful information monitoring, applied in the field of sample training systems. It addresses the low coverage, poor timeliness, and irrelevant results of existing approaches, improving the accuracy and effectiveness of monitoring and enabling rapid location and analysis of harmful information.

Active Publication Date: 2018-09-11
CHENGDU GOLDTEL IND GROUP
2 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] However, the limitations of traditional search engines, such as low coverage, poor timeliness, inaccurate results, and too many irrelevant results, are gradually becoming apparent.
The IDC system cannot accurately and effectively monitor harmful information.

Embodiment Construction

[0044] The technical solution of the present invention will be further described in detail below in conjunction with the accompanying drawings, but the protection scope of the present invention is not limited to the following description.

[0045] The sample training system is based on the IDC harmful information monitoring system and includes a crawler system and a harmful information monitoring system. The harmful information monitoring system obtains web page data from the Internet data center (IDC) through the crawler system and analyzes it for harmful content.

[0046] (1) Crawler system

[0047] As shown in Figure 1, the crawler system is responsible for discovering and crawling raw data from the Internet and normalizing it. Depending on the Internet applications covered, it includes one or more crawler clusters. Each crawler cluster includes multiple crawler nodes and a crawler root node, forming a distributed data collection network, where the crawler root node is...

Abstract

The invention discloses a sample training system based on an IDC harmful information monitoring system. In the crawler sample training unit, the topic correlation calculation module combines the web page information captured by the crawler system with the crawler sample database to calculate the topic correlation of each web page, adjusts the URL queue according to the topic correlation, filters out URLs below the preset threshold, and feeds the calculated topic correlation value back to the crawler sample training module. After training and learning, the crawler sample training module updates the crawler sample database. In the harmful monitoring sample training unit, the harmful information monitoring system performs harmful information detection using the approximate vocabulary that the approximate matching algorithm generates for the input string. The keyword approximate vocabulary training module uses the accuracy of the search results, as determined by the search result fitting degree calculation module, to judge the similarity of the approximate words, and updates the effective approximate words to the harmful monitoring sample database.
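
The two steps named in the abstract, topic correlation filtering of the URL queue and approximate vocabulary generation, can be illustrated with a minimal Python sketch. The abstract does not say how the correlation is calculated or which approximate matching algorithm is used, so everything here is an assumption made for illustration: cosine similarity over term counts stands in for the topic correlation, the standard library's `difflib.get_close_matches` stands in for the approximate matching algorithm, and all function names, the threshold, and the cutoff are hypothetical.

```python
import math
import re
from collections import Counter
from difflib import get_close_matches


def tokenize(text):
    """Split text into lowercase word tokens (a stand-in for real word segmentation)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def topic_relevance(page_text, sample_texts):
    """Assumed scoring: cosine similarity between a page and the pooled sample database."""
    page = Counter(tokenize(page_text))
    topic = Counter()
    for sample in sample_texts:
        topic.update(tokenize(sample))
    # Counter returns 0 for terms that are missing from the sample database.
    dot = sum(count * topic[term] for term, count in page.items())
    norm = math.sqrt(sum(c * c for c in page.values())) * math.sqrt(
        sum(c * c for c in topic.values())
    )
    return dot / norm if norm else 0.0


def filter_url_queue(candidates, sample_texts, threshold):
    """Drop URLs scoring below the preset threshold and order the rest by score.

    candidates is a list of (url, page_text) pairs. The returned scores are the
    values that would be fed back to the sample training module.
    """
    scored = [(topic_relevance(text, sample_texts), url) for url, text in candidates]
    kept = [(score, url) for score, url in scored if score >= threshold]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return [(url, score) for score, url in kept]


def approximate_words(keyword, vocabulary, cutoff=0.7):
    """Assumed matcher: vocabulary entries similar to the input string."""
    return get_close_matches(keyword, vocabulary, n=5, cutoff=cutoff)
```

Calling `filter_url_queue` with a list of `(url, page_text)` pairs and the texts from a sample database returns only the URLs at or above the threshold, highest score first. `approximate_words` returns the vocabulary entries close enough to the keyword to be candidates for the harmful monitoring sample database.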

Description

Technical field

[0001] The invention relates to a sample training system based on an IDC harmful information monitoring system.

Background

[0002] With the rapid development of the network, the World Wide Web has become the carrier of a large amount of information, and effectively extracting and using this information has become a huge challenge. As tools that help people retrieve information, search engines have become the entrance and guide for users accessing the World Wide Web. However, these general-purpose search engines also have certain limitations.

[0003] In an increasingly active online community environment, every netizen may become a publisher and disseminator of harmful information, and the channels through which harmful information spreads on the Internet are becoming ever wider, including blogs, news, forums, Weibo, and other channels. Web crawlers are the pioneering technology that makes various search engines possible. The advent of the era of big data ...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30, G06K9/66
CPC: G06F16/9535, G06F16/9566, G06V30/194
Inventor: 彭光辉, 屈立笳, 陶磊, 苏礼刚, 林伟
Owner: CHENGDU GOLDTEL IND GROUP