System and method for filtering text information of webpage

A text information and filtering system technology, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve the problems of the information filtering mechanism of keywords cannot be identified, bad information cannot be filtered, and relevance is not considered. The effect of improving reusability and scalability

Inactive Publication Date: 2012-04-04
SHANGHAI DIANJI UNIV
View PDF2 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Since criminals generally do not take the initiative to label the bad information they disseminate in accordance with the PICS standard, PICS-based filtering is not effective in practical applications; the database filtering method cannot filter many bad information that is parasitic in comprehensive websites. It is also impossible to filter websites containing bad information that frequently change IP and URL, or adopt multi-level proxy methods; bad information filtering technology based on keywords can obtain fa

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for filtering text information of webpage
  • System and method for filtering text information of webpage
  • System and method for filtering text information of webpage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The implementation of the present invention is described below through specific examples and in conjunction with the accompanying drawings, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific examples, and various modifications and changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention.

[0048] figure 1 It is a system architecture diagram of a webpage text information filtering system of the present invention. Such as figure 1 As shown, a webpage text information filtering system of the present invention includes at least a webpage browsing terminal 10 , a proxy server 20 , a network host 30 and a text filtering center module 40 .

[0049]Wherein the web page browsing ter...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a system and a method for filtering the text information of a webpage. The system comprises a webpage browsing terminal, a proxy server, a network host and a text filtering center module, wherein the webpage browsing terminal receives, analyzes and sends a target request through a browser; the proxy server receives the target request, sends the target request to the network host, acquires returned source code information and sends the source code information to the text filtering center module for filtering; meanwhile, the proxy server is used for receiving a filtering result which is returned by the text filtering center module; the network host is used for receiving the target request and returning the source code information; and the text filtering center module is used for analyzing, filtering and determining the source code information and returning the filtering result. The invention has the advantages that: by using an object-oriented programming idea, a text is filtered and developed; by combining various data structures, the system is fully optimized; modules are independent of one another; and the reusability and the expandability of the system are greatly improved.

Description

technical field [0001] The invention relates to a webpage information filtering system and method, in particular to a webpage text information filtering system and method capable of filtering bad content in webpage text information. Background technique [0002] At present, there are mainly four kinds of filtering technologies for web page content identification at home and abroad, namely filtering based on Internet Content Classification Platform (PICS), database filtering (IP library, URL library), keyword filtering and intelligent content understanding filtering. [0003] Since criminals generally do not take the initiative to label the bad information they disseminate in accordance with the PICS standard, PICS-based filtering is not effective in practical applications; the database filtering method cannot filter many bad information that is parasitic in comprehensive websites. It is also impossible to filter websites containing bad information that frequently change IP a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 朱一群徐涛刘兰保
Owner SHANGHAI DIANJI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products