Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for detecting phishing web pages with spatial mixed index mechanism

A technology of spatial mixing and phishing webpages, which is applied in the field of information security and can solve problems such as slow speed

Inactive Publication Date: 2012-09-12
NANJING UNIV OF POSTS & TELECOMM
View PDF3 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] At present, the identification of phishing websites mainly relies on computer automatic identification and manual identification. Manual identification uses a blacklist mechanism. Users report a certain website and manually identify whether it is a phishing website. This is obviously too slow.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for detecting phishing web pages with spatial mixed index mechanism
  • Method for detecting phishing web pages with spatial mixed index mechanism
  • Method for detecting phishing web pages with spatial mixed index mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The present invention combines the browser rendering engine to extract the visual layout features of the designated suspicious web pages, and then uses the spatial database index to synchronously combine the text features and image features of the web pages, that is, uses the spatial tree DIIR tree of the integrated file image inverted index, Its structure is as figure 2 As shown, in order to accurately and quickly find the layout features with similar spatial positions and visual similarities, and find the most similar legitimate web pages in the sample space through statistical analysis, so as to achieve the purpose of phishing web page detection. For the overall flow chart, see figure 1 .

[0055] 1. Using the spatial tree DIIR tree of the integrated file image inverted index is to improve the spatial region R tree of the spatial index mechanism, and add the inverted index file and image features of the text in the network object to each node of the spatial region ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for detecting phishing web pages based on a spatial mixed index mechanism, which comprehensively utilizes spatial layout, character features and image features of the web pages. The method relates to a design scheme based on visual layout features of pages and aspatial database, solving the problems of detecting phishing web pages rapidly according to visual similarities of web pages. Combined with a rendering engine of a browser, the method carries out feature extraction of visual layout for appointed suspicious web pages and utilizes spatial database index combined with text and image features of the web pages simultaneously to form a spatial tree i.e., a DIIR tree, wherein the DIIR tree is a reverse index of comprehensive document images in the spatial mixed index mechanism. The DIIR tree improves an R tree of a spatial area in the spatial index mechanism by adding reverse index files of characters and image features of network objects to each node of the R tree of the spatial area. When querying a new network object, not only the spatial layout feature of the object is considered, but also text and image features of the network object are simultaneously combined.

Description

technical field [0001] The invention relates to a method for detecting phishing webpages, mainly from the perspective of webpage visual layout similarity, synchronously combining text features, image features and spatial layout features of webpages, and matching and identifying phishing webpages based on a spatial hybrid index mechanism, which belongs to information security field. Background technique [0002] Phishing website is an online fraud that has become extremely rampant with the popularity of the Internet and the increase in online transactions. Phishing websites are fraudulent websites made by criminals. Phishing websites are usually almost identical to banking websites or other well-known websites, thereby luring website users to submit sensitive information on the phishing website, such as: user name, password, bank account number or credit card Details etc. [Zhang2007]. [0003] The most typical phishing attack process is as follows: first, lure users to a ca...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30H04L29/06
Inventor 张卫丰王慕妮周国强张迎周田先桃周国富陆柳敏许碧欢顾赛赛
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products