Method for detecting counterfeit webpage

A technology for counterfeiting web pages and detection methods, applied in the field of information security, can solve the problems of complex calculation, high dimension, and low similarity matching accuracy.

Inactive Publication Date: 2011-02-09
NORTH CHINA ELECTRIC POWER UNIV (BAODING)
View PDF3 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to provide a method for detecting counterfeit webpages, which solves the problems of insufficient processing of image webpages, high dimensionality resulting in complex calculations and low similarity matching accuracy due to the current purely document-based Phishing webpage detection technology.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for detecting counterfeit webpage
  • Method for detecting counterfeit webpage
  • Method for detecting counterfeit webpage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] The preferred embodiments will be described in detail below in conjunction with the accompanying drawings. It should be emphasized that the following description is only exemplary and not intended to limit the scope of the invention and its application.

[0054] Webpage image segmentation effect is the premise of obtaining fake webpage detection results. For this reason, the present invention introduces the spectral clustering method to segment the web page image. Since RGB is a very uneven color space, and R, G, and B components have a high degree of correlation, the web page image is first transformed from the RGB space to HSI (H means chroma, expressed by angle, S means saturation, I means brightness) space, and then perform spectral clustering and segmentation. Extract features from the segmented subimages, including subimage boundaries, colors, grayscale and other features. When extracting grayscale features, it is necessary to convert the subimages from HSI space...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for detecting a counterfeit webpage, which relates to the technical field of information safety and is used for improving the detection accuracy and detection efficiency of a phishing attack website. The method comprises the following steps of: converting a webpage to be detected into a webpage image and converting the webpage image from a red-green-blue (RGB) space into a hue-saturation-intensity (HSI) space; partitioning the converted webpage image by a spectral clustering method; extracting the characteristic vector of each sub-image after partitioning; extracting the characteristic vector of the position relation between adjacent sub-images after portioning; combining the characteristic vector of each sub-image with the characteristic vector of the position relation between the adjacent sub-images to form a webpage image characteristic vector and performing dimensionality reduction on the webpage image characteristic vector by a kernel principal component analysis method so as to acquire a characteristic space; training and testing the characteristic space by using a transductive support vector machine classifier; and judging whether the webpage to be detected is the counterfeit webpage or not according to the classification result of the classifier. The method has high adaptability and high detection accuracy in the aspect of the detection of the phishing attack website.

Description

technical field [0001] The invention belongs to the technical field of information security, in particular to a method for detecting counterfeit webpages. Background technique [0002] The rapid development of network applications has made online transactions and e-banking an important business model, and private information such as online user accounts has become extremely important. Phishing is a method of defrauding legitimate users of account passwords, etc. through fake websites. Fraudulent process for sensitive information. Phishing webpage detection technology is the process of detecting counterfeit webpages by identifying the characteristics of webpages. At present, most of the Phishing web page detection technologies are based on the document model, but due to the flexibility of the HTML language and the dynamic nature of web page elements, counterfeiters can make web pages that look the same but have completely different structures. In this regard, using the Phis...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F17/30
Inventor 李元诚赵留军
Owner NORTH CHINA ELECTRIC POWER UNIV (BAODING)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products