Paper
7 May 2012 A method for the automated detection phishing websites through both site characteristics and image analysis
Joshua S. White, Jeanna N. Matthews, John L. Stacy
Author Affiliations +
Abstract
Phishing website analysis is largely still a time-consuming manual process of discovering potential phishing sites, verifying if suspicious sites truly are malicious spoofs and if so, distributing their URLs to the appropriate blacklisting services. Attackers increasingly use sophisticated systems for bringing phishing sites up and down rapidly at new locations, making automated response essential. In this paper, we present a method for rapid, automated detection and analysis of phishing websites. Our method relies on near real-time gathering and analysis of URLs posted on social media sites. We fetch the pages pointed to by each URL and characterize each page with a set of easily computed values such as number of images and links. We also capture a screen-shot of the rendered page image, compute a hash of the image and use the Hamming distance between these image hashes as a form of visual comparison. We provide initial results demonstrate the feasibility of our techniques by comparing legitimate sites to known fraudulent versions from Phishtank.com, by actively introducing a series of minor changes to a phishing toolkit captured in a local honeypot and by performing some initial analysis on a set of over 2.8 million URLs posted to Twitter over a 4 days in August 2011. We discuss the issues encountered during our testing such as resolvability and legitimacy of URL's posted on Twitter, the data sets used, the characteristics of the phishing sites we discovered, and our plans for future work.
© (2012) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Joshua S. White, Jeanna N. Matthews, and John L. Stacy "A method for the automated detection phishing websites through both site characteristics and image analysis", Proc. SPIE 8408, Cyber Sensing 2012, 84080B (7 May 2012); https://doi.org/10.1117/12.918956
Lens.org Logo
CITATIONS
Cited by 17 scholarly publications and 12 patents.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image analysis

Image processing

Visualization

Web 2.0 technologies

Databases

Distance measurement

Current controlled current source

Back to Top