摘要:Web phishing is becoming an increasingly severe security threat in the web domain. Effective and efficientphishing detection is very important for protecting web users from loss of sensitive private information andeven personal properties. One of the keys of phishing detection is to efficiently search the legitimate webpage library and to find those page that are the most similar to a suspicious phishing page. Most existingphishing detection methods are focused on text and/or image features and have paid very limited attentionto spatial layout characteristics of web pages. In this paper, we propose a novel phishing detection methodthat makes use of the informative spatial layout characteristics of web pages. In particular, we develop twodifferent options to extract the spatial layout features as rectangle blocks from a given web page. Giventwo web pages, with their respective spatial layout features, we propose a page similarity definition thattakes into account their spatial layout characteristics. Furthermore, we build an R-tree to index all thespatial layout features of a legitimate page library. As a result, phishing detection based on the spatiallayout feature similarity is facilitated by relevant spatial queries via the R-tree. A series of simulationexperiments are conducted to evaluate our proposals. The results demonstrate that the proposed novelphishing detection method is effective and efficient