PDGAN: Phishing Detection With Generative Adversarial Networks


Bibliographic Details
Published in	IEEE Access, Vol. 10, pp. 42459-42468
Main Authors	Al-Ahmadi, Saad; Alotaibi, Afrah; Alsaleh, Omar
Format	Journal Article
Language	English
Published	Piscataway: The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 2022
Subjects	Accuracy; Artificial neural networks; Convolutional Neural Network (CNN); Convolutional neural networks; Deep learning; Feature extraction; generative adversarial network (GAN); Generative adversarial networks; long short-term memory network (LSTM); Machine learning; Payment systems; Phishing; phishing website detection; Theft; Uniform resource locators; URLs; Websites

More Information
Summary:	Phishing is a harmful online attack that can lead to identity theft and financial loss. Demand for high-accuracy phishing detection tools has risen with the growth of online electronic services and payment systems. Most phishing detection techniques depend on features related to webpage content, which requires crawling the webpage and relying on third-party services. Relying on content-based features does not provide high detection accuracy and leads to high false detection rates. Recently, deep learning has become a popular approach for detecting phishing websites; however, limited attention has been given to the generative adversarial network (GAN). This paper proposes a phishing detection model called PDGAN that depends only on a website's uniform resource locator (URL) to achieve reliable performance. We use a long short-term memory (LSTM) network as the generator of synthetic phishing URLs and a convolutional neural network (CNN) as the discriminator that decides whether a URL is phishing or legitimate. We use a dataset containing nearly two million phishing and legitimate URLs obtained through PhishTank and DomCop. The experimental results show that PDGAN achieves a detection accuracy of 97.58% and a precision of 98.02% without depending on third-party services, and with greater accuracy than the state-of-the-art models.
ISSN:	2169-3536
DOI:	10.1109/ACCESS.2022.3168235
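
Illustrative note: the summary states that PDGAN works from the URL string alone, feeding URLs to an LSTM generator and a CNN discriminator. This record does not describe how URLs are turned into model input, so the sketch below is only one plausible approach, assuming character-level integer encoding with fixed-length padding. The vocabulary, the names `encode_url` and `decode_ids`, and the `max_len` default are all hypothetical and are not taken from the paper.

```python
import string
from typing import List

# Assumed vocabulary (not from the paper): lowercase letters, digits, and the
# punctuation characters that commonly appear in URLs.
VOCAB = string.ascii_lowercase + string.digits + "-._~:/?#[]@!$&'()*+,;=%"
PAD, UNK = 0, 1  # id 0 pads short URLs, id 1 marks out-of-vocabulary characters
CHAR_TO_ID = {ch: i + 2 for i, ch in enumerate(VOCAB)}
ID_TO_CHAR = {i: ch for ch, i in CHAR_TO_ID.items()}


def encode_url(url: str, max_len: int = 200) -> List[int]:
    """Map a URL to a fixed-length list of integer ids (truncate, then pad)."""
    ids = [CHAR_TO_ID.get(ch, UNK) for ch in url.lower()[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


def decode_ids(ids: List[int]) -> str:
    """Invert encode_url: drop padding and render unknown ids as a space."""
    return "".join(ID_TO_CHAR.get(i, " ") for i in ids if i != PAD)


if __name__ == "__main__":
    ids = encode_url("http://example.com/login", max_len=32)
    print(len(ids), decode_ids(ids))
```

Fixed-length integer sequences of this kind are a common input format for both recurrent and convolutional models, typically followed by an embedding layer; whether PDGAN uses this exact scheme is not stated in the summary.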