DEVELOPMENT OF A TEXT-BASED MODEL FOR DETECTING AND PREVENTING PHISHING ATTACKS USING DEEP LEARNING

Ayomide Seyi-Ayodele; Sunday Eric Adewumi; Victoria Ifeoluwa Yemi-Peters

doi:10.33003/fjs-2025-0909-4031

Authors

Ayomide Seyi-Ayodele
ayomideseyiayodele@gmail.com

Federal University Lokoja
Sunday Eric Adewumi
Federal University Lokoja
Victoria Ifeoluwa Yemi-Peters
Federal University Lokoja

Keywords:

Cyber threats, Deep learning, Machine learning, Models, Phishing attacks

Abstract

Phishing attacks are cyber threats that have persisted over the years, and are often disguised as legitimate messages. Different studies have been focused on the detection of attempted phishing attacks, and significant milestone has been reached. However, previous studies rely mostly on URL-based phishing detection, with machine learning and deep learning models, but limited studies have focused on text-based phishing detection, and adoption of hybrid deep learning and transformer based models. Addressing these gaps, this study focused on the development of a text based phishing detection modes using a hybrid of CNN-LSTM with attention mechanism, and a transformer-based BERT model. Deep learning based models were train separately on a publicly available dataset collected from Kaggle. The dataset was pre-processed and used to train the two models. The model was evaluated using standard evaluation metrics. The experimental result shows that the CNN-LSTM-Attention model outperformed the BERT model across all evaluation metrics used, achieving 92.77% accuracy, 93.00% precision, 92.00% recall, 92.00% F1-score, and 93.00% AUC, while the BERT model had average performance with 76.55% accuracy, 74.00% accuracy, 75.00% precision, 73% recall, 75.00%, F1-scores, and 81.70% AUC. This study was compared with the baseline study that used the K-Nearest Neighbour (KNN) on URL-based features. Our study, CNN-LSTM-Attention model demonstrated superior performance. This study shows that hybrid deep learning with attention mechanism approach is highly effective in the detection of phishing attacks. Future research can focus on exploring multilingual capabilities, integratingbehavioural feature, and real-time model deployment to enhancecyber security.

Dimensions

REFERENCES

Abed, L. H., Mohammed, H. J., & Yaseen, Y. S. (2023). Phishing identification through up-to-date features generation and exploration. International Journal on Technical and Physical Problems of Engineering, 15(3).

Alam, M. N., Sarma, D., Lima, F. F., Saha, I., Ulfath, R.-E. -, & Hossain, S. (2020). Phishing attacks detection using machine learning approach. 2020 Third International Conference on Smart Systems and Inventive Technology (ICSSIT), 1173–1179. https://doi.org/10.1109/icssit48917.2020.9214225

Alanezi, M. (2021). Phishing detection methods: a review. Technium: Romanian Journal of Applied Sciences and Technology, 3(9), 19–35. https://doi.org/10.47577/technium.v3i9.4973

Aldakheel, E. A., Zakariah, M., Gashgari, G. A., Almarshad, F. A., & Alzahrani, A. I. A. (2023). A deep learning-based innovative technique for phishing detection in modern security with uniform resource Locators. Sensors, 23(9), 4403. https://doi.org/10.3390/s23094403

Alshingiti, Z., Alaqel, R., Al-Muhtadi, J., Haq, Q. E. U., Saleem, K., & Faheem, M. H. (2023). A deep learning-based phishing detection system using CNN, LSTM, and LSTM-CNN. Electronics, 12(1), 232. https://doi.org/10.3390/electronics12010232

APWG. (2024). Phishing activity trends report. Phishers combining tactics and resources in attacks. https://docs.apwg.org/reports/apwg_trends_report_q2_2024

Atawneh, S., &Aljehani, H. (2023). Phishing email detection model using deep learning. Electronics, 12(20), 4261. https://doi.org/10.3390/electronics12204261

Benavides-Astudillo, E., Fuertes, W., Sanchez-Gordon, S., Nuñez-Agurto, D., & Rodríguez-Galán, G. (2023). A phishing-attack-detection model using natural language processing and deep learning. Applied Sciences, 13(9), 5275. https://doi.org/10.3390/app13095275

Etuh, E., S. Bakpo, F., & A.H, E. (2021). Social Media Network Attacks and their Preventive Mechanisms: A Review. https://doi.org/10.5121/csit.2021.112405

Feng, J., Zou, L., Ye, O., & Han, J. (2020). Web2Vec: Phishing Webpage Detection Method Based on Multidimensional Features Driven by Deep Learning. IEEE Access, 8, 221214–221224. https://doi.org/10.1109/access.2020.3043188

Gallo, L., Gentile, D., Ruggiero, S., Botta, A., &Ventre, G. (2024). The human factor in phishing: Collecting and analyzing user behavior when reading emails. Computers and Security, 139. https://doi.org/10.1016/j.cose.2023.103671

Geest, R. J., Cascavilla, G., Hulstijn, J., & Zannone, N. (2024). The applicability of a hybrid framework for automated phishing detection. Computers & Security, 139, 103736. https://doi.org/10.1016/j.cose.2024.103736

Ige, T., Kiekintveld, C., &Piplai, A. (2024). Deep learning-based speech and vision synthesis to improve phishing attack detection through a multi-layer adaptive framework. ArXiv (Cornell University). https://doi.org/10.48550/arxiv.2402.17249

Jayaraj, R., Pushpalatha, A., Sangeetha, K., Kamaleshwar, T., Udhaya Shree, S., & Damodaran, D. (2024). Intrusion detection based on phishing detection with machine learning. Measurement: Sensors, 31, 101003. https://doi.org/10.1016/j.measen.2023.101003

Mehndiratta, M., Jain, N., Malhotra, A., Gupta, I., & Narula, R. (2023). Malicious URL: Analysis and Detection using Machine Learning. Proceedings of the 17th INDIACom; 2023 10th International Conference on Computing for Sustainable Global Development, INDIACom 2023.

Mohamed, N. (2023). Current trends in AI and ML for cybersecurity: A state-of-the-art survey. Cogent Engineering, 10(2). https://doi.org/10.1080/23311916.2023.2272358

NajwaAltwaijry, Isra Al-Turaiki, Alotaibi, R., &Alakeel, F. (2024). Advancing phishing email detection: a comparative study of deep learning models. Sensors, 24(7), 2077–2077. https://doi.org/10.3390/s24072077

Osemwegie, E. E., & Amadin, F. I. (2023). Student dropout prediction using machine learning. FUDMA Journal of Sciences, 7(6), 347–353. https://doi.org/10.33003/fjs-2023-0706-2103

Prasad, A., & Chandra, S. (2024). PhiUSIIL: A diverse security profile empowered phishing URL detection framework based on similarity index and incremental learning. Computers and Security, 136. https://doi.org/10.1016/j.cose.2023.103545

Rashid, J., Mahmood, T., Nisar, M. W., & Nazir, T. (2020). Phishing Detection Using Machine Learning Technique. IEEE Xplore. https://doi.org/10.1109/SMART-TECH49988.2020.00026

Saha, I., Sarma, D., Chakma, R. J., Alam, M. N., Sultana, A., & Hossain, S. (2020). Phishing Attacks Detection using Deep Learning Approach. IEEE Xplore. https://doi.org/10.1109/ICSSIT48917.2020.9214132

Sultan Asiri, Xiao, Y., Alzahrani, S., & Li, T. (2024). PhishingRTDS: A Real-time Detection System for Phishing Attacks Using a Deep Learning Model. Computers & Security, 141, 103843–103843. https://doi.org/10.1016/j.cose.2024.103843

DEVELOPMENT OF A TEXT-BASED MODEL FOR DETECTING AND PREVENTING PHISHING ATTACKS USING DEEP LEARNING

Authors

Keywords:

Abstract

REFERENCES

Published

How to Cite

Issue

Section

How to Cite

Most read articles by the same author(s)

Make a Submission

Browse

Developed By

Information

Latest publications