A Machine Learning Approach for Phishing Attack Detection

A Machine Learning Approach for Phishing Attack Detection

Authors

  • Tarun Choudhary Terna Engineering College, Nerul, Navi Mumbai, India
  • Siddhesh Mhapankar Terna Engineering College, Nerul, Navi Mumbai, India
  • Rohit Bhddha Terna Engineering College, Nerul, Navi Mumbai, India
  • Ashish Kharuk Terna Engineering College, Nerul, Navi Mumbai, India
  • Dr. Rohini Patil Terna Engineering College, Nerul, Navi Mumbai, India https://orcid.org/0000-0001-9476-3559

DOI:

https://doi.org/10.37965/jait.2023.0197

Keywords:

legitimate, machine learning, malicious, phishing, URL, website

Abstract

Phishing is the easiest method for gathering sensitive information from unwary people. Phishers seek to get private data including passwords, login information, and bank account details. Cyber security experts are actively seeking for trustworthy and effective ways to identify phishing websites. In order to distinguish between legal and phishing URLs, we used machine learning (ML) technology. In this research work, using ML technology extraction and analysis of both types of URLs was performed. Extreme Gradient Boosting, Decision Tree, Logistic Regression, Random Forest (RF), and Support Vector Machine were used to identify phishing websites. The goal was to identify phishing URLs and determine the most effective ML technique by comparing the accuracy rates of each algorithm. In this, proposed methodology two datasets were used. The accuracy of models was calculated on PhishTank and UCI dataset using K-fold, feature selection and hyperparameter tuning method. Performance measures precision, recall, F1-score, and receiver operating characteristics (ROC) curve were calculated. RF provided an accuracy of 98.80% and 97.87% on the PhishTank dataset and UCI, respectively. Highest precision, recall, and F1-score value were 99% each, and AUC-ROC value was 99.89% with PhishTank dataset. Validation with other researchers showed better results with proposed methodology. Therefore, this methodology can be of help to identify phishing websites.

Metrics

Metrics Loading ...

Downloads

Published

2023-05-10

How to Cite

Choudhary, T., Mhapankar, S., Bhddha, R., Kharuk, A., & Dr. Rohini Patil. (2023). A Machine Learning Approach for Phishing Attack Detection. Journal of Artificial Intelligence and Technology, 3(3), 108–113. https://doi.org/10.37965/jait.2023.0197

Issue

Section

Research Articles
Loading...