Real-time browser-integrated phishing uniform resource locator detection via deep learning and fuzzy matching
Dam Minh Linh, Han Minh Chau, Huynh Trong Thua, Tran Cong Hung
Abstract
Phishing attacks through deceptive URLs remain a critical cybersecurity threat, particularly in financial transactions and online payment systems. This study evaluates multiple deep learning (DL) models on the PhiUSIIL dataset of 235,795 URLs, with bidirectional gated recurrent unit (BiGRU) achieving the best performance—99.82% accuracy at a 60:40 split, along with high precision, F1-score, and the lowest test loss. To further improve detection of obfuscated URLs, an enhanced BiGRU variant is proposed using an expanded 366-character vocabulary. For real-time deployment, a Chrome extension is developed, integrating exact and fuzzy matching via the Ratcliff–Obershelp algorithm with cloud-based whitelist and blacklist checks. When fuzzy matching is inconclusive, the BiGRU model performs the final classification. By combining an adaptive browser-side tool with a robust DL backend, the proposed system ensures high accuracy, scalability, and efficiency for phishing detection in practical web environments.
Keywords
Blacklist and whitelist URLs, Character vocabulary, Deep neural networks, Fuzzy string matching, Online URL classification, Phishing prevention
DOI:
https://doi.org/10.11591/eei.v14i6.10099
Refbacks
There are currently no refbacks.
This work is licensed under a
Creative Commons Attribution-ShareAlike 4.0 International License .
<div class="statcounter"><a title="hit counter" href="http://statcounter.com/free-hit-counter/" target="_blank"><img class="statcounter" src="http://c.statcounter.com/10241695/0/5a758c6a/0/" alt="hit counter"></a></div>
Bulletin of EEI Stats
Bulletin of Electrical Engineering and Informatics (BEEI) ISSN: 2089-3191 , e-ISSN: 2302-9285 This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU) .