Forthcoming

Hybrid Feature Selection Framework for Machine Learning-based Bot Detection on Social Media

Authors

  • Amina Guendouz University of Blida 1, Blida, Algeria
  • Fatima Boumahdi University of Blida 1, Blida, Algeria https://orcid.org/0000-0001-6255-9713
  • Mohamed Abdelkarim Remmide University of Blida 1, Blida, Algeria
  • Abdelghani Foura University of Blida 1, Blida, Algeria
  • Amina Madani University of Blida 1, Blida, Algeria

DOI:

https://doi.org/10.26636/jtit.2026.2.2541

Keywords:

bot detection, feature selection, machine learning, social media

Abstract

Nowadays, social media impact all aspects of our lives, making us vulnerable to fraud and scams. Bots are believed to be the most prevalent form of malware that may be found in social media environments. New detection methods are required to keep up with the pace of their continuous advancement. This paper offers an overview of machine learning-based bot detection methods. The study revealed that the effectiveness of machine learning (ML) models can be significantly hindered by redundant and irrelevant features present in the datasets, which can lead to performance degradation. A hybrid feature selection (FS) combining characteristics of the genetic algorithm (GA) and the mutual information (MI) approach is proposed to overcome this challenge. The proposed method is evaluated using the following approaches: random forest (RF), decision tree (DT), support vector machine (SVM), and logistic regression (LR). Compared to the state-of-the-art models, the proposed method is capable of efficiently identifying bots using only a small number of features. For the dataset used, we achieved a classification accuracy of 0.99 using 4 features only.

Downloads

Download data is not yet available.

References

[1] D.A. Belokurov, E.S. Shamakova, and V.S. Kolomoitcev, "Using Machine Learning Techniques to Identify Bot Accounts on a Social Network", 2021 Wave Electronics and its Application in Information and Telecommunication Systems (WECONF), Saint Petersburg, Russia, 2021.
View in Google Scholar

[2] M. Aljabri et al., "Machine Learning-based Social Media Bot Detection: A Comprehensive Literature Review", Social Network Analysis and Mining, vol. 13, art. no. 20, 2023.
View in Google Scholar

[3] Z. Ellaky, F. Benabbou, and S. Ouahabi, "Systematic Literature Review of Social Media Bots Detection Systems", Journal of King Saud University-Computer and Information Sciences, vol. 35, art. no. 101551, 2023.
View in Google Scholar

[4] X. Wang et al., "Input Feature Selection Method Based on Feature Set Equivalence and Mutual Information Gain Maximization", IEEE Access, vol. 7, pp. 151525-151538, 2019.
View in Google Scholar

[5] K. Yang et al., "Arming the Public with Artificial Intelligence to Counter Social Bots", Human Behavior and Emerging Technologies, vol. 1, pp. 48-61, 2019.
View in Google Scholar

[6] E. Alothali, K. Hayawi, and H. Alashwal, "SEBD: A Stream Evolving Bot Detection Framework with Application of PAC Learning Approach to Maintain Accuracy and Confidence Levels", Applied Sciences, vol. 13, art. no. 4443, 2023.
View in Google Scholar

[7] E. Alothali, M. Salih, K. Hayawi, and H. Alashwal, "Bot-MGAT: A Transfer Learning Model Based on a Multi-view Graph Attention Network to Detect Social Bots", Applied Sciences, vol. 12, art. no. 8117, 2022.
View in Google Scholar

[8] S. Ye et al., "HOFA: Twitter Bot Detection with Homophily-oriented Augmentation and Frequency Adaptive Attention", arXiv, 2023 (https://arxiv.org/abs/2306.12870).
View in Google Scholar

[9] M. Heidari, J.H. Jones Jr, and O. Uzuner, "Online User Profiling to Detect Social Bots on Twitter", arXiv, 2022 (https://arxiv.org/abs/2203.05966).
View in Google Scholar

[10] I. Dimitriadis, K. Georgiou, and A. Vakali, "Social Botomics: A Systematic Ensemble ML Approach for Explainable and Multi-class Bot Detection", Applied Sciences, vol. 11, art. no. 9857, 2021.
View in Google Scholar

[11] D. Martin-Gutierrez et al., "A Deep Learning Approach for Robust Detection of Bots in Twitter Using Transformers", IEEE Access, vol. 9, pp. 54591-54601, 2021.
View in Google Scholar

[12] S.S. Sengar, S. Kumar, P. Raina, and M. Mahaliyan, "Bot Detection in Social Networks Based on Multilayered Deep Learning Approach", Sensors and Transducers, vol. 244, pp. 37-43, 2020.
View in Google Scholar

[13] H. Khalil, M.U. Khan, and M. Ali, "Feature Selection for Unsupervised Bot Detection", 2020 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET), Sukkur, Pakistan, 2020.
View in Google Scholar

[14] I. Mbona and J.H.P. Eloff, "Classifying Social Media Bots as Malicious or Benign Using Semi-supervised Machine Learning", Journal of Cybersecurity, vol. 9, art. no. tyac015, 2023.
View in Google Scholar

[15] A.A. Daya, M.A. Salahuddin, N. Limam, and R. Boutaba, "BotChase: Graph-based Bot Detection Using Machine Learning", IEEE Transactions on Network and Service Management, vol. 17, pp. 15-29, 2020.
View in Google Scholar

[16] A.A. Daya, M.A. Salahuddin, N. Limam, and R. Boutaba, "A Graph-based Machine Learning Approach for Bot Detection", arXiv, 2019.
View in Google Scholar

[17] P. Dhal and C. Azad, "A Comprehensive Survey on Feature Selection in the Various Fields of Machine Learning", Applied Intelligence, vol. 52, pp. 4543-4581, 2022.
View in Google Scholar

[18] Y. Li, T. Li, and H. Liu, "Recent Advances in Feature Selection and its Applications", Knowledge and Information Systems, vol. 53, pp. 551-577, 2017.
View in Google Scholar

[19] J.V.F. Abreu, C.G. Ralha, and J.J.C. Gondim, "Twitter Bot Detection with Reduced Feature Set", 2020 IEEE International Conference on Intelligence and Security Informatics (ISI), Arlington, USA, 2020.
View in Google Scholar

[20] E. Alothali, K. Hayawi, and H. Alashwal, "Hybrid Feature Selection Approach to Identify Optimal Features of Profile Metadata to Detect Social Bots in Twitter", Social Network Analysis and Mining, vol. 11, art. no. 84, 2021.
View in Google Scholar

[21] C. Cea, "Dataset for Supervised Bot Detection on Twitter (1.0)", Zenodo, 2021.
View in Google Scholar

[22] S. Katoch, S.S. Chauhan, and V. Kumar, "A Review on Genetic Algorithm: Past, Present, and Future", Multimedia Tools and Applications, vol. 80, pp. 8091-8126, 2021.
View in Google Scholar

[23] M.A. Remmide, F. Boumahdi, and N. Boustia, "Toward a Hybrid Approach Combining Deep Learning and Case-based Reasoning for Phishing Email Detection", International Journal on Artificial Intelligence Tools, vol. 33, art. no. 2450015, 2024.
View in Google Scholar

[24] M.A. Remmide, F. Boumahdi, B. Ilhem, and N. Boustia, "A Privacy-preserving Approach for Detecting Smishing Attacks Using Federated Deep Learning", International Journal of Information Technology, vol. 17, pp. 547-553, 2025.
View in Google Scholar

Downloads

Published

2026-06-01

Issue

Section

ARTICLES FROM THIS ISSUE

How to Cite

[1]
A. Guendouz, F. Boumahdi, M. A. . Remmide, A. Foura, and A. Madani, “Hybrid Feature Selection Framework for Machine Learning-based Bot Detection on Social Media”, JTIT, vol. 104, no. 2, pp. 40–47, Jun. 2026, doi: 10.26636/jtit.2026.2.2541.