Wubetu Barud Demilie

@wcu.edu.et

Department of Information Technology
Wachemo University, Hossana, Ethiopia

Wubetu Barud Demilie
Wubetu Barud Demilie graduated from Haramaya and Jimma Universities in the field of Information Technology with both BSc and MSc. degrees in the 2013 and 2017 academic years respectively. He has served in the field of education for over 9 years and is currently in service at Wachemo University, in the College of Engineering and Technology, Department of Information Technology, Hossana, Ethiopia. He has held administrative positions at the department, school, and college levels in his career, like the head of the department. He has actively participated in community-based training programs and research works. Accordingly, he has published many research works in different international reputable journals. Currently, he is working as an Assistant Professor position as a researcher and lecturer.

RESEARCH INTERESTS

I have identified my research interests as computer vision (CV), natural language processing (NLP), image processing, information retrieval (IR), deep learning/machine learning (DL/ML), and artificial intelligence (AI).
9

Scopus Publications

752

Scholar Citations

10

Scholar h-index

10

Scholar i10-index

Scopus Publications

  • News Classification in Low-Resource Languages: Insights From Transformer and Baseline Models
    Wubetu Barud Demilie, Ali Zia
    Concurrency and Computation Practice and Experience, 2026
    News text classification in low‐resource languages such as Amharic is challenging due to limited annotated data, rich morphology, class imbalance, and strong semantic overlaps (SOs) among news categories. Addressing these challenges is critical for reliable information organization in socially important domains, including media, education, policymaking, and so forth. In this work, we propose a unified framework for Amharic multi‐class news classification that integrates two state‐of‐the‐art (SOTA) Transformer‐based models with two traditional machine learning approaches under identical experimental setups. Specifically, we fine‐tune AfriBERTa, a monolingual RoBERTa‐based model pre‐trained on African languages, and AfroXLMR, a multilingual variant of XLM‐R, and compare them with TF‐IDF combined with logistic regression and Word2Vec combined with a multi‐layer perceptron. To the best of our knowledge, this is the first study to jointly benchmark these Transformer‐based and traditional models on an Amharic news dataset using stratified five‐fold cross‐validation (CV) and class‐balanced training. To enhance interpretability and real‐world usability, we deploy a real‐time Gradio‐based graphical user interface that exposes class‐wise probability distributions, enabling transparent analysis of SO across classes. The experimental results on a hold‐out test set show that AfriBERTa achieves the best performance, with a macro F 1 score of 94.12%, followed by AfroXLMR with 92.42%, while TF‐IDF + LR and Word2Vec + MLP achieve macro F 1 scores of 90.34% and 88.17%, respectively. All results are validated through statistical significance testing and comparative evaluation against zero‐shot large language models (LLMs), including ChatGPT‐4o, Gemini Pro, and Claude 3, where the proposed models consistently outperform due to language‐specific adaptability. Macro F 1 is used as the primary evaluation metric to ensure fair assessment under class imbalance and SO. Overall, this work provides a reproducible and interpretable benchmark for low‐resource news classification and contributes to research in explainable artificial intelligence and African language processing, with future directions including multimodal news text classifications.
  • Plant disease detection and classification techniques: a comparative study of the performances
    Wubetu Barud Demilie
    Journal of Big Data, 2024
    One of the essential components of human civilization is agriculture. It helps the economy in addition to supplying food. Plant leaves or crops are vulnerable to different diseases during agricultural cultivation. The diseases halt the growth of their respective species. Early and precise detection and classification of the diseases may reduce the chance of additional damage to the plants. The detection and classification of these diseases have become serious problems. Farmers’ typical way of predicting and classifying plant leaf diseases can be boring and erroneous. Problems may arise when attempting to predict the types of diseases manually. The inability to detect and classify plant diseases quickly may result in the destruction of crop plants, resulting in a significant decrease in products. Farmers that use computerized image processing methods in their fields can reduce losses and increase productivity. Numerous techniques have been adopted and applied in the detection and classification of plant diseases based on images of infected leaves or crops. Researchers have made significant progress in the detection and classification of diseases in the past by exploring various techniques. However, improvements are required as a result of reviews, new advancements, and discussions. The use of technology can significantly increase crop production all around the world. Previous research has determined the robustness of deep learning (DL) and machine learning (ML) techniques such as k-means clustering (KMC), naive Bayes (NB), feed-forward neural network (FFNN), support vector machine (SVM), k-nearest neighbor (KNN) classifier, fuzzy logic (FL), genetic algorithm (GA), artificial neural network (ANN), convolutional neural network (CNN), and so on. Here, from the DL and ML techniques that have been included in this particular study, CNNs are often the favored choice for image detection and classification due to their inherent capacity to autonomously acquire pertinent image features and grasp spatial hierarchies. Nevertheless, the selection between conventional ML and DL hinges upon the particular problem, the accessibility of data, and the computational capabilities accessible. Accordingly, in numerous advanced image detection and classification tasks, DL, mainly through CNNs, is preferred when ample data and computational resources are available and show good detection and classification effects on their datasets, but not on other datasets. Finally, in this paper, the author aims to keep future researchers up-to-date with the performances, evaluation metrics, and results of previously used techniques to detect and classify different forms of plant leaf or crop diseases using various image-processing techniques in the artificial intelligence (AI) field.
  • Automated all in one misspelling detection and correction system for Ethiopian languages
    Wubetu Barud Demilie, Ayodeji Olalekan Salau
    Journal of Cloud Computing, 2022
    In this paper, a misspelling detection and correction system was developed for Ethiopian languages (Amharic, Afan Oromo, Tigrinya, Hadiyyisa, Kambatissa, and Awngi). For some of these languages, there have been few works on typo detection and correction systems. However, an effective and all-in-one typo detector and corrector system for Ethiopian languages have yet to be developed. A dictionary-based methodology is used to detect and rectify various forms of misspelling-related issues. The major characteristics of the proposed model can be outlined by presenting suggestions for detected flaws and automatically correcting them utilizing the first suggestion. In addition, the proposed model is evaluated using dictionary-based data sets for all languages. The corpora used were gathered from a variety of sources, including economic, political, social, and related publications, newspapers, and magazines. In this model, the users can perform all spelling-related issues within a single system (all-in-one). That means if the user(s) is (are) working on the Amharic language and then he/she/they can change the language she/he/they prefer(s) without shifting to another graphical user interface (GUI). Here, the users can save time and perform their tasks easily. Similarly, the user(s) can improve their skills in the selected languages accordingly. Finally, precision, recall, and f-measures for each language have been computed following a successful evaluation of the model. The system outperforms an f-measure of 89.57%, 87.57%, 88.31%, 86.83%, 81.83%, and 87.59% for Amharic, Afan Oromo, Tigrinya, Hadiyyisa, Kambatissa, and Awngi languages respectively. Furthermore, recommendations have been provided for future researchers.
  • Detection and prevention of SQLI attacks and developing compressive framework using machine learning and hybrid techniques
    Wubetu Barud Demilie, Fitsum Gizachew Deriba
    Journal of Big Data, 2022
    A web application is a software system that provides an interface to its users through a web browser on any operating system (OS). Despite their growing popularity, web application security threats have become more diverse, resulting in more severe damage. Malware attacks, particularly SQLI attacks, are common in poorly designed web applications. This vulnerability has been known for more than two decades and is still a source of concern. Accordingly, different techniques have been proposed to counter SQLI attacks. However, the majority of them either fail to cover the entire scope of the problem. The structured query language injection (SQLI) attack is among the most harmful online application attacks and often happens when the attacker(s) alter (modify), remove (delete), read, and copy data from database servers. All facets of security, including confidentiality, data integrity, and data availability, can be impacted by a successful SQLI attack. This paper investigates common SQLI attack forms, mechanisms, and a method of identifying, detecting, and preventing them based on the existence of the SQL query. Here, we have developed a comprehensive framework for detecting and preventing the effectiveness of techniques that address specific issues following the essence of the SQLI attacks by using traditional Navies Bayes (NB), Decision Trees (DT), Support Vectors Machine (SVM), Random Forests (RF), Logistic Regression (LR), and Neural Networks Based on Multilayer Perceptron (MLP), and hybrid approach are used for our study. The machine learning (ML) algorithms were implemented using the Keras library, while the classical methods were implemented using the Tensor Flow-Learn package. For this proposed research work, we gathered 54,306 pieces of data from weblogs, cookies, session usage, and from HTTP (S) request files to train and test our model. The performance evaluation results for training set in metrics such as the hybrid approach (ANN and SVM) perform better accuracies in precision (99.05% and 99.54%), recall (99.65% and 99.61%), f1-score (99.35% and 99.57%), and training set (99.20% and 99.60%) respectively than other ML approaches. However, their training time is too high (i.e., 19.62 and 26.16 s respectively) for NB and RF. Accordingly, the NB technique performs poorly in accuracy, precision, recall, f1-score, training set evaluation metrics, and best in training time. Additionally, the performance evaluation results for test set in metrics such as hybrid approach (ANN and SVM) perform better accuracies in precision (98.87% and 99.20%), recall (99.13% and 99.47%), f1-score (99.00% and 99.33%) and test set (98.70% and 99.40%) respectively than other ML approaches. However, their test time is too high (i.e., 11.76 and 15.33 ms respectively). Accordingly, the NB technique performs poorly in accuracy, precision, recall, f1-score, test set evaluation metrics, and best in training time. Here, among the implemented ML techniques, SVM and ANN are weak learners. The achieved performance evaluation results indicated that the proposed SQLI attack detection and prevention mechanism has been improved over the previously implemented techniques in the theme. Finally, in this paper, we aimed to keep researchers up-to-date, with contributions, and recommendations to the understanding of the intersection between SQLI attacks and prevention in the artificial intelligence (AI) field.
  • Detection of fake news and hate speech for Ethiopian languages: a systematic review of the approaches
    Wubetu Barud Demilie, Ayodeji Olalekan Salau
    Journal of Big Data, 2022
    With the proliferation of social media platforms that provide anonymity, easy access, online community development, and online debate, detecting and tracking hate speech has become a major concern for society, individuals, policymakers, and researchers. Combating hate speech and fake news are the most pressing societal issues. It is difficult to expose false claims before they cause significant harm. Automatic fact or claim verification has recently piqued the interest of various research communities. Despite efforts to use automatic approaches for detection and monitoring, their results are still unsatisfactory, and that requires more research work in the area. Fake news and hate speech messages are any messages on social media platforms that spread negativity in society about sex, caste, religion, politics, race, disability, sexual orientation, and so on. Thus, the type of massage is extremely difficult to detect and combat. This work aims to analyze the optimal approaches for this kind of problem, as well as the relationship between the approaches, dataset type, size, and accuracy. Finally, based on the analysis results of the implemented approaches, deep learning (DL) approaches have been recommended for other Ethiopian languages to increase the performance of all evaluation metrics from different social media platforms. Additionally, as the review results indicate, the combination of DL and machine learning (ML) approaches with a balanced dataset can improve the detection and combating performance of the system.
  • Comparative Analysis of Automated Text Summarization Techniques: The Case of Ethiopian Languages
    Wubetu Barud Demilie
    Wireless Communications and Mobile Computing, 2022
    Nowadays, there is an abundance of information available from both online and offline sources. For a single topic, we can get more than hundreds of sources containing a wealth of information. The ability to extract or generate a summary of popular content allows users to quickly search for content and obtain preliminary data in the shortest amount of time. Manually extracting useful information from them is a difficult task. Automatic text summarization (ATS) systems are being developed to address this issue. Text summarization is the process of extracting useful information from large documents and compressing it into a summary while retaining all the relevant contents. This review paper provides a broad overview of ATS research works in various Ethiopian languages such as Amharic, Afan Oromo, and Tigrinya using different text summarization approaches. The work has identified the novel and recommended state-of-the-art techniques and methods for future researchers in the area and provides knowledge and useful support to new researchers in this field by providing a concise overview of the various feature extraction methods and classification techniques required for different types of ATS approaches applied to the Ethiopian languages. Finally, different recommendations for future researchers have been forwarded.
  • Evaluation of Part of Speech Tagger Approaches for the Amharic Language: A Review
    Wubetu Barud Demilie, Ayodeji Olalekan Salau, Kiran Kumar Ravulakollu
    Proceedings of the 2022 9th International Conference on Computing for Sustainable Global Development Indiacom 2022, 2022
    Accurately tagging correct grammar for individual words in a sentence is a critical task for natural language processing applications. Different deep and machine learning-oriented approaches to Part of Speech Tagger (POST) have recently been deployed as promising methods for identifying words in a phrase or sentence. This work presents the detailed concepts of POST research work on the Amharic language. Additionally, a comprehensive comparison of well-known deep and machine learning-oriented approaches was used in the development and implementation of POST for the language. A complete assessment of all published POST research works on the language is presented, together with a discussion of the proposed methods' performance, with a remark. Then, in terms of the recommended methodologies used and their performance evaluation criteria, recent developments and advancements in deep and machine learning oriented parts of speech taggers are described. Finally, we gave future recommendations for study in developing deep and machine learning-oriented POST using the results of the proposed methodologies based on their performances.
  • Development of a Compressive Framework Using Machine Learning Approaches for SQL Injection Attacks
    Fitsum Deriba
    Przeglad Elektrotechniczny, 2022
  • Artificial Intelligence Technologies: Applications, Threats, and Future Opportunities
    Ceur Workshop Proceedings, 2021

RECENT SCHOLAR PUBLICATIONS

  • News Classification in Low‐Resource Languages: Insights From Transformer and Baseline Models
    WB Demilie, A Zia
    Concurrency and Computation: Practice and Experience 38, e70648 , 2026
    2026
  • Plant disease detection and classification techniques: a comparative study of the performances
    WB Demilie
    Journal of Big Data 11 (1), 5 , 2024
    2024
    Citations: 455
  • Artificial intelligence assisted decision making in predicting COVID-19 patient’s path
    FG Deriba, AO Salau, BT Tefera, WB Demilie
    J. Pharm. Negat. Results 14, 1250-1255 , 2023
    2023
    Citations: 13
  • Detection and prevention of SQLI attacks and developing compressive framework using machine learning and hybrid techniques
    WB Demilie, FG Deriba
    Journal of Big Data 9 (1), 124 , 2022
    2022
    Citations: 68
  • Automated all in one misspelling detection and correction system for Ethiopian languages
    WB Demilie, AO Salau
    Journal of Cloud Computing 11 (1), 48 , 2022
    2022
    Citations: 6
  • Detection of fake news and hate speech for Ethiopian languages: a systematic review of the approaches
    WB Demilie, AO Salau
    Journal of big Data 9 (1), 66 , 2022
    2022
    Citations: 63
  • Artificial Intelligence Technologies: Applications, Threats, and Future Opportunities.
    AO Salau, WB Demilie, AT Akindadelo, JN Eneh
    ACI@ ISIC, 265-273 , 2022
    2022
    Citations: 30
  • Evaluation of part of speech tagger approaches for the amharic language: a review
    WB Demilie, AO Salau, KK Ravulakollu
    2022 9th International Conference on Computing for Sustainable Global … , 2022
    2022
    Citations: 5
  • Detection and prevention of SQLI attacks and developing compressive framework using machine learning and hybrid techniques. J Big Data 9 (1)
    WB Demilie, FG Deriba
    2022
    Citations: 10
  • Comparative Analysis of Automated Text Summarization Techniques: The Case of Ethiopian Languages
    WB Demilie
    Wireless Communications and Mobile Computing 2022, 1-28 , 2022
    2022
    Citations: 8
  • Development of a compressive framework using machine learning approaches for SQL injection attacks
    FG Deriba, AO Salau, TM Kassa, WB Demilie
    Przegląd Elektrotechniczny 98 , 2022
    2022
    Citations: 42
  • Implementation of Big Data in Educational Sectors (SWOT Analysis): The Case of Ethiopian Higher Institutions
    WB Demilie
    Drugs and Cell Therapies in Hematology 10 (1), 216-230 , 2021
    2021
  • Analysis of Implemented Part of Speech Tagger Approaches: The Case of Ethiopian Languages
    WB Demilie
    INDIAN JOURNAL OF SCIENCE AND TECHNOLOGY 13 (48), 11 , 2020
    2020
    Citations: 14
  • Multilingual Spelling Checker for Selected Ethiopian Languages
    WB Demilie
    International Journal of Advanced Science and Technology 29 (7), 8 , 2020
    2020
    Citations: 7
  • Implemented Stemming Algorithms for Information Retrieval Applications
    WB Demilie
    Journal of Information Engineering and Applications 10 (3), 6 , 2020
    2020
    Citations: 7
  • Why University Students Fail in Most Computer Programming Courses: The Case of Wachemo University-Student-Teacher Perspective
    WB Demilie
    2020
    Citations: 1
  • Strategies for Improving Academic Performance of Information Technology Department Students’ in Computer Programming Skills : The Case of Wachemo University
    WB Demilie
    International Journal of Scientific Research in Computer Science … , 2020
    2020
  • Implemented Stemming Algorithms for Six Ethiopian Languages
    WB Demilie
    2020
  • Causes of Failure of University Students in Computer Programming Courses : The Case of Wachemo University
    WB Demilie
    International Journal of Scientific Research in Computer Science … , 2019
    2019
    Citations: 13
  • Parts of speech tagger for Awngi language
    WB Demilie
    Int J Eng Sci Comput 9 (1) , 2019
    2019
    Citations: 10

MOST CITED SCHOLAR PUBLICATIONS

  • Plant disease detection and classification techniques: a comparative study of the performances
    WB Demilie
    Journal of Big Data 11 (1), 5 , 2024
    2024
    Citations: 455
  • Detection and prevention of SQLI attacks and developing compressive framework using machine learning and hybrid techniques
    WB Demilie, FG Deriba
    Journal of Big Data 9 (1), 124 , 2022
    2022
    Citations: 68
  • Detection of fake news and hate speech for Ethiopian languages: a systematic review of the approaches
    WB Demilie, AO Salau
    Journal of big Data 9 (1), 66 , 2022
    2022
    Citations: 63
  • Development of a compressive framework using machine learning approaches for SQL injection attacks
    FG Deriba, AO Salau, TM Kassa, WB Demilie
    Przegląd Elektrotechniczny 98 , 2022
    2022
    Citations: 42
  • Artificial Intelligence Technologies: Applications, Threats, and Future Opportunities.
    AO Salau, WB Demilie, AT Akindadelo, JN Eneh
    ACI@ ISIC, 265-273 , 2022
    2022
    Citations: 30
  • Analysis of Implemented Part of Speech Tagger Approaches: The Case of Ethiopian Languages
    WB Demilie
    INDIAN JOURNAL OF SCIENCE AND TECHNOLOGY 13 (48), 11 , 2020
    2020
    Citations: 14
  • Artificial intelligence assisted decision making in predicting COVID-19 patient’s path
    FG Deriba, AO Salau, BT Tefera, WB Demilie
    J. Pharm. Negat. Results 14, 1250-1255 , 2023
    2023
    Citations: 13
  • Causes of Failure of University Students in Computer Programming Courses : The Case of Wachemo University
    WB Demilie
    International Journal of Scientific Research in Computer Science … , 2019
    2019
    Citations: 13
  • Detection and prevention of SQLI attacks and developing compressive framework using machine learning and hybrid techniques. J Big Data 9 (1)
    WB Demilie, FG Deriba
    2022
    Citations: 10
  • Parts of speech tagger for Awngi language
    WB Demilie
    Int J Eng Sci Comput 9 (1) , 2019
    2019
    Citations: 10
  • Comparative Analysis of Automated Text Summarization Techniques: The Case of Ethiopian Languages
    WB Demilie
    Wireless Communications and Mobile Computing 2022, 1-28 , 2022
    2022
    Citations: 8
  • Multilingual Spelling Checker for Selected Ethiopian Languages
    WB Demilie
    International Journal of Advanced Science and Technology 29 (7), 8 , 2020
    2020
    Citations: 7
  • Implemented Stemming Algorithms for Information Retrieval Applications
    WB Demilie
    Journal of Information Engineering and Applications 10 (3), 6 , 2020
    2020
    Citations: 7
  • Automated all in one misspelling detection and correction system for Ethiopian languages
    WB Demilie, AO Salau
    Journal of Cloud Computing 11 (1), 48 , 2022
    2022
    Citations: 6
  • Evaluation of part of speech tagger approaches for the amharic language: a review
    WB Demilie, AO Salau, KK Ravulakollu
    2022 9th International Conference on Computing for Sustainable Global … , 2022
    2022
    Citations: 5
  • Why University Students Fail in Most Computer Programming Courses: The Case of Wachemo University-Student-Teacher Perspective
    WB Demilie
    2020
    Citations: 1
  • News Classification in Low‐Resource Languages: Insights From Transformer and Baseline Models
    WB Demilie, A Zia
    Concurrency and Computation: Practice and Experience 38, e70648 , 2026
    2026
  • Implementation of Big Data in Educational Sectors (SWOT Analysis): The Case of Ethiopian Higher Institutions
    WB Demilie
    Drugs and Cell Therapies in Hematology 10 (1), 216-230 , 2021
    2021
  • Strategies for Improving Academic Performance of Information Technology Department Students’ in Computer Programming Skills : The Case of Wachemo University
    WB Demilie
    International Journal of Scientific Research in Computer Science … , 2020
    2020
  • Implemented Stemming Algorithms for Six Ethiopian Languages
    WB Demilie
    2020