Website of Mohammed ZAGANE

RESEARCH

In this page, you find details about my research works conducted during my doctoral studies, Post-Doctoral Research (HDR preparation) and current work. To foster transparency, reproducibility, and collaboration within the research community, I consistently make publically available all datasets, scripts, and tools utilized in my research. By sharing these resources, I aim to facilitate the replication of my findings, encourage further investigation, and stimulate innovative advancements in the field.

Doctoral studies

My doctoral dissertation was titled 'A Contribution to Software Vulnerability Prediction: A Code Metrics-Based Approach.' Within the scope of this thesis, I proposed approaches that leverage software metrics and ML/DL techniques for automatic prediction of software vulnerabilities. Automatic vulnerability prediction can significantly assist developers and minimize the resources allocated to addressing software security issues. These costs can be further reduced by accurately pinpointing the exact locations of vulnerabilities (vulnerable lines of code).

A key strength of the proposed approaches lies in their ability to utilize code metrics to quantify code slices that suggest the presence of vulnerabilities at a fine-grained level (a few lines of code). The work conducted in my thesis has resulted in the following publications:

Paper 1 :
- Title : Evaluating and Comparing Size, Complexity and Coupling Metrics as Web Applications Vulnerabilities Predictors
- Journal : International Journal of Information Technology and Computer Science(IJITCS)
- Authors : Mohammed Zagane , Mustapha Kamel Abdi
- URL : https://www.mecs-press.org/ijitcs/ijitcs-v11-n7/IJITCS-V11-N7-5.pdf
- DOI : 10.5815/ijitcs.2019.07.05
Paper 2 :
- Title : Deep Learning for Software Vulnerabilities Detection Using Code Metrics
- Journal : IEEE Access
- Authors : Mohammed Zagane , Mustapha Kamel Abdi and Mamdouh Alenezi
- URL : https://ieeexplore.ieee.org/iel7/6287639/8948470/09069943.pdf
- DOI : 10.1109/ACCESS.2020.2988557
Paper 3 :
- Title : A New Approach to Locate Software Vulnerabilities Using Code Metrics
- Journal :
- Authors : Mohammed Zagane , Mustapha Kamel Abdi and Mamdouh Alenezi
- URL : https://www.igi-global.com/article/a-new-approach-to-locate-software-vulnerabilities-using-code-metrics/256238
- DOI : 10.4018/IJSI.2020070106
Paper 4 :
- Title : Évaluation des Metriques de Couplage en Tant Qu’indicateurs de Vulnérabilités dans les Applications Web
- Journal : Communication Science et Technologie (COST)
- Authors : Mohammed Zagane and Mustapha Kamel Abdi
- DOI/URL : https://asjp.cerist.dz/en/article/150783
Tools :
- Title : ZM Source Code Metrics
- URL :
  CLI Version (Written in C++) https://github.com/mzagane/ZM_Source_Code_Metrics
  GUI Version (written in Java): https://github.com/mzagane/ZM_J_Code_Metrics
Datasets:
- Title : Slice-based Code Metrics Dataset
- URL :
  Version 1 (published with Paper 3) : https://github.com/mzagane/CMDataset
  Version 1 (published with Paper 2) : https://github.com/mzagane/slice-based_code_metrics_dataset

Post-Doctoral research (HDR Preparation)

Following my doctoral studies, I sought to address the limitations of software metric-based approaches, which are specific to the domain of software engineering. To this end, I drew inspiration from the field of Natural Language Processing (NLP), where traditional and deep learning techniques have yielded impressive results. This inspiration was motivated by the similarities between software source code and natural language: both exhibit syntactic and semantic characteristics, as well as a defined vocabulary. In collaboration with Saudi researchers, I proposed a novel approach [Paper 5] that leverages Word Embeddings, a widely used technique in NLP, for vulnerability prediction. Moreover, in the primary article of my HDR [Paper 6], I introduced another approach inspired by NLP that employs Term Frequency-Inverse Document Frequency (TF-IDF), a common technique in both NLP and information retrieval. This approach enables the automatic extraction of relevant attributes and the construction of effective vulnerability prediction models. To demonstrate the efficacy of this proposed approach, I conducted comparative studies with traditional software metrics and found that the automatically extracted attributes significantly outperformed these metrics."

These works have led to the following publications and Datasets:

Paper 5 :
- Title : Efficient Deep Features Learning for Vulnerability Detection Using Character N-Gram Embedding
- Journal : Jordanian Journal of Computers and Information Technology (JJCIT)
- Authors : Mamdouh Alenezi and Mohammed Zagane
Paper 6 :
- Title : Hybrid Representation to Locate Vulnerable Lines of Code
- Journal : International Journal of Software Innovation (IJSI)
- Authors : Mohammed Zagane , Mustapha Kamel Abdi and Mamdouh Alenezi
- URL : https://www.igi-global.com/article/hybrid-representation-to-locate-vulnerable-lines-of-code/292020
- DOI : 10.4018/IJSI.292020
Conference Paper 1 :
- Title : Automatic Feature Extraction Method for Software Vulnerability Prediction
- Conference : 1st National Conference on Applied Computing and Smart Technologies, SBA, Algeria
- Authors : Mohammed Zagane , Mustapha Kamel Abdi and Mamdouh Alenezi
Conference Paper 2 :
- Title : Apprentissage Profond pour la Détection du Code Source Vulnérable : Approche inspirée du domaine de TLN
- Conference : Mathématiques et Informatique Appliquée aux Sciences, MIAS’2022, Université de Tamanghasset, Algeria
- Authors : Mohammed Zagane and AbdErrahim Zagane
Datasets and Tools:
- Title : Char N-gram Embedding Dataset for DL-based AVP (Published with Paper 5)
- URL : https://github.com/dzresearcher/char_n-gram_embedding_dataset_for_DL_AVP
- Title : TF-IDF-Based AVP Dataset (Published with Paper 6)
- URL : https://github.com/researcherdz29/TF-IDF-Based-Features

Current work (Post-HDR research)

In addition to my continuing research in automated vulnerability prediction (AVP), I have diversified my research interests to encompass the application of machine learning (ML) and deep learning (DL) techniques to tackle challenging problems within other software engineering subfields. Notably, I have conducted research in software change management, where I have introduced a novel hybrid approach that leverages both software metrics and word embeddings to significantly improve the performance of co-change prediction models. This work is detailed in [paper 7].

Paper 7 :
- Title : Enhancing Software Co-Change Prediction: Leveraging Hybrid Approaches for Improved Accuracy
- Journal : IEEE Access
- Authors : Mohammed Zagane and Mamdouh Alenezi

Furthermore, I have played a pivotal role in establishing the LABTEC-IA research laboratory and serve as the head of the AIIS team. As I move forward, my primary objective is to mentor PhD students and guide their research in exploring the potential of emerging AI technologies to address critical societal challenges such as cybersecurity and food security."