Leukemia diagnosis with machine learning ensemble from gene expression data

International Journal of Development Research

Volume: 
11
Article ID: 
22977
6 pages
Research Article

Leukemia diagnosis with machine learning ensemble from gene expression data

Jakelyne Machado Lima Silva, Joaquim dos Santos Costa, Edson Magalhaes da Costa, Maria Eliana da Silva Holanda,  Lucas Henrique Martins Soares, Dhian Kelson Leite de Oliveira, Fabrício Almeida Araújo, Guilherme Damasceno Silva, Isadora Mendes dos Santos, Gilberto Nerino de Souza Junior and Marcus de Barros Braga

Abstract: 

One of the great challenges of treating leukemia is targeting specific therapies for different categories. Classification models have been improved, making them decisive for improving the treatment of the disease. In this study, gene expression data was used and then different computational machine learning models were applied to establish the diagnosis of Acute Lymphoblastic Leukemia and Acute Myeloid Leukemia type leukemias. Three approaches, combined with data mining techniques, were used: one using a Support Vector Machine algorithm as core, the second one using an Artificial Neural Network and the third one using the Machine Learning Ensemble combination (Artificial Neural Network, Support Vector Machine, Random Forest, Gradient Boosting and k-NN). The Ensemble model achieved a consistent overall performance above 94% for five different learning algorithm evaluation metrics.

DOI: 
https://doi.org/10.37118/ijdr.22977.09.2021
Download PDF: