In Silico Identification of Effective Genes for Acute Leukemia Classification Using a Spline Regression-based Framework

Yazdanparast, Maryam; Sheikhpour, Razieh; Zangeneh Soroush, Morteza; Ghanizadeh, Fatemeh

doi:10.18502/ijpho.v14i2.15269

Volume 14, Issue 2 (3-2024) Iran J Ped Hematol Oncol 2024, 14(2): 104-115 | Back to browse issues page

‎ 10.18502/ijpho.v14i2.15269

Ethics code: IR.SSU.MEDICINE.REC.1401.143

Mendeley

Zotero

RefWorks

Yazdanparast M, Sheikhpour R, Zangeneh Soroush M, Ghanizadeh F. In Silico Identification of Effective Genes for Acute Leukemia Classification Using a Spline Regression-based Framework. Iran J Ped Hematol Oncol 2024; 14 (2) :104-115
URL: http://ijpho.ssu.ac.ir/article-1-845-en.html

In Silico Identification of Effective Genes for Acute Leukemia Classification Using a Spline Regression-based Framework

Maryam Yazdanparast

, Razieh Sheikhpour ^*

, Morteza Zangeneh Soroush

, Fatemeh Ghanizadeh

Department of Computer Engineering, Faculty of Engineering, Ardakan University, P.O. Box 184, Ardakan, Iran

Abstract: (1161 Views)

Background: Microarray technology enables the examination of gene expression in thousands of genes and can be highly effective in identifying various types of cancers, including leukemia. However, many genes in microarray data are redundant and lack useful information for cancer diagnosis. The main objective of this study is to identify relevant and effective genes in classification of leukemia microarray data using a spline regression-based method, taking into account the correlation between genes.
Materials and Methods: In this analytical study, leukemia microarray data are used to identify relevant genes in classification of leukemia into Acute Myeloid Leukemia (AML) and Acute Lymphoblastic Leukemia (ALL) using a spline regression-based gene selection method, called SRS³FS based on ℓ_2,p-norm (0 < p ≤ 1). Subsequently, the support vector machine (SVM) algorithm is employed to classify leukemia data into AML and ALL.
Results: In this study, the classification results of SVM algorithm for 5, 10, 15, and 20 genes reveal that the SRS³FS method, employing ℓ_2,1/4-norm, ℓ_2,1/2-norm and ℓ_2,3/4-norm, exhibited the highest accuracy of 97.06% when identifying 10 genes for distinguishing between AML and ALL. Moreover, the leukemia data was classified into AML and ALL with an accuracy of 100%, using a gene identified by the SRS³FS method based on ℓ_2,3/4-norm and ℓ_2,1-norm. The gene labeled as number 3252, annotated as GLUTATHIONE S-TRANSFERASE, MICROSOMAL, is recognized as the most important gene.
Conclusion: The experimental results on leukemia microarray data demonstrate that the spline regression-based gene selection method can effectively identify relevant genes in classification and prediction of leukemia.

Keywords: Acute lymphocytic leukemia, Acute myeloid leukemia, Gene expression, Sparse gene selection, Spline regression

Full-Text [PDF 679 kb] (422 Downloads)

Type of Study: Research | Subject: General
Received: 2024/01/5 | Accepted: 2024/03/1 | Published: 2024/03/20

Send email to the article author

Rights and permissions
	This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Related Websites

Site Keywords

Pediatric, Hematology, Oncology, journal, Shahid Sadoughi University of Medical Sciences Yazd

Vote

Iranian Journal of Pediatric Hematology and Oncology

Designed & Developed by : Yektaweb

How Do You Evaluate This Site?
	Excellent
	Good
	Average
	weak