Research Article

Enhanced Breast Cancer Risk Classification Through Genetic Algorithm-Based Feature Selection and Machine Learning Techniques

Volume: 46 Number: 2 June 30, 2025
EN

Enhanced Breast Cancer Risk Classification Through Genetic Algorithm-Based Feature Selection and Machine Learning Techniques

Abstract

Breast cancer remains one of the leading causes of mortality among women worldwide and represents a major global health challenge. Accurate classification of breast tumors as benign or malignant is therefore of critical importance for timely diagnosis and effective treatment. This study aims to enhance breast cancer risk classification by integrating machine learning (ML) techniques with a genetic algorithm-based feature selection method. Initially, multiple ML algorithms are applied to features extracted from digitized images obtained through fine-needle aspiration (FNA) of breast masses. Subsequently, a genetic algorithm-based feature selection approach is employed to identify a subset of the most discriminative features. The results demonstrate that ML models utilizing the feature subsets selected by the genetic algorithm consistently achieve higher classification accuracy compared to their baseline counterparts. This highlights the effectiveness of the proposed feature selection strategy in improving the discriminative capacity of ML models. Beyond the observed improvements in accuracy, the refined ML models developed in this study show potential for more precise and reliable breast cancer diagnoses. By enhancing the performance of ML-based decision support systems, the genetic algorithm-based feature selection approach may contribute to the advancement of personalized treatment strategies in breast cancer care.

Keywords

References

  1. [1] Lacoviello, L., Bonaccio, M., de Gaetano, G., and Donati, M. B. Epidemiology Of Breast Cancer, A Paradigm Of The “Common Soil” Hypothesis, Seminars In Cancer Biology, (2021) 4-10.
  2. [2] Siegel, R. L., Miller, K. D., and Jemal, A., Cancer Statistics, CA: A Cancer Journal For Clinicians, 69 (1) (2019), 7-34.
  3. [3] WHO. Available at https://www.who.int/news-room/fact-sheets/detail/breast-cancer. Retrieved March 10,2023.
  4. [4] Mridha, M. F., Hamid, M. A., Monowar, M. M., Keya, A. J., Ohi, A. Q., Islam, M. R., and Kim, J.-M., A Comprehensive Survey On Deep-Learning-Based Breast Cancer Diagnosis, Cancers, 13 (23) (2021) 6116.
  5. [5] Mehrotra, D., Basics Of Artificial Intelligence & Machine Learning, Notion Press, (2019).
  6. [6] Segal, M. R., Machine Learning Benchmarks And Random Forest Regression, (2004).
  7. [7] Blum, A. L. and Langley, P., Selection Of Relevant Features And Examples In Machine Learning, Artificial Intelligence, 97 (1-2) (1997) 245-271.
  8. [8] Chen, R.-C., Dewi, C., Huang, S.-W., and Caraka, R. E., Selecting Critical Features For Data Classification Based On Machine Learning Methods, Journal of Big Data, 7 (1) (2020) 52.

Details

Primary Language

English

Subjects

Biostatistics , Applied Statistics , Operation

Journal Section

Research Article

Publication Date

June 30, 2025

Submission Date

February 27, 2024

Acceptance Date

June 3, 2025

Published in Issue

Year 2025 Volume: 46 Number: 2

APA
Yonar, A., Yonar, H., & Özaltın, Ö. (2025). Enhanced Breast Cancer Risk Classification Through Genetic Algorithm-Based Feature Selection and Machine Learning Techniques. Cumhuriyet Science Journal, 46(2), 369-376. https://doi.org/10.17776/csj.1443598

As of 2026, Cumhuriyet Science Journal will be published in six issues per year, released in February, April, June, August, October, and December