Predicting clinical outcomes of radiotherapy for head and neck squamous cell carcinoma patients using machine learning algorithms

Tarun Gangil, Amina Beevi Shahabuddin, B. Dinesh Rao, Krishnamoorthy Palanisamy, Biswaroop Chakrabarti, Krishna Sharan

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)


Background: Radiotherapy is frequently used to treat head and neck Squamous cell carcinomas (HNSCC). Treatment outcomes being highly uncertain, there is a significant need for robust predictive tools to improvise treatment decision-making and better understand HNSCC by recognizing hidden patterns in data. We conducted this study to identify if Machine Learning (ML) could accurately predict outcomes and identify new prognostic variables in HNSCC. Method: Retrospective data of 311 HNSCC patients treated with radiotherapy between 2013 and 2018 at our center and having a follow-up of at least three months' duration were collected. Binary-classification prediction models were developed for: Choice of Initial Treatment, Residual disease, Locoregional Recurrence, Distant Recurrence, and Development of New Primary. Clinical data were pre-processed using Imputation, Feature selection, Minority Oversampling, and Feature scaling algorithms. A method to retain original characteristics of dataset in testing samples while performing minority oversampling is illustrated. The classification comparison was performed using Random Forest (RF), Kernel Support Vector Machine (KSVM), and XGBoost classification algorithms for each model. Results: For the choice of the initial treatment model, the testing accuracy was 84.58% using RF. The distant recurrence, locoregional recurrence, new-primary, and residual models had a testing accuracy (using KSVM) of 95.12%, 77.55%, 98.61%, and 92.25%, respectively. The important clinical determinants were identified using Shapely Values for each classification model, and the mean area under the curve (AUC) for the receiver operating curve was plotted. Conclusion: ML was able to predict several clinically relevant outcomes, and with additional clinical validation, could facilitate recognition of novel prognostic factors in HNSCC.

Original languageEnglish
Article number25
JournalJournal of Big Data
Issue number1
Publication statusPublished - 12-2022

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Hardware and Architecture
  • Computer Networks and Communications
  • Information Systems and Management


Dive into the research topics of 'Predicting clinical outcomes of radiotherapy for head and neck squamous cell carcinoma patients using machine learning algorithms'. Together they form a unique fingerprint.

Cite this