Robust Prediction and Biomarker Discovery in Rare Cancers Using Interpretable Machine Learning

dc.contributor.advisor: Kim, Wooyoung
dc.contributor.author: Madasamy, Dhurka Rohini
dc.date.accessioned: 2026-02-05T19:28:00Z
dc.date.available: 2026-02-05T19:28:00Z
dc.date.issued: 2026-02-05
dc.date.submitted: 2025
dc.description: Thesis (Master's)--University of Washington, 2025
dc.description.abstract: Rare cancers such as Glioblastoma Multiforme (GBM, a rare brain cancer) pose persistent challenges in computational oncology due to limited data, biological noise, and difficulty in isolating disease-specific molecular signatures. Based on these constraints, this work began with the expectation that rare-cancer models would perform poorly. However, machine learning approaches on genomic data achieved unexpectedly strong accuracy, motivating investigation into whether this separability reflected genuine biology or artifactual signal. This thesis develops an interpretable machine learning framework that evaluates predictive robustness and isolates biologically meaningful biomarkers under extreme imbalance. Cascade Learning systematically removes broad cancer pathways and reveals biomarkers uniquely associated with the rare cancer, while SHAP-based interpretability aligns these genes with experimentally reported glioma biology. Complementary Tab2Image visualizations provide spatial confirmation of class separability, strengthening biological trust in the learned signal. Overall, this work provides a robust, biologically grounded, and ethically aligned pathway for rare-cancer biomarker discovery that emphasizes transparency, fairness, and accountability in scarce-data environments.
dc.embargo.terms: Open Access
dc.format.mimetype: application/pdf
dc.identifier.other: Madasamy_washington_0250O_29135.pdf
dc.identifier.uri: https://hdl.handle.net/1773/55093
dc.language.iso: en_US
dc.rights: none
dc.subject: Biomarker Discovery
dc.subject: Cascade Learning
dc.subject: Gene Expression Analysis
dc.subject: Glioblastoma Multiforme (GBM)
dc.subject: Interpretable Machine Learning
dc.subject: Rare Cancer
dc.subject: Computer science
dc.subject: Bioinformatics
dc.subject: Biomedical engineering
dc.subject.other: Computing and software systems
dc.title: Robust Prediction and Biomarker Discovery in Rare Cancers Using Interpretable Machine Learning
dc.type: Thesis

Files

Original bundle

Name: Madasamy_washington_0250O_29135.pdf
Size: 8.77 MB
Format: Adobe Portable Document Format