Princess Margaret Cancer Centre, Toronto, ON, Canada
Baijiang Yuan, Muammar Kabir, Kevin He, Benjamin Grant, Sharon Narine, Rami Ajaj, Luna Jia Zhan, Aly Fawzy, Janine Xu, Yuhua Zhang, Vivien Yu, Conor French, Wei Xu, Rahul G. Krishnan, Steven Gallinger, Monika K. Krzyzanowska, Tran Truong, Geoffrey Liu, Robert C Grant
Background: Cancer and its treatments often cause symptoms. Automated warning systems could mitigate symptoms by alerting healthcare teams and enabling personalized preventative interventions. We developed a general-purpose longitudinal system for predicting symptomatic deterioration among outpatients undergoing intravenous systemic anti-cancer therapy. Methods: Patients treated for aerodigestive cancers at the Princess Margaret Cancer Centre were randomly divided into development and testing cohorts. For each treatment, machine learning was applied to preceding electronic medical record (EMR) data to predict patient-reported symptom deterioration, defined as at least a four point worsening on the Edmonton Symptom Assessment Scale. Features included diagnostic and treatment characteristics, laboratory tests, and patient-reported symptoms. Single-task (e.g., LASSO and XGboost) and multi-task (e.g., temporal CNNs, LSTM and Transformer) models were trained, tuned, and evaluated based on discrimination, calibration, and net benefit. Results: The cohort consisted of 3,998 patients who underwent 45,904 treatment sessions, with data across 400 features. Among these patients, 1,547 (38.6%) were female; median age was 64.0 (interquartile range 13.0). The most common diagnoses were lung (1,505, 37.6%), head and neck (696, 17.4%), and pancreatic cancers (685, 17.1%). The best model, a multi-task transformer, predicted symptom deterioration with an AUROC range of 0.732-0.822, marking a 1.4-6.2% improvement over the best single-task model. At a 10% alert rate, treatments associated with alerts would be enriched 4-13 fold for symptom deterioration (P<0.001). The system was calibrated and would provide a net benefit across a wide range of threshold probabilities in decision curve analysis. Conclusions: Longitudinal general-purpose multi-task machine learning systems trained using EMR data can accurately predict a wide range of symptoms. Based on these results, automated warning systems for symptoms should be implemented and evaluated in real-time clinical practice to guide preventative interventions.
Symptom Deteriorationa | Treatment-level Rate | Patient-level Rate | AUROC (95% CI) | Precisionb | Sensitivityb |
---|---|---|---|---|---|
Pain | 3.47% | 19.33% | 0.774 (0.758, 0.791) | 0.3700 | 0.3835 |
Fatigue | 4.22% | 24.10% | 0.758 (0.742, 0.776) | 0.5929 | 0.5226 |
Nausea | 2.99% | 15.43% | 0.810 (0.790, 0.826) | 0.1343 | 0.4885 |
Depression | 3.07% | 16.68% | 0.822 (0.804, 0.838) | 0.1546 | 0.5915 |
Anxious | 2.69% | 15.03% | 0.812 (0.794, 0.832) | 0.1321 | 0.3624 |
Drowsy | 4.38% | 24.15% | 0.784 (0.768, 0.800) | 0.1779 | 0.3383 |
Appetite | 4.77% | 24.63% | 0.760 (0.744, 0.775) | 0.1306 | 0.4620 |
WellBeing | 2.85% | 17.53% | 0.732 (0.707, 0.755) | 0.1529 | 0.4851 |
Dyspnea | 3.38% | 17.80% | 0.803 (0.785, 0.818) | 0.1352 | 0.4454 |
a Increase ≥4 ESAS points within 30 days b At the threshold corresponding to alerts with 10% of treatments
Disclaimer
This material on this page is ©2024 American Society of Clinical Oncology, all rights reserved. Licensing available upon request. For more information, please contact licensing@asco.org
Abstract Disclosures
2023 ASCO Annual Meeting
First Author: Hossein Honarvar
2023 ASCO Quality Care Symposium
First Author: Tyler Raclin
2023 ASCO Annual Meeting
First Author: Yue Yu
2023 ASCO Annual Meeting
First Author: Deep K. Hathi