Evaluating generalizability of practice-changing randomized clinical trials in non-small cell lung cancer using machine learning-based in-silico trials.

Authors

null

Xavier Orcutt

Chinle Comprehensive Health Care Facility, Chinle, AZ

Xavier Orcutt , Ronac Mamtani , Arjun Sondhi , Aaron B. Cohen , Ravi Bharat Parikh

Organizations

Chinle Comprehensive Health Care Facility, Chinle, AZ, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, Flatiron Health, New York, NY

Research Funding

No funding received
None.

Background: Results of randomized clinical trials (RCTs) of anticancer agents are not generalizable to many real-world patients. Advances in machine learning (ML) and increasing availability of curated real-world data offer opportunities to assess generalizability by simulating trials “in-silico”. Our objective was to assess the generalizability of survival outcomes reported in 2 practice-changing phase III trials in first-line (1L) advanced non-small cell lung cancer (aNSCLC). Methods: Our cohort included patients from the nationwide Flatiron Health EHR-derived de-identified database diagnosed with stage IIIB-IV or recurrent aNSCLC between 2011 and 2020. First, we trained and validated supervised ML models (gradient boosted, random forest, support vector machine and penalized Cox) to predict 1-year survival for patients with aNSCLC; the Lung Cancer Prognostic Index (LCPI), a published disease-specific prognostic index, was used as a comparator model. We used 130 demographic, vital sign, laboratory, and biomarker features at aNSCLC diagnosis to build models. Second, we used the best-performing ML model to create 4 prognostic risk groups. Third, we simulated 2 seminal trials for 1L treatment of aNSCLC, using inverse probability of treatment-weighted survival analyses, coarsely reproducing inclusion/exclusion criteria across ML-derived risk groups. We compared median overall survival (mOS) using Kaplan-Meier curves from the start of 1L treatment to death in in-silico trials (ISTs) vs. RCTs. Results: Our cohort included 61,339 patients with aNSCLC. The best-performing gradient boosted model outperformed the LCPI (AUC 0.784 vs 0.688). In ISTs, survival benefits of novel treatments varied across risk groups and were generally lower in ISTs compared to RCTs (Table). IST results in high- and very high-risk patients were inconsistent with RCT survival results. For example, for KEYNOTE-024, mOS in the pembrolizumab arm was 30.0 months, whereas in the IST, mOS among patients receiving pembrolizumab varied from 1.3 months in very high-risk patients to 41.5 months in low-risk patients. RCT results overestimated treatment effects for high-risk aNSCLC patients. Conclusions: ML-based ISTs can reveal heterogeneity in real-world survival outcomes associated with novel oncology treatments and elucidate populations for whom RCT results generalize poorly.

TrialRCT: mOS (mos) Treatment vs Control
(difference in mOS)
IST: mOS (mos) Treatment vs Control (difference in mOS) by Risk Group
All
(n = 61,069)
Very High
(n = 9,630)
High
(n = 22,123)
Medium
(n = 18,166)
Low
(n = 11,420)
KEYNOTE-024 (2016)30.0 vs 14.2
(15.8)
10.9 vs 5.0
(5.9)
1.3 vs 1.7
(-0.4)
5.6 vs 4.9
(0.7)
21.6 vs 9.4
(12.2)
41.5 vs 17.0
(24.5)
KEYNOTE-189 (2018)22.0 vs 10.7
(11.3)
14.0 vs 4.9
(9.1)
1.9 vs 1.5
(0.4)
6.6 vs 4.4
(2.2)
19.3 vs 9.2
(10.1)
43.1 vs 25.5
(17.6)

Disclaimer

This material on this page is ©2024 American Society of Clinical Oncology, all rights reserved. Licensing available upon request. For more information, please contact licensing@asco.org

Abstract Details

Meeting

2023 ASCO Annual Meeting

Session Type

Poster Session

Session Title

Lung Cancer—Non-Small Cell Metastatic

Track

Lung Cancer

Sub Track

Metastatic Non–Small Cell Lung Cancer

Citation

J Clin Oncol 41, 2023 (suppl 16; abstr 9130)

DOI

10.1200/JCO.2023.41.16_suppl.9130

Abstract #

9130

Poster Bd #

118

Abstract Disclosures