Automated classification of disease response in radiology reports using natural language processing.

Authors

null

Hai Xiang Ong

Nanyang Technological University, Singapore, Singapore

Hai Xiang Ong , Yiyu Cai , Zi Meng Yi , Wen Hui Tee , Ryan Shea Tan Ying Cong , Wei Chong Tan , Abu Bakr Azam

Organizations

Nanyang Technological University, Singapore, Singapore, National Cancer Centre Singapore, Singapore, Singapore

Research Funding

Other Government Agency
SingHealth Duke-NUS Academic Medicine Research Grant: Special Category (Artificial Intelligence Research)

Background: Classification of disease response is an essential task in cancer research and needs to be done at scale. Automating this process can improve efficiency in the generation of real-world evidence, potentially leading to better patient outcomes. We aim to develop and evaluate Natural Language Processing (NLP) models for this task. Methods: Using 6203 computed tomography (CT) and 1358 magnetic resonance imaging (MRI) reports from 587 patients with lung cancer of all stages seen at the National Cancer Centre Singapore (NCCS), we trained four NLP models (BioBERT, RadBERT-RoBERTA, BioClinicalBERT, GatorTron) to classify the reports into one of four categories: no evidence of disease, stable disease, partial response or disease progression. Model output was compared against human-curated ground truth and performance was evaluated by accuracy. Results: Of the 4 models, GatorTron performed the best (accuracy = 97.1%), followed by RadBERT-RoBERTA (accuracy = 96.2%), BioBERT (accuracy = 94.2%), with BioClinicalBERT being last (accuracy = 90.4%). NLP Model runtimes for the dataset were relatively short, with BioBERT and BioClinicalBERT taking 3 minutes per epoch, RadBERT-RoBERTA taking 6 minutes per epoch, and GatorTron taking 10 minutes per epoch on a single central processing unit (CPU). Conclusions: We have demonstrated the effectiveness of NLP models for classifying disease responses in radiology reports of lung cancer patients. This has the potential to help derive progression-free survival for real-world evidence generation.

Disclaimer

This material on this page is ©2024 American Society of Clinical Oncology, all rights reserved. Licensing available upon request. For more information, please contact licensing@asco.org

Abstract Details

Meeting

2023 ASCO Breakthrough

Session Type

Poster Session

Session Title

Poster Session B

Track

Gastrointestinal Cancer,Gynecologic Cancer,Head and Neck Cancer,Quality of Care,Genetics/Genomics/Multiomics,Healthcare Equity and Access to Care,Healthtech Innovations,Models of Care and Care Delivery,Population Health,Viral-Mediated Malignancies

Sub Track

Artificial Intelligence/Deep Learning

Citation

JCO Global Oncology 9, 2023 (suppl 1; abstr 108)

DOI

10.1200/GO.2023.9.Supplement_1.108

Abstract #

108

Poster Bd #

G3

Abstract Disclosures

Similar Abstracts