University of California, Irvine, Irvine, CA
Lu He, Matthew Moldenhauer, Kai Zheng, Helen Ma
Background: Free-text clinical narratives contain rich patient information that is labor-intensive to extract through manual chart review. We developed an NLP pipeline to automatically extract performance status (PS), staging, and diagnosis from clinical narratives of Veterans Affairs (VA) patients with lymphoid malignancies (LM).

Methods: A rule-based NLP algorithm was developed and iteratively refined on a development corpus of 287 notes independently annotated by two clinicians. On the development corpus, the F1-score was 95.8 for PS (precision 98.6, recall 93.2), 92.7 for staging (precision 94.0, recall 81.6), and 67 for diagnosis (precision 80.2, recall 57.9). The pipeline was then externally validated on an evaluation corpus of 97 notes from another group of 100 veterans with T-cell LM.

Results: In the 97 notes, primary diagnosis was the most routinely documented variable, with 2.76 mentions per note. Staging was the most sparsely documented, with only 34 mentions (11 patients with large granular lymphocytic leukemia were not staged). The pipeline performed relatively well in extracting PS and staging (F1-scores of 0.74 and 0.72, respectively) and achieved high precision in extracting diagnosis (0.93). However, recall for diagnosis was poor (0.44), likely due to the complexity and inconsistency of how LM diagnoses are documented. Frequency of documentation and performance on the external validation set are shown in the Table.

Conclusions: The pipeline showed promising performance on the external validation set, demonstrating the feasibility of using NLP to extract information from the notes of patients with LM for clinical research. Recall was generally lower than precision, indicating that the pipeline may miss clinical information that should be captured. False positives arose from entities easily confused with the clinical entities of interest, such as nutritional status versus performance status.
Future work includes capturing more lexical variations and documentation indicators, as well as contextual information, such as the note sections in which each element is likely to be documented. In addition, because a diagnosis may appear as a primary diagnosis, a secondary diagnosis, or part of a differential, we are building an NLP-based classifier to distinguish among these types. We will also use outputs from the rule-based NLP pipeline as labels to fine-tune transformer-based, weakly supervised models to further enhance performance.
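The abstract does not publish the pipeline's rules, but a minimal sketch of the rule-based extraction approach it describes might look like the following. The pattern set, entity names, and `extract_entities` helper are hypothetical illustrations, not the VA pipeline's actual rules:

```python
import re

# Hypothetical patterns for two of the target variables; the real pipeline
# would cover far more lexical variants (a limitation the abstract notes).
PATTERNS = {
    "performance_status": re.compile(
        r"\b(?:ECOG|performance status)\s*(?:of|:|=|is)?\s*([0-4])\b",
        re.IGNORECASE,
    ),
    "staging": re.compile(
        r"\bstage\s+(I{1,3}V?|[1-4])([AB])?\b",
        re.IGNORECASE,
    ),
}

def extract_entities(note: str) -> dict:
    """Return every pattern match found in one clinical note."""
    return {
        name: [m.group(0) for m in pattern.finditer(note)]
        for name, pattern in PATTERNS.items()
    }
```

A narrow pattern set like this illustrates why such pipelines tend toward high precision but lower recall: matches are rarely wrong, but undocumented phrasings are simply missed.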
Variable | Frequency | Precision¹ | Recall² | F1-score³ |
---|---|---|---|---|
Performance status | 49 | 73 | 76 | 74 |
Staging | 34 | 79 | 66 | 72 |
Diagnosis: Primary | 268 | – | – | – |
Diagnosis: Secondary | 45 | – | – | – |
Diagnosis: Differential | 65 | – | – | – |
Diagnosis: Combined | 378 | 93 | 44 | 60 |
¹Precision (P) = true positives (TP)/(TP + false positives [FP]); ²Recall (R) = TP/(TP + false negatives [FN]); ³F1 = 2 × P × R/(P + R).
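The F1 definition in footnote 3 can be checked directly against the table's rows (values in %, rounded to integers as in the table):

```python
def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, per footnote 3."""
    return 2 * precision * recall / (precision + recall)

# Performance status: P=73, R=76 -> F1 rounds to 74
# Staging:            P=79, R=66 -> F1 rounds to 72
# Diagnosis combined: P=93, R=44 -> F1 rounds to 60
```

All three reported F1-scores are consistent with the precision and recall columns.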