Published in Vol 11 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/72005.
Cancer Diagnosis Categorization in Electronic Health Records Using Large Language Models and BioBERT: Model Performance Evaluation Study

Soheil Hashtarkhani 1, PhD; Rezaur Rashid 1, PhD; Christopher L Brett 2, MD; Lokesh Chinthala 1, MSc; Fekede Asefa Kumsa 1, PhD; Janet A Zink 1, PhD; Robert L Davis 1, MPH, MD; David L Schwartz 1,3, MD; Arash Shaban-Nejad 1, MPH, PhD

1 Center for Biomedical Informatics, Department of Pediatrics, College of Medicine, University of Tennessee Health Science Center, Memphis, TN, United States

2 University of Tennessee Graduate School of Medicine, Knoxville, TN, United States

3 Departments of Radiation Oncology and Preventive Medicine, College of Medicine, University of Tennessee Health Science Center, Memphis, TN, United States

Corresponding Author:

  • Arash Shaban-Nejad, MPH, PhD
  • Center for Biomedical Informatics, Department of Pediatrics
  • College of Medicine, University of Tennessee Health Science Center
  • 50 N Dunlap Street
  • Memphis, TN 38103
  • United States
  • Phone: 1 9012875836
  • Email: ashabann@uthsc.edu