Deep learning to convert unstructured CT pulmonary angiography reports into structured reports

Adam Spandorfer, Cody Branch, Puneet Sharma, Pooyan Sahbaee, U. Joseph Schoepf, James G. Ravenel, John W. Nance

Research output: Contribution to journalArticlepeer-review

18 Scopus citations


Background: Structured reports have been shown to improve communication between radiologists and providers. However, some radiologists are concerned about resultant decreased workflow efficiency. We tested a machine learning-based algorithm designed to convert unstructured computed tomography pulmonary angiography (CTPA) reports into structured reports. Methods: A self-supervised convolutional neural network-based algorithm was trained on a dataset of 475 manually structured CTPA reports. Labels for individual statements included “pulmonary arteries,” “lungs and airways,” “pleura,” “mediastinum and lymph nodes,” “cardiovascular,” “soft tissues and bones,” “upper abdomen,” and “lines/tubes.” The algorithm was applied to a test set of 400 unstructured CTPA reports, generating a predicted label for each statement, which was evaluated by two independent observers. Per-statement accuracy was calculated based on strict criteria (algorithm label counted as correct if the statement unequivocally contained content only related to that particular label) and a modified criteria, accounting for problematic statements, including typographical errors, statements that did not fit well into the classification scheme, statements containing content for multiple labels, etc. Results: Of the 4,157 statements, 3,806 (91.6%) and 3,986 (95.9%) were correctly labeled by the algorithm using strict and modified criteria, respectively, while 274 (6.6%) were problematic for the manual observers to label, the majority of which (n = 173) were due to more than one section being included in one statement. Conclusion: This algorithm showed high accuracy in converting free-text findings into structured reports, which could improve communication between radiologists and clinicians without loss of productivity and provide more structured data for research/data mining applications.

Original languageEnglish (US)
Article number37
JournalEuropean Radiology Experimental
Issue number1
StatePublished - Dec 1 2019


  • Artificial intelligence
  • Machine learning
  • Natural language processing
  • Structured reporting
  • Tomography (x-ray, computed)

ASJC Scopus subject areas

  • Radiology Nuclear Medicine and imaging


Dive into the research topics of 'Deep learning to convert unstructured CT pulmonary angiography reports into structured reports'. Together they form a unique fingerprint.

Cite this