Performance evaluation of a natural language processing tool to extract infectious disease problems

dc.contributor.advisorPayne, Thomas Hen_US
dc.contributor.authorMandel, Hannah Lilyen_US
dc.date.accessioned2013-11-14T20:51:46Z
dc.date.available2013-11-14T20:51:46Z
dc.date.issued2013-11-14
dc.date.submitted2013en_US
dc.descriptionThesis (Master's)--University of Washington, 2013en_US
dc.description.abstractUse of a complete problem list can benefit patient care, quality improvement initiatives, and research activities. However, it can be time consuming for physicians to enter the correct encoded problem from a standardized terminology. I evaluated Discern nCode, the natural language processing (NLP) system embedded in Cerner Powerchart at Harborview Medical Center (HMC), for its utility to add Infectious Diseases (ID) problems to the electronic medical record problem list, in comparison with the usual practice of physicians adding problems unaided by NLP. 74 ID consultation notes were annotated by human experts to create gold standard problem lists. NLP-extracted problems and problem list entries were recorded for each note. Recall, precision and f-measure were calculated for nCode and the problem list, and an error analysis was performed to describe false positives and missed concepts. Discern nCode's recall was .65 and precision was .14. Problem list recall was .10 and precision was .43. Many false negatives resulted from partial matches between NLP-extracted and reference standard problems. The majority of false positives were due to inclusion of past medical problems and non-ID problems; nearly 20% of false positives should not have been extracted. Discern nCode had significantly higher recall for ID problems than the problem list. Recommendations are provided for increasing system sensitivity and recall. Overall, nCode could be a useful facilitator of problem entry and result in higher problem list completeness, but recall should be increased.en_US
dc.embargo.termsNo embargoen_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.otherMandel_washington_0250O_12157.pdfen_US
dc.identifier.urihttp://hdl.handle.net/1773/24103
dc.language.isoen_USen_US
dc.rightsCopyright is held by the individual authors.en_US
dc.subjectBiomedical informatics; Clinical informatics; Electronic health records; Natural language processing; Problem listen_US
dc.subject.otherInformation technologyen_US
dc.subject.otherMedicineen_US
dc.subject.otherbiomedical and health informaticsen_US
dc.titlePerformance evaluation of a natural language processing tool to extract infectious disease problemsen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mandel_washington_0250O_12157.pdf
Size:
506.17 KB
Format:
Adobe Portable Document Format