Semantic concept extraction from electronic medical records for enhancing information retrieval performance

dc.contributor.advisorHuang, Jimmy
dc.creatorKasperowicz, Dawid
dc.date.accessioned2016-09-13T13:13:11Z
dc.date.available2016-09-13T13:13:11Z
dc.date.copyright2013-06
dc.degree.disciplineInformation Systems and Technology
dc.degree.levelMaster's
dc.degree.nameMA - Master of Arts
dc.description.abstractWith the healthcare industry increasingly using EMRs, there emerges an opportunity for knowledge discovery within the healthcare domain that was not possible with paper-based medical records. One such opportunity is to discover UMLS concepts from EMRs. However, with opportunities come challenges that need to be addressed. Medical verbiage is very different from common English verbiage and it is reasonable to assume extracting any information from medical text requires different protocols than what is currently used in common English text. This thesis proposes two new semantic matching models: Term-Based Matching and CUI-Based Matching. These two models use specialized biomedical text mining tools that extract medical concepts from EMRs. Extensive experiments to rank the extracted concepts are conducted on the University of Pittsburgh BLULab NLP Repository for the TREC 2011 Medical Records track dataset that consists of 101,711 EMRs that contain concepts in 34 predefined topics. This thesis compares the proposed semantic matching models against the traditional weighting equations and information retrieval tools used in the academic world today.
dc.identifier.urihttp://hdl.handle.net/10315/31910
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subject.keywordsElectronic medical records
dc.subject.keywordsEMR
dc.subject.keywordsInformation retrieval
dc.subject.keywordsMedical verbiage
dc.subject.keywordsExtracting
dc.subject.keywordsTerm-based matching model
dc.subject.keywordsCUI-based matching model
dc.titleSemantic concept extraction from electronic medical records for enhancing information retrieval performance
dc.typeElectronic Thesis or Dissertation

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kasperowicz_Dawid_2013_Masters.pdf
Size:
3.52 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
license.txt
Size:
1.83 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
YorkU_ETDlicense.txt
Size:
3.36 KB
Format:
Plain Text
Description: