Enhancing General Language Models for Biomedical Text Retrieval via Diversified Prior Knowledge
dc.contributor.advisor | Huang, Jimmy | |
dc.contributor.author | Huang, Yizheng | |
dc.date.accessioned | 2023-12-08T14:42:10Z | |
dc.date.available | 2023-12-08T14:42:10Z | |
dc.date.issued | 2023-12-08 | |
dc.date.updated | 2023-12-08T14:42:09Z | |
dc.degree.discipline | Information Systems and Technology | |
dc.degree.level | Master's | |
dc.degree.name | MA - Master of Arts | |
dc.description.abstract | The thesis introduces the Diversified Prior Knowledge Enhanced General Language Model (DPK-GLM) to improve the efficacy of general language models in biomedical Information Retrieval (IR). General language models often struggle with biomedical data due to its specialized terminology and the need for precise matching. DPK-GLM tackles these challenges by integrating domain-specific knowledge, thereby enhancing the model's ability to understand and process biomedical information. The framework comprises three core components. The first, Knowledge-based Query Expansion, leverages authoritative biomedical databases to enrich search queries with domain-specific entities. The second, Aspect-based Filter, identifies documents that are highly relevant to the query. The third, Diversity-based Score Reweighting, re-ranks these filtered documents by combining similarity and diversity scores, yielding more accurate results. Experimental tests on public biomedical IR datasets confirm that DPK-GLM significantly improves retrieval performance. | |
dc.identifier.uri | https://hdl.handle.net/10315/41736 | |
dc.language | en | |
dc.rights | Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests. | |
dc.subject | Information technology | |
dc.subject | Artificial intelligence | |
dc.subject | Bioinformatics | |
dc.subject.keywords | Biomedical information retrieval | |
dc.subject.keywords | Text retrieval | |
dc.subject.keywords | Ranking | |
dc.subject.keywords | Deep learning | |
dc.title | Enhancing General Language Models for Biomedical Text Retrieval via Diversified Prior Knowledge | |
dc.type | Electronic Thesis or Dissertation |
Files
Original bundle
- Name: Huang_Yizheng_2023_Masters.pdf
- Size: 3.1 MB
- Format: Adobe Portable Document Format