Information Systems and Technology
Permanent URI for this collection
Browse
Browsing Information Systems and Technology by Subject "Bioinformatics"
Now showing 1 - 1 of 1
Results Per Page
Sort Options
Item Open Access Enhancing General Language Models for Biomedical Test Retrieval via Diversified Prior Knowledge(2023-12-08) Huang, Yizheng; Huang, JimmyThe thesis introduces the Diversified Prior Knowledge Enhanced General Language Model (DPK-GLM) to improve the efficacy of general language models in biomedical Information Retrieval (IR). General language models often struggle with biomedical data due to its specialized terminology and the need for precise matching. DPK-GLM tackles these challenges by integrating domain-specific knowledge, thereby enhancing the model's ability to understand and process biomedical information. The framework comprises three core components. The first, Knowledge-based Query Expansion, leverages authoritative biomedical databases to enrich search queries with domain-specific entities. The second, Aspect-based Filter, identifies documents that are highly relevant to the query. The third, Diversity-based Score Reweighting, re-ranks these filtered documents by combining similarity and diversity scores, yielding more accurate results. Experimental tests on public biomedical IR datasets confirm that DPK-GLM significantly improves retrieval performance.