Studying The Effectiveness Of Large Language Models In Benchmark Biomedical Tasks

Jahan, Israt

dc.contributor.advisor	Huang, Jimmy Xiangji
dc.contributor.author	Jahan, Israt
dc.date.accessioned	2024-10-28T13:37:20Z
dc.date.available	2024-10-28T13:37:20Z
dc.date.copyright	2024-07-26
dc.date.issued	2024-10-28
dc.date.updated	2024-10-28T13:37:19Z
dc.degree.discipline	Biology
dc.degree.level	Master's
dc.degree.name	MSc - Master of Science
dc.description.abstract	Recently, Large Language Models (LLMs) have demonstrated an impressive capability to solve a wide range of tasks. However, despite their success across various tasks, no prior work has investigated their capability in the biomedical domain. To this end, this thesis evaluates the performance of LLMs on benchmark biomedical tasks through a comprehensive evaluation of 4 popular LLMs on 6 diverse biomedical tasks across 26 datasets. Interestingly, this evaluation shows that on biomedical datasets with smaller training sets, zero-shot LLMs even outperform the current state-of-the-art models when those models were fine-tuned only on the training sets of these datasets. This suggests that pretraining on large text corpora makes LLMs quite specialized even in the biomedical domain. The findings also show that no single LLM outperforms the others in all tasks; the performance of different LLMs varies depending on the task. While their performance is still quite poor in comparison to biomedical models that were fine-tuned on large training sets, this study demonstrates that LLMs have the potential to be a valuable tool for various biomedical tasks that lack large annotated data.
dc.identifier.uri	https://hdl.handle.net/10315/42384
dc.language	en
dc.rights	Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subject	Biology
dc.subject	Artificial intelligence
dc.subject	Bioinformatics
dc.subject.keywords	Bioinformatics
dc.subject.keywords	Large language models
dc.subject.keywords	LLMs in biology
dc.subject.keywords	ChatGPT
dc.subject.keywords	Biomedical text processing tasks
dc.title	Studying The Effectiveness Of Large Language Models In Benchmark Biomedical Tasks
dc.type	Electronic Thesis or Dissertation

Files

Original bundle

Name: Jahan_Israt_2024_Masters.pdf
Size: 3.63 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.87 KB
Format: Plain Text

Name: YorkU_ETDlicense.txt
Size: 3.39 KB
Format: Plain Text

Collections

Biology