Efficient Text-Image Retrieval Using Large Language Models

dc.contributor.advisorYu, Xiaohui
dc.contributor.authorLiu, Jiahao
dc.date.accessioned2026-03-10T16:15:25Z
dc.date.available2026-03-10T16:15:25Z
dc.date.copyright2025-12-05
dc.date.issued2026-03-10
dc.date.updated2026-03-10T16:15:25Z
dc.degree.disciplineInformation Systems and Technology
dc.degree.levelMaster's
dc.degree.nameMA - Master of Arts
dc.description.abstractEfficient retrieval from large-scale image databases is a key challenge, particularly as applications increasingly rely on multimodal models such as CLIP. While CLIP offers strong joint image–text representations for semantic search, its globally pooled embeddings often struggle with fine-grained, multi-concept queries, leading to high false positives and reliance on costly verification models. To address this, we propose a hybrid framework that structures the embedding space through feature clustering and models candidate selection as a multi-armed bandit problem. Each cluster acts as an arm, with relevance scores from ground-truth systems as rewards. Using Thompson Sampling, this approach balances exploration and exploitation to quickly identify promising clusters, reducing unnecessary ground-truth queries. Experiments show that our method significantly improves precision and lowers computational costs in multi-keyword retrieval tasks, enabling scalable, fine-grained retrieval in resource-constrained settings. This structured, adaptive approach effectively enhances CLIP-based retrieval pipelines.
dc.identifier.urihttps://hdl.handle.net/10315/43614
dc.languageen
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subjectInformation technology
dc.subject.keywordsThompson sampling
dc.subject.keywordsImage retrieval
dc.subject.keywordsCLIP
dc.subject.keywordsContrastive Language-Image Pre-Training
dc.titleEfficient Text-Image Retrieval Using Large Language Models
dc.typeElectronic Thesis or Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Liu_Jiahao_2025_MA.pdf
Size:
10.68 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Description:
Loading...
Thumbnail Image
Name:
YorkU_ETDlicense.txt
Size:
3.39 KB
Format:
Plain Text
Description: