Comparative Analysis of Language Models on Augmented Low-Resource Datasets for Application in Question & Answering Systems
Abstract
This thesis aims to advance natural language processing (NLP) in question-answering (QA) systems for low-resource domains. It presents a comparative analysis of several pre-trained language models, evaluating the performance gains each achieves when fine-tuned on augmented data. The analysis addresses critical questions such as how effective synthetic data is, and how efficient data augmentation techniques are, at improving QA systems in specialized contexts. The study also develops a hybrid QA framework that can be integrated with a cloud-based information system. By combining targeted fine-tuning with advanced transformer models, this approach improves the functionality and applicability of QA systems in low-resource settings. The successful application of this method demonstrates the significant potential of specialized, AI-driven QA systems to adapt and thrive in specific environments.