Toward Trustworthy Automated Data Story Generation: Benchmarking, Multi-Agent Generation and Bias Evaluation in Data Storytelling

dc.contributor.advisorPrince, Enamul Hoque
dc.contributor.authorIslam, Mohammed Saidul
dc.date.accessioned2025-11-11T20:08:47Z
dc.date.available2025-11-11T20:08:47Z
dc.date.copyright2025-08-20
dc.date.issued2025-11-11
dc.date.updated2025-11-11T20:08:46Z
dc.degree.disciplineComputer Science
dc.degree.levelMaster's
dc.degree.nameMSc - Master of Science
dc.description.abstractData-driven storytelling is a powerful method for conveying insights by combining narrative techniques with visualizations and text. In this thesis, we introduce a novel task for data story generation and a benchmark containing 1,449 stories from diverse sources. We propose a multi-step LLM agent framework mimicking the human storytelling process: one for planning and narration, and another for verification at each intermediary step. Results show that our proposed framework significantly outperforms non-agentic baselines. In parallel, we recognize that trustworthy storytelling must also be fair and unbiased. To this end, we conduct a largescale empirical study to uncover systematic geo-economic bias in the foundational subtask of data storytelling: producing narrative summaries of charts. We further explore inference-time debiasing strategies and highlight the need for more robust bias mitigation methods. Together, these contributions provide both a powerful generative system and a fairness-focused evaluation to ensure automated data storytelling is accurate, coherent, and ethically responsible.
dc.identifier.urihttps://hdl.handle.net/10315/43335
dc.languageen
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subjectComputer science
dc.subject.keywordsData storytelling
dc.subject.keywordsBias evaluation
dc.subject.keywordsLarge language models
dc.subject.keywordsLLM agents
dc.titleToward Trustworthy Automated Data Story Generation: Benchmarking, Multi-Agent Generation and Bias Evaluation in Data Storytelling
dc.typeElectronic Thesis or Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Islam_Mohammed_Saidul_2025_MSc.pdf
Size:
23.07 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Description:
Loading...
Thumbnail Image
Name:
YorkU_ETDlicense.txt
Size:
3.39 KB
Format:
Plain Text
Description:

Collections