Use of Visual Content for Inference and Response in Q/A Forums

Ahmed, Faiz

Use of Visual Content for Inference and Response in Q/A Forums

dc.contributor.advisor	Nayebi, Maleknaz
dc.contributor.author	Ahmed, Faiz
dc.date.accessioned	2025-07-23T15:24:15Z
dc.date.available	2025-07-23T15:24:15Z
dc.date.copyright	2025-05-27
dc.date.issued	2025-07-23
dc.date.updated	2025-07-23T15:24:15Z
dc.degree.discipline	Computer Science
dc.degree.level	Master's
dc.degree.name	MSc - Master of Science
dc.description.abstract	In the rapidly evolving landscape of developer communities, Q&A platforms serve as crucial resources for crowdsourcing developers' knowledge. A notable trend is the increasing use of images to convey complex queries more effectively. However, the current state-of-the-art method of duplicate question detection has not kept pace with this shift, which predominantly concentrates on text-based analysis. Inspired by advancements in image processing and numerous studies in software engineering illustrating the promising future of image-based communication on social coding platforms, we delved into image-based techniques for identifying duplicate questions on Stack Overflow. When focusing solely on text analysis of Stack Overflow questions and omitting the use of images, our automated models overlook a significant aspect of the question. Previous research has demonstrated the complementary nature of images to text. To address this, we implemented two methods of image analysis: first, integrating the text from images into the question text, and second, evaluating the images based on their visual content using image captions. After a rigorous evaluation of our model, it became evident that the efficiency improvements achieved were relatively modest, approximately an average of 1%. This marginal enhancement falls short of what could be deemed a substantial impact. As an encouraging aspect, our work lays the foundation for easy replication and hypothesis validation, allowing future research to build upon our approach and explore novel solutions for more effective image-driven duplicate question detection.
dc.identifier.uri	https://hdl.handle.net/10315/43072
dc.language	en
dc.rights	Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subject	Computer science
dc.subject	Artificial intelligence
dc.subject.keywords	Visual programming communication
dc.subject.keywords	Stack Overflow image analysis
dc.subject.keywords	Screenshot text extraction
dc.subject.keywords	Multimodal large language models
dc.subject.keywords	Programming screenshots
dc.subject.keywords	Code image processing
dc.subject.keywords	Developer Q&A platforms
dc.subject.keywords	Optical character recognition
dc.subject.keywords	Image-based software engineering
dc.title	Use of Visual Content for Inference and Response in Q/A Forums
dc.type	Electronic Thesis or Dissertation

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Ahmed_Faiz_2025_MSc.pdf
Size:: 5.07 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: license.txt
Size:: 1.87 KB
Format:: Plain Text
Description:

Download

Name:: YorkU_ETDlicense.txt
Size:: 3.39 KB
Format:: Plain Text
Description:

Download

Collections

Computer Science