Use of Visual Content for Inference and Response in Q/A Forums

dc.contributor.advisorNayebi, Maleknaz
dc.contributor.authorAhmed, Faiz
dc.date.accessioned2025-07-23T15:24:15Z
dc.date.available2025-07-23T15:24:15Z
dc.date.copyright2025-05-27
dc.date.issued2025-07-23
dc.date.updated2025-07-23T15:24:15Z
dc.degree.disciplineComputer Science
dc.degree.levelMaster's
dc.degree.nameMSc - Master of Science
dc.description.abstractIn the rapidly evolving landscape of developer communities, Q&A platforms serve as crucial resources for crowdsourcing developers' knowledge. A notable trend is the increasing use of images to convey complex queries more effectively. However, the current state-of-the-art method of duplicate question detection has not kept pace with this shift, which predominantly concentrates on text-based analysis. Inspired by advancements in image processing and numerous studies in software engineering illustrating the promising future of image-based communication on social coding platforms, we delved into image-based techniques for identifying duplicate questions on Stack Overflow. When focusing solely on text analysis of Stack Overflow questions and omitting the use of images, our automated models overlook a significant aspect of the question. Previous research has demonstrated the complementary nature of images to text. To address this, we implemented two methods of image analysis: first, integrating the text from images into the question text, and second, evaluating the images based on their visual content using image captions. After a rigorous evaluation of our model, it became evident that the efficiency improvements achieved were relatively modest, approximately an average of 1%. This marginal enhancement falls short of what could be deemed a substantial impact. As an encouraging aspect, our work lays the foundation for easy replication and hypothesis validation, allowing future research to build upon our approach and explore novel solutions for more effective image-driven duplicate question detection.
dc.identifier.urihttps://hdl.handle.net/10315/43072
dc.languageen
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subject.keywordsVisual programming communication
dc.subject.keywordsStack Overflow image analysis
dc.subject.keywordsScreenshot text extraction
dc.subject.keywordsMultimodal large language models
dc.subject.keywordsProgramming screenshots
dc.subject.keywordsCode image processing
dc.subject.keywordsDeveloper Q&A platforms
dc.subject.keywordsOptical character recognition
dc.subject.keywordsImage-based software engineering
dc.titleUse of Visual Content for Inference and Response in Q/A Forums
dc.typeElectronic Thesis or Dissertation

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ahmed_Faiz_2025_MSc.pdf
Size:
5.07 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
YorkU_ETDlicense.txt
Size:
3.39 KB
Format:
Plain Text
Description:

Collections