Classification in textual conversations: A study of emotion prediction and derailment forecasting

dc.contributor.advisorJenkin, Michael R.
dc.contributor.authorAlTarawneh, Enas Khaled Ahm
dc.date.accessioned2025-07-23T15:09:26Z
dc.date.available2025-07-23T15:09:26Z
dc.date.copyright2025-02-25
dc.date.issued2025-07-23
dc.date.updated2025-07-23T15:09:25Z
dc.degree.disciplineElectrical Engineering & Computer Science
dc.degree.levelDoctoral
dc.degree.namePhD - Doctor of Philosophy
dc.description.abstractEmotion is fundamental to human communication, shaping not just the content but the very essence of our interactions with others. In the realm of Natural Language Processing (NLP), particularly for applications that bridge human-machine communication such as health-care, education, and social networks, understanding and emulating emotional nuances becomes paramount. While it may be straightforward for humans to perceive and reason about the feelings of others in conversations, it is a challenge for machines, mainly due to context. Conversation models in the literature that incorporate context vary in the type of contextual information they incorporate (e.g., temporal structure, speaker identification, commonsense knowledge). However, studies to date have not explicitly quantified the impact of the type(s) of information incorporated within the critical conversation classification tasks of future emotion prediction and emotional derailment forecasting, nor the structure of the model architectures and encoding structures used for these tasks. These issues are addressed in this work. This thesis approaches this problem by developing AI models that can capture different design choices for these tasks. Critically, the models developed here are designed to capture three properties inherently connected to the emotional predictive problem in dialogues; sequence modeling, self-dependency modeling, and recency. These modeling dimensions are then incorporated into one of two deep neural network architectures, a sequence model and a graph convolutional network model. The former is designed to capture the sequence of utterances in a dialogue, while the latter captures the sequence of utterances and the formation of multi-party dialogues. Through an empirical evaluation of these model architectures, data type and data encoding choices, this work demonstrates (i) the importance of the self- dependency and recency model dimensions for the prediction tasks, (ii) the effectiveness of graph neural models in improving the predictions obtained by sequence-only models, (iii) the impact of fusing multi-source information of each utterance into utterance capsules, specifically emotion labels and common sense knowledge and, (iv) that using a transformer-based forecaster for the conversation predictive task also improves performance. Optimal design choices within these structures provides near best in class performance for next emotion prediction in conversations and best in class performance for conversation derailment prediction. This thesis also shows that simple fine-tuning of large language models is not an effective classification method for these tasks. Evaluations are performed using standard conversational datasets and current state of the art network models. Results from this work will help inform future dataset structure and the development of advanced sentiment analysis systems.
dc.identifier.urihttps://hdl.handle.net/10315/42956
dc.languageen
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subject.keywordsEmotion in communication
dc.subject.keywordsNatural language processing (NLP)
dc.subject.keywordsHuman-machine communication
dc.subject.keywordsContext-aware conversation models
dc.subject.keywordsFuture emotion prediction
dc.subject.keywordsEmotional derailment forecasting
dc.subject.keywordsDeep neural networks
dc.subject.keywordsSequence modeling
dc.subject.keywordsSelf-dependency modeling
dc.subject.keywordsRecency effect
dc.subject.keywordsGraph Convolutional Networks (GCN)
dc.subject.keywordsTransformer-based forecasting
dc.subject.keywordsMulti-party dialogues
dc.subject.keywordsCommonsense knowledge fusion
dc.subject.keywordsSentiment analysis
dc.subject.keywordsConversational AI
dc.subject.keywordsDataset structure optimization
dc.subject.keywordsFine-tuning large language models
dc.subject.keywordsPredictive dialogue modeling
dc.subject.keywordsEmotion Recognition in Conversations (ERC)
dc.subject.keywordsNext emotion prediction in conversations
dc.subject.keywordsDerailment forecasting in conversations
dc.titleClassification in textual conversations: A study of emotion prediction and derailment forecasting
dc.typeElectronic Thesis or Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Enas_Khaled_Ahm_AlTarawneh_2025_PhD.pdf
Size:
17.45 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Description:
Loading...
Thumbnail Image
Name:
YorkU_ETDlicense.txt
Size:
3.39 KB
Format:
Plain Text
Description: