Mining Large-Scale News Articles for Predicting Forced Migration via Machine Learning Techniques

dc.contributor.advisorAn, Aijun
dc.creatorKhonsari, Forouqsadat
dc.date.accessioned2018-08-27T16:42:52Z
dc.date.available2018-08-27T16:42:52Z
dc.date.copyright2018-05-15
dc.date.issued2018-08-27
dc.date.updated2018-08-27T16:42:52Z
dc.degree.disciplineComputer Science
dc.degree.levelMaster's
dc.degree.nameMSc - Master of Science
dc.description.abstractMany people are being displaced every day from all around the globe. Many of them are forced to leave their homes because of socio-political conflicts, human-made or natural disasters. In order to develop an early warning system for forced migration in the context of humanitarian crisis, it is essential to study the factors that cause forced migration, and build a model to predict the future number of displaced people. In this research, we focus on studying forced migration due to socio-political conflicts for which violence is the main reason. In particular, we investigate whether the degree of violence in a specific region can be detected from news articles related to that region and whether the detected violence scores can be used to improve the prediction accuracy. We investigate three techniques to extract the degree of violence from a corpus of news articles: ED-FE, TD-FE and SWSW. SWSW measures the semantic similarity between documents and a set of seed-words representing violence. ED-FE extracts violent events from news articles, which are the incidents related to attacks or the ones resulting in casualties. TD-FE uses topic modeling techniques to reduce the size of the information for easier analysis and filtering the violent incidents. Experiments indicate that ED-FE and TD-FE provide accurate violence scores which are very effective features for making forced displacement forecasts and using them in prediction models improves the prediction accuracy.
dc.identifier.urihttp://hdl.handle.net/10315/35024
dc.language.isoen
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subjectSocial research
dc.subject.keywordsNatural language processing
dc.subject.keywordsAnalyzing news articles
dc.subject.keywordsForced displacement prediction
dc.subject.keywordsEvent detection. topic detection. emotion detection
dc.subject.keywordsTime-series analysis
dc.subject.keywordsForced migration
dc.subject.keywordsViolence extraction
dc.subject.keywordsLarge-scale news articles
dc.subject.keywordsDeep learning
dc.subject.keywords
dc.titleMining Large-Scale News Articles for Predicting Forced Migration via Machine Learning Techniques
dc.typeElectronic Thesis or Dissertation

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Khonsari_Forouqsadat_2018_Master.pdf
Size:
2.04 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
YorkU_ETDlicense.txt
Size:
3.4 KB
Format:
Plain Text
Description: