
A Hybrid Approach for Large-Scale Product Categorization Based on Weighted KNN and LSTM-BPV

dc.contributor.advisor: Huang, Xiangji
dc.contributor.author: Hu, Haohao
dc.date.accessioned: 2019-12-04T13:30:56Z
dc.date.available: 2019-12-04T13:30:56Z
dc.date.copyright: 2019-08
dc.date.issued: 2019-12-04
dc.date.updated: 2019-12-04T13:30:55Z
dc.degree.discipline: Information Systems and Technology
dc.degree.level: Master's
dc.degree.name: MA - Master of Arts
dc.description.abstract: In modern e-commerce systems, large volumes of new items are added to product lists every day, which calls for automatic product categorization. In this thesis, we propose a weighted K-Nearest Neighbour (KNN) based classification system for the large-scale e-commerce product taxonomy classification problem. We use information retrieval (IR) models as the similarity function in our weighted KNN algorithm. Among all IR models used in this study, the information-based (IB) model achieved the highest classification performance when used as the similarity function. Moreover, our proposed method improves overall performance when its predictions are combined with those of an advanced neural network based method, namely Long Short-Term Memory with Balanced Pooling Views (LSTM-BPV). The hybrid system achieves results comparable to the state of the art (SotA). We also obtain good results by fine-tuning a pre-trained Bidirectional Encoder Representations from Transformers (BERT) model.
dc.identifier.uri: http://hdl.handle.net/10315/36836
dc.language: en
dc.rights: Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subject: Artificial intelligence
dc.subject.keywords: E-commerce Product Taxonomy Classification
dc.subject.keywords: Information Retrieval
dc.subject.keywords: K-Nearest Neighbour
dc.subject.keywords: Ensemble
dc.subject.keywords: Text Classification
dc.subject.keywords: Data Mining
dc.title: A Hybrid Approach for Large-Scale Product Categorization Based on Weighted KNN and LSTM-BPV
dc.type: Electronic Thesis or Dissertation
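
The weighted KNN voting described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the thesis's implementation: the similarity function here is a plain TF-IDF cosine standing in for the IR models (such as the information-based model) that the thesis evaluates, and every name (`weighted_knn_predict`, `TRAIN`, the toy product titles and category labels) is hypothetical.

```python
import math
from collections import Counter, defaultdict

# Toy labelled product titles (hypothetical data, for illustration only).
TRAIN = [
    ("apple iphone case black", "Electronics"),
    ("samsung phone charger cable", "Electronics"),
    ("cotton t-shirt men blue", "Clothing"),
    ("women summer dress cotton", "Clothing"),
]


def tokenize(text):
    """Lowercase whitespace tokenizer."""
    return text.lower().split()


def build_idf(docs):
    """Smoothed inverse document frequency for each term in the corpus."""
    n = len(docs)
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    return {t: math.log((n + 1) / (c + 1)) + 1.0 for t, c in df.items()}


def tfidf(tokens, idf):
    """Sparse TF-IDF vector; terms unseen in the corpus get weight 0."""
    return {t: c * idf.get(t, 0.0) for t, c in Counter(tokens).items()}


def cosine(a, b):
    """Cosine similarity between two sparse vectors (assumed stand-in
    for the IR scoring functions used in the thesis)."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def weighted_knn_predict(query, train, k=3):
    """Score every training item against the query, keep the k most
    similar, and let each neighbour vote for its label with a weight
    equal to its similarity score."""
    docs = [tokenize(title) for title, _ in train]
    idf = build_idf(docs)
    q = tfidf(tokenize(query), idf)
    scored = sorted(
        ((cosine(q, tfidf(d, idf)), label) for d, (_, label) in zip(docs, train)),
        reverse=True,
    )
    votes = defaultdict(float)
    for score, label in scored[:k]:
        votes[label] += score
    return max(votes, key=votes.get)
```

Calling `weighted_knn_predict("blue cotton shirt", TRAIN)` ranks the training titles by similarity to the query and returns the label with the largest summed similarity among the top `k`. Swapping `cosine` for another scoring function is the point where a different IR model would plug in.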

Files

Original bundle
Name:
hu_haohao_2019_Master.pdf
Size:
890.58 KB
Format:
Adobe Portable Document Format
License bundle
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Name:
YorkU_ETDlicense.txt
Size:
3.39 KB
Format:
Plain Text