YorkSpace
York University's Institutional Repository
    • English
    • français
  • English 
    • English
    • français
  • Login
View Item 
  •   YorkSpace Home
  • Faculty of Graduate Studies
  • Electronic Theses and Dissertations (ETDs)
  • Computer Science and Engineering
  • View Item
  •   YorkSpace Home
  • Faculty of Graduate Studies
  • Electronic Theses and Dissertations (ETDs)
  • Computer Science and Engineering
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Approximate Parallel High Utility Itemset Mining

Thumbnail
View/Open
Chen_Yan_2015_Masters.pdf (502.2Kb)
Date
2016-09-20
Author
Chen, Yan

Metadata
Show full item record
Abstract
High utility itemset mining discovers itemsets whose utility is above a given threshold, where utilities measure the importance of itemsets. In high utility itemset mining, memory and time performance limitations cause scalability issues, when the dataset is very large. In this thesis, the problem is addressed by proposing a distributed parallel algorithm, PHUI-Miner, and a sampling strategy, which can be used either separately or simultaneously. PHUI-Miner parallelizes the state-of-the-art high utility itemset mining algorithm HUI-Miner. The sampling strategy investigates the required sample size of a dataset, in order to achieve a given accuracy. We also propose an approach combining sampling with PHUI-Miner, which provides better time performance. In our experiments, we show that PHUI-Miner has high performance and outperforms the state-of-the-art non-parallel algorithm. The sampling strategy achieves accuracies much higher than the guarantee. Extensive experiments are also conducted to compare the time performance of PHUI-Miner with and without sampling.
URI
http://hdl.handle.net/10315/32162
Collections
  • Computer Science and Engineering

All items in the YorkSpace institutional repository are protected by copyright, with all rights reserved except where explicitly noted.

YorkU LogoContact Us | Send Feedback
Sitemap for search engines

 

Browse

All of YorkSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

LoginRegister

Statistics

View Usage Statistics

All items in the YorkSpace institutional repository are protected by copyright, with all rights reserved except where explicitly noted.

YorkU LogoContact Us | Send Feedback
Sitemap for search engines