Yu, Xiaohui2019-03-052019-03-052018-12-072019-03-05http://hdl.handle.net/10315/35922Recent advances in social and mobile technology have enabled an abundance of digital traces (in the form of mobile check-ins, WiFi hotspots handshaking, etc.) revealing the physical presence history of diverse sets of entities. One challenging, yet important, task is to identify k entities that are most closely associated with a given query entity based on their digital traces. We propose a suite of hierarchical indexing techniques and algorithms to enable fast query processing for this problem at scale. We theoretically analyze the pruning effectiveness of the proposed methods based on a human mobility model which we propose and validate in real life situations. Finally, we conduct extensive experiments on both synthetic and real datasets at scale, evaluating the performance of our techniques, confirming the effectiveness and superiority of our approach over other applicable approaches across a variety of parameter settings and datasets.enAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.Computer scienceTop-K Queries Over Digital TracesElectronic Thesis or Dissertation2019-03-05Spatio-temporal dataTop-k queryHashingHuman mobility