Bayesian Model Selection for Discrete Graphical Models
dc.contributor.advisor | Gao, Xin | |
dc.contributor.author | Roach, Lyndsay | |
dc.date.accessioned | 2023-08-04T15:20:02Z | |
dc.date.available | 2023-08-04T15:20:02Z | |
dc.date.issued | 2023-08-04 | |
dc.date.updated | 2023-08-04T15:20:02Z | |
dc.degree.discipline | Mathematics & Statistics | |
dc.degree.level | Doctoral | |
dc.degree.name | PhD - Doctor of Philosophy | |
dc.description.abstract | Graphical models allow for easy interpretation and representation of complex distributions. There is an expanding interest in model selection problems for high-dimensional graphical models, particularly when the number of variables increases with the sample size. A popular model selection tool is the Bayes factor, which compares the posterior probabilities of two competing models. Consider data given in the form of a contingency table where N objects are classified according to q random variables, where the conditional independence structure of these random variables are represented by a discrete graphical model G. We assume the cell counts follow a multinomial distribution with a hyper Dirichlet prior distribution imposed on the cell probability parameters. Then we can write the Bayes factor as a product of gamma functions indexed by the cliques and separators of G. In this thesis, we study the behaviour of the Bayes factor when the dimension of a true discrete graphical model is fixed and when the dimension increases to infinity with the sample size. We prove that the Bayes factor is strong model selection consistent for both decomposable and non-decomposable discrete graphical models. When the true graph is non-decomposable, we prove that the Bayes factor selects a minimal triangulation of the true graph. We support our theoretical results with various simulations. In addition, we introduce a variation of the genetic algorithm, called the graphical local genetic algorithm, which can be implemented on large data sets. We use a local search operator and a normalizing constant proportionate to the posterior probability of the candidate models to determine optimal submodels, then reconstruct the full graph from the resulting subgraphs. We demonstrate the graphical local genetic algorithm's capabilities on both simulated data sets with known true graphs and on a real-world data set. | |
dc.identifier.uri | https://hdl.handle.net/10315/41382 | |
dc.language | en | |
dc.rights | Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests. | |
dc.subject | Statistics | |
dc.subject.keywords | Bayesian | |
dc.subject.keywords | Consistency | |
dc.subject.keywords | Decomposable | |
dc.subject.keywords | Graph selection | |
dc.subject.keywords | Hyper Dirichlet | |
dc.subject.keywords | Marginal likelihood | |
dc.title | Bayesian Model Selection for Discrete Graphical Models | |
dc.type | Electronic Thesis or Dissertation |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Roach_Lyndsay_M_2023_PhD_v2.pdf
- Size:
- 1.02 MB
- Format:
- Adobe Portable Document Format