Adaptive Momentum for Neural Network Optimization
dc.contributor.advisor | An, Aijun | |
dc.contributor.author | Rashidi, Zana | |
dc.date.accessioned | 2020-05-11T12:56:16Z | |
dc.date.available | 2020-05-11T12:56:16Z | |
dc.date.copyright | 2019-12 | |
dc.date.issued | 2020-05-11 | |
dc.date.updated | 2020-05-11T12:56:16Z | |
dc.degree.discipline | Computer Science | |
dc.degree.level | Master's | |
dc.degree.name | MSc - Master of Science | |
dc.description.abstract | In this thesis, we develop a novel and efficient algorithm for optimizing neural networks inspired by a recently proposed geodesic optimization algorithm. Our algorithm, which we call Stochastic Geodesic Optimization (SGeO), utilizes an adaptive coefficient on top of Polyaks Heavy Ball method effectively controlling the amount of weight put on the previous update to the parameters based on the change of direction in the optimization path. Experimental results on strongly convex functions with Lipschitz gradients and deep Autoencoder benchmarks show that SGeO reaches lower errors than established first-order methods and competes well with lower or similar errors to a recent second-order method called K-FAC (Kronecker-Factored Approximate Curvature). We also incorporate Nesterov style lookahead gradient into our algorithm (SGeO-N) and observe notable improvements. We believe that our research will open up new directions for high-dimensional neural network optimization where combining the efficiency of first-order methods and the effectiveness of second-order methods proves a promising avenue to explore. | |
dc.identifier.uri | https://hdl.handle.net/10315/37485 | |
dc.language | en | |
dc.rights | Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests. | |
dc.subject | Computer science | |
dc.subject.keywords | Machine learning | |
dc.subject.keywords | Optimization | |
dc.subject.keywords | Momentum | |
dc.subject.keywords | Neural networks | |
dc.subject.keywords | Geodesics | |
dc.subject.keywords | Artificial intelligence | |
dc.title | Adaptive Momentum for Neural Network Optimization | |
dc.type | Electronic Thesis or Dissertation |
Files
Original bundle
1 - 1 of 1