Aaron Mishkin

Publications

  1. To Each Optimizer a Norm, To Each Norm its Generalization. S. Vaswani, R. Babanezhad, J. Gallego, A. Mishkin, S. Lacoste-Julien, N. Le Roux. arXiv Preprint, 2020. [arXiv]

  2. Painless Stochastic Gradient: Interpolation, Line-Search, and Convergence Rates. S. Vaswani, A. Mishkin, I. Laradji, M. Schmidt, G. Gidel, S. Lacoste-Julien. NeurIPS, 2019. [arXiv] [code] [video]

  3. SLANG: Fast Structured Covariance Approximations for Bayesian Deep Learning with Natural Gradient. A. Mishkin, F. Kunstner, D. Nielsen, M. Schmidt, M. E. Khan. NeurIPS, 2018. [arXiv] [code] [video]

  4. Web ValueCharts: Analyzing Individual and Group Preferences with Interactive, Web-based Visualizations. A. Mishkin. Review of Undergraduate Computer Science, 2018. [pdf]

Talks

Talks about Painless SGD:

Talks at the UBC Machine Learning Reading Group (MLRG):

Miscellaneous: