You can also browse my Google Scholar profile.
-
When does return-conditioned supervised learning work for offline reinforcement learning?
David Brandfonbrener, Alberto Bietti, Jacob Buckman, Romain Laroche, Joan Bruna. Conference on Neural Information Processing Systems, (NeurIPS) 2022.
PDF -
The Importance of Pessimism in Fixed-Dataset Policy Optimization
Jacob Buckman, Carles Gelada, Marc G. Bellemare. International Conference on Learning Representations, (ICLR) 2021. PDF Code -
DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare. International Conference on Machine Learning, (ICML) 2019.
PDF Code -
Sample-efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Jacob Buckman, Danijar Hafner, George Tucker, Eugene Brevdo, Honglak Lee. Conference on Neural Information Processing Systems, (NeurIPS) 2018.
PDF -
Neural Lattice Language Models
Jacob Buckman, Graham Neubig.
Transactions of the Association for Computational Linguistics (TACL) 2018.
PDF Code -
Is Generator Conditioning Causally Related to GAN Performance?
Augustus Odena, Jacob Buckman, Catherine Olsson, Tom B. Brown, Christopher Olah, Colin Raffel, Ian Goodfellow. International Conference on Machine Learning, (ICML) 2018.
PDF -
Thermometer Encoding: One Hot Way to Resist Adversarial Examples
Jacob Buckman, Aurko Roy, Colin Raffel, Ian Goodfellow. International Conference on Learning Representations (ICLR) 2018.
PDF -
Transition-Based Dependency Parsing with Heuristic Backtracking
Jacob Buckman, Miguel Ballesteros, Chris Dyer. Empirical Methods on Natural Language Processing (EMNLP) 2016.
PDF