List of Publications

List of publications & submissions using Minigrid or BabyAI (please open a pull request to add missing entries):

Hierarchies of Reward Machines (Imperial College London, ILASP, Universitat Pompeu Fabra, ICML 2023)
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning (Mila, McGill University, CoLLAs 2023)
DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards (U-Tokyo, Google Brain, IJCAI 2023)
Contrastive Meta-Learning for Partially Observable Few-Shot Learning (University of Edinburgh, Microsoft Research Cambridge, ICLR 2023)
Towards Improving Exploration in Self-Imitation Learning using Intrinsic Motivation (TECNALIA, IEEE ADPRL 2022)
An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration Environments (TECNALIA, CD-MAKE 2022)
Evolution Strategies for Sparse Reward Gridworld Environments (DSTG, AJCAI 2022)
History Compression via Language Models in Reinforcement Learning (Johannes Kepler University Linz, PMLR 2022)
Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill Diversity (Arizona State University, ICML 2022)
How to Stay Curious while avoiding Noisy TVs using Aleatoric Uncertainty Estimation (University College London, Boston University, ICML 2022)
In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications (Imperial College London, ICLR 2022)
Compositional Generalization in Grounded Language Learning via Induced Model Sparsity (Aalto University, NAACL-SRW 2022)
Interesting Object, Curious Agent: Learning Task-Agnostic Exploration (Meta AI Research, NeurIPS 2021)
Safe Policy Optimization with Local Generalized Linear Function Approximations (IBM Research, Tsinghua University, NeurIPS 2021)
A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning (Mila, McGill University, NeurIPS 2021)
SPOTTER: Extending Symbolic Planning Operators through Targeted Reinforcement Learning (Tufts University, SIFT, AAMAS 2021)
Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning (UCL, AAMAS 2021)
Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments (Texas A&M University, Kuai Inc., ICLR 2021)
Adversarially Guided Actor-Critic (INRIA, Google Brain, ICLR 2021)
Information-theoretic Task Selection for Meta-Reinforcement Learning (University of Leeds, NeurIPS 2020)
BeBold: Exploration Beyond the Boundary of Explored Regions (UCB, December 2020)
Approximate Information State for Approximate Planning and Reinforcement Learning in Partially Observed Systems (McGill, October 2020)
Prioritized Level Replay (FAIR, October 2020)
AllenAct: A Framework for Embodied AI Research (Allen Institute for AI, August 2020)
Learning with AMIGO: Adversarially Motivated Intrinsic Goals (MIT, FAIR, ICLR 2021)
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments (FAIR, ICLR 2020)
Learning to Request Guidance in Emergent Communication (University of Amsterdam, Dec 2019)
Working Memory Graphs (MSR, Nov 2019)
Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning (University of Antwerp, Oct 2019)
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck (MSR, NeurIPS, Oct 2019)
Recurrent Independent Mechanisms (Mila, Sept 2019)
Learning Effective Subgoals with Multi-Task Hierarchical Reinforcement Learning (Tsinghua University, August 2019)
Mastering emergent language: learning to guide in simulated navigation (University of Amsterdam, Aug 2019)
Transfer Learning by Modeling a Distribution over Policies (Mila, June 2019)
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives (Mila, June 2019)
Learning distant cause and effect using only local and immediate credit assignment (Incubator 491, May 2019)
Practical Open-Loop Optimistic Planning (INRIA, April 2019)
Learning World Graphs to Accelerate Hierarchical Reinforcement Learning (Salesforce Research, 2019)
Variational State Encoding as Intrinsic Motivation in Reinforcement Learning (Mila, TARL 2019)
Unsupervised Discovery of Decision States Through Intrinsic Control (Georgia Tech, TARL 2019)
Modeling the Long Term Future in Model-Based Reinforcement Learning (Mila, ICLR 2019)
Unifying Ensemble Methods for Q-learning via Social Choice Theory (Max Planck Institute, Feb 2019)
Planning Beyond The Sensing Horizon Using a Learned Context (MLMP@IROS, 2018)
Guiding Policies with Language via Meta-Learning (UC Berkeley, Nov 2018)
On the Complexity of Exploration in Goal-Driven Navigation (CMU, NeurIPS, Nov 2018)
Transfer and Exploration via the Information Bottleneck (Mila, Nov 2018)
Creating safer reward functions for reinforcement learning agents in the gridworld (University of Gothenburg, 2018)
BabyAI: First Steps Towards Grounded Language Learning With a Human In the Loop (Mila, ICLR, Oct 2018)

This environment was built as part of work done at Mila. The Dynamic obstacles environment was added as part of work done at IAS at TU Darmstadt and the University of Genoa on mobile robot navigation with dynamic obstacles.