sarthak dayal


robots:


games:


papers:

(contemporary)

Achieving Human Level Competitive Robot Table Tennis2024
Direct Preference Optimization: Your Language Model is Secretly a
Reward Model
2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion2023
Unsupervised Skill Discovery with Bottleneck Option Learning2021
Reward is Enough2021
Diversity is All You Need: Learning Skills Without a Reward Function2019

(older)

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning
with a Stochastic Actor
2018
Learning Multi-Level Hierarchies with Hindsight2017
Playing Atari with Deep Reinforcement Learning2013
On the Complexity of Solving Markov Decision Problems1995
Neural Networks and Physical Systems with Emergent Collective Computational Abilities1982

books: