Coarse-to-fine Q-Network with Action Sequence for Data-Efficient Reinforcement Learning

Published 1 year ago | Version 6 | arXiv:2411.12155

Authors

Younggyo Seo, Pieter Abbeel

Categories

cs.LG, cs.AI, cs.RO

Abstract

Predicting a sequence of actions has been crucial in the success of recent behavior cloning algorithms in robotics. Can similar ideas improve reinforcement learning (RL)? We answer affirmatively by observing that incorporating action sequences when predicting ground-truth return-to-go leads to lower validation loss. Motivated by this, we introduce Coarse-to-fine Q-Network with Action Sequence (CQN-AS), a novel value-based RL algorithm that learns a critic network that outputs Q-values over a sequence of actions, i.e., explicitly training the value function to learn the consequence of executing action sequences. Our experiments show that CQN-AS outperforms several baselines on a variety of sparse-reward humanoid control and tabletop manipulation tasks from BiGym and RLBench.
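
To make the idea of a critic that "outputs Q-values over a sequence of actions" more concrete, here is a minimal sketch of greedy action-sequence selection. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not describe the discretisation, so the sketch assumes the coarse-to-fine scheme the title suggests (pick the best bin per action dimension, then subdivide that bin at the next level). The names `coarse_to_fine_select` and `toy_q`, the values of `levels` and `bins`, and the hand-written quadratic Q-function standing in for a learned, observation-conditioned critic are all hypothetical.

```python
import numpy as np


def coarse_to_fine_select(q_fn, seq_len, action_dim, levels=4, bins=3,
                          low=-1.0, high=1.0):
    """Greedily pick a (seq_len, action_dim) action sequence.

    q_fn maps candidate bin centres of shape (seq_len, action_dim, bins)
    to Q-values of the same shape. A learned critic would also take the
    observation as input; that is omitted in this toy sketch.
    """
    lo = np.full((seq_len, action_dim), low, dtype=float)
    hi = np.full((seq_len, action_dim), high, dtype=float)
    for _ in range(levels):
        width = (hi - lo) / bins
        # Centre of every bin, for every step and action dimension.
        centers = lo[..., None] + (np.arange(bins) + 0.5) * width[..., None]
        q_values = q_fn(centers)            # (seq_len, action_dim, bins)
        best = q_values.argmax(axis=-1)     # best bin per step and dimension
        # Zoom into the chosen bin; the next level subdivides it again.
        lo = lo + best * width
        hi = lo + width
    return (lo + hi) / 2.0                  # centre of the finest interval


# Hypothetical stand-in for a critic: Q is highest near a fixed target sequence.
target = np.array([[0.3, -0.7],
                   [0.1, 0.9],
                   [-0.25, 0.5]])


def toy_q(centers):
    return -(centers - target[..., None]) ** 2


actions = coarse_to_fine_select(toy_q, seq_len=3, action_dim=2)
```

With 3 bins and 4 levels over [-1, 1], the finest interval has width 2/81, so each selected action lies within 1/81 of the toy target. The point of the sketch is only the output shape: one value head per step of the sequence and per action dimension, rather than Q-values for a single action.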

arXiv ID: 2411.12155
Published Nov 19, 2024
