PaperSwipe

Avoidance of an unexpected obstacle without reinforcement learning: Why not using advanced control-theoretic tools?

Published 3 months agoVersion 1arXiv:2509.03721

Authors

Cédric Join, Michel Fliess

Categories

eess.SYcs.ROmath.OC

Abstract

This communication on collision avoidance with unexpected obstacles is motivated by some critical appraisals on reinforcement learning (RL) which "requires ridiculously large numbers of trials to learn any new task" (Yann LeCun). We use the classic Dubins' car in order to replace RL with flatness-based control, combined with the HEOL feedback setting, and the latest model-free predictive control approach. The two approaches lead to convincing computer experiments where the results with the model-based one are only slightly better. They exhibit a satisfactory robustness with respect to randomly generated mismatches/disturbances, which become excellent in the model-free case. Those properties would have been perhaps difficult to obtain with today's popular machine learning techniques in AI. Finally, we should emphasize that our two methods require a low computational burden.

Avoidance of an unexpected obstacle without reinforcement learning: Why not using advanced control-theoretic tools?

3 months ago
v1
2 authors

Categories

eess.SYcs.ROmath.OC

Abstract

This communication on collision avoidance with unexpected obstacles is motivated by some critical appraisals on reinforcement learning (RL) which "requires ridiculously large numbers of trials to learn any new task" (Yann LeCun). We use the classic Dubins' car in order to replace RL with flatness-based control, combined with the HEOL feedback setting, and the latest model-free predictive control approach. The two approaches lead to convincing computer experiments where the results with the model-based one are only slightly better. They exhibit a satisfactory robustness with respect to randomly generated mismatches/disturbances, which become excellent in the model-free case. Those properties would have been perhaps difficult to obtain with today's popular machine learning techniques in AI. Finally, we should emphasize that our two methods require a low computational burden.

Authors

Cédric Join, Michel Fliess

arXiv ID: 2509.03721
Published Sep 3, 2025

Click to preview the PDF directly in your browser