[1]
T. Fadziso, “Reward Redistribution as Align-RUDDER: Learning from a Few Demonstrations”, Int. j. recipr. symmetry theor. phys., vol. 7, pp. 1–8, Feb. 2020, Accessed: Oct. 05, 2024. [Online]. Available: https://upright.pub/index.php/ijrstp/article/view/52