Fadziso, Takudzwa. “Reward Redistribution As Align-RUDDER: Learning from a Few Demonstrations”. International Journal of Reciprocal Symmetry and Theoretical Physics 7 (February 15, 2020): 1–8. Accessed October 5, 2024. https://upright.pub/index.php/ijrstp/article/view/52.