Fadziso, T. (2020). Reward Redistribution as Align-RUDDER: Learning from a Few Demonstrations. International Journal of Reciprocal Symmetry and Theoretical Physics, 7, 1-8. https://upright.pub/index.php/ijrstp/article/view/52