Fadziso, Takudzwa. “Reward Redistribution As Align-RUDDER: Learning from a Few Demonstrations”. International Journal of Reciprocal Symmetry and Theoretical Physics, vol. 7, Feb. 2020, pp. 1-8, https://upright.pub/index.php/ijrstp/article/view/52.