1. Fadziso T. Reward Redistribution as Align-RUDDER: Learning from a Few Demonstrations. Int. j. recipr. symmetry theor. phys. 2020;7:1-8. Accessed July 12, 2025. https://upright.pub/index.php/ijrstp/article/view/52