The development of reward functions to serve as a method of auxiliary learning is more prevalent in the resource allocation task section. However, there are still a few prominent examples of reward function manipulation guiding networks to better solutions. Of course this is alluding to Honerkamp ...