system, moving processes from the ready state to the running state. You will use three different dispatching algorithms: 1. first-come-first-served (FCFS) 2. shortest-job-first (SJF) 3. round-robin (RR) Each process will have a burst time, which represents the total time that a process ...
If you plan on using reward shaping, take a look at how the default shaped rewards are swapped by the agent index in the provided notebook. Upon episode reset, agents are assigned randomly to one of the 2 starting positions. This assignment is only reflected in the official observation that...