compare+ddpg+agent+to+lqr+controller

2025-03-29 08:25:08

拼音 [ 拼音 ]

Compare DDPG Agent to LQR Controller

A DDPG agent approximates the discounted cumulative long-term reward using a Q-value-function critic. A Q-value function critic must accept an observation and an action as inputs and return a scalar (the estimated discounted cumulative long-term reward) as output. To approximate the Q-value f...