have been a long time coming: Zero-knowledge proof systems were introduced by Shafi Goldwasser, Silvio Micali, and Charles Rackoff in 1985, and had a transformative effect on the field of cryptography; they were recognized with the ACM Turing Award given to Goldwasser and Micali in 2012. ...
As the true values are unknown, we intended to determine (adaptive) policies that maximize a discounted reward criterion with constraints, that is, we used Lagrange multipliers to find optimal (adaptive) policies for the unconstrained version of the optimal control problem. In the present context,...
In the literature, this approach is known as the Principle of Estimation and Control. This problem has been studied in several contexts. For instance, refs. [5,6,7,8] and the references therein are about stochastic control systems evolving in discrete time. On the other hand, adaptive ...
Let us also point out that it may not be at all obvious that each minimum satisfies the associated Euler–Lagrange equation. See, e.g., the survey paper [6]. This problem has also been addressed in [7], and the assumptions we will impose on Ψ are related to those required in...