Using route choice in an activity context as an example, we estimate a latent class random regret-minimization model, which takes into account the travel time and therefore arrival time uncertainty that people face when making route choice decisions. In addition, it incorporates the effects of ...
Deep model-based Reinforcement Learning (RL) has the potential to substantially improve the sample-efficiency of deep RL. While various challenges have long held it back, a number of papers have recently come out reporting success with deep model-bas...
model selectionThis paper considers portfolio construction in a dynamic setting. We specifya loss function comprised of utility and complexity components with an unknowntradeoff parameter. We develop a novel regret-based criterion for selecting thetradeoff parameter to construct optimal sparse portfolios ...
In Section 2, we provide a short introduction to regret theory and formulate an optimal insurance model under regret theory. Section 3 attempts to solve the optimal insurance model and shows that the optimal solution for a regret-averse insured can be in the form of no insurance or partial ...
60 construct a regret-based three-way decision model under interval type-2 fuzzy (IT2F) environment. Nonetheless, all of the above methods based on regret theory only compare with the best alternative, but ignore the worst alternative when calculating the regret-rejoice value, which may lead to...
Mondal, A., Roy, S.K., Zhan, J.M.: A reliability-based consensus model and regret theory-based selection process for linguistic hesitant-Z multi-attribute group decision making. Expert Syst. Appl. Lingras, P.J., Yao, Y.Y.: Data mining using extensions of the rough set model. J. ...
The model consists of 14 buses and 15 branches, as shown in Figure 6. Different types of DGs are connected to altered buses in the test system [39]. Almost every DG type has stable output power, but two of them have variations in output power on account of the fluctuation in resources...
For the multiple criteria decision-making (MCDM) problem with interval-valued probabilistic linguistic information, we propose a novel method considering the regret theory and cobweb area model. We first propose a new score function, which can be used to compare different interval-valued probabilistic...
+86一lO-62562563ReinforcementLearningModelBasedonRegretforMulti-AgentConflictGamesXIAOZheng+,ZHANGShi.Yong(DepartmentofComputerandInformationTechnology,FudanUniversity,Shanghai200433,China)+Correspondingauthor:E-mail:xiaozhen9206@163.corn,hUp://www.fudan.edu.∞XiaoZ,ZhangSY.Reinforcementlearningmodelbasedonregret...
A novel 3WD model based on RT By using the basic idea of RT, Section 4.1 proposes a new 3WD method based on regret values, rejoicing values and overall psychological perception values, i.e., constructing the score function under the three strategies and analyzing the related properties of thre...