...的翻译是:Based on Q-learning theory, Q-learning algorithm...
a基于Q学习理论,研究Q学习算法的理论基础以及主要思想,阐述Q学习的构成和特点,对Q学习算法步骤、期望回报函数、Q值函数、动作选择机制、Q值更新函数等进行了详细的分析,探讨Q学习算法的详细内容。 Based on the Q study theory, studies the Q study algorithm the rationale as well as the main thought, elaborate...