Resource search results
maze_TD_lamda
- A TD(λ) temporal-difference learning algorithm for the N×N grid maze problem; it computes the state value V of each grid cell.
Chapter 6 (Temporal Difference Learning)
- Explains Q-learning and SARSA in reinforcement learning, and uses concrete examples to show how to apply these learning methods.
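
For orientation, the two update rules named in the entry above can be sketched as follows. This is a minimal illustration and not code from the listed resource: the function names, the dictionary-based Q-table, and the default values of alpha and gamma are all assumptions made for the example.

```python
from typing import Dict, List

# Hypothetical Q-table layout: state id -> list of action values.
QTable = Dict[int, List[float]]


def q_learning_update(q: QTable, s: int, a: int, r: float, s_next: int,
                      alpha: float = 0.5, gamma: float = 0.9) -> float:
    """Off-policy update: bootstrap from the best action in the next state."""
    target = r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])
    return q[s][a]


def sarsa_update(q: QTable, s: int, a: int, r: float, s_next: int, a_next: int,
                 alpha: float = 0.5, gamma: float = 0.9) -> float:
    """On-policy update: bootstrap from the action actually taken next."""
    target = r + gamma * q[s_next][a_next]
    q[s][a] += alpha * (target - q[s][a])
    return q[s][a]


if __name__ == "__main__":
    # Two states, two actions each; values chosen only for illustration.
    q1 = {0: [0.0, 0.0], 1: [1.0, 2.0]}
    q2 = {0: [0.0, 0.0], 1: [1.0, 2.0]}
    print(q_learning_update(q1, s=0, a=0, r=1.0, s_next=1))
    print(sarsa_update(q2, s=0, a=0, r=1.0, s_next=1, a_next=0))
```

The only difference between the two functions is the bootstrap term: Q-learning uses the maximum action value in the next state, while SARSA uses the value of the action the agent actually selects there. In the toy call above the results therefore differ (about 1.4 versus 0.95), because the next action chosen for SARSA is not the greedy one.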