搜索资源列表
catandmouse
- 强化学习实例:catchmouse 该实例用到了很多的算法monte carlo method,TD method and saras method.-an example for reinforcement learning:catchmouse.the example includes the following method:monte carlo method,TD method and saras method.
td_jd111
- TD-SCDMA系统中的调制解调技术及信道估计与联合检测-TD-SCDMA system, modulation and demodulation technology and channel estimation and joint detection
Network_Structure
- network structure in td
jsp-c
- 一个产生TD类JSP算例的C程序,源码调试通过,附带运行结果。-A C procedures to JSP example of class TD
nonlinear-TD
- 关于非线性跟踪微分器的源程序,有需要的朋友可以看一看-About nonlinear tracking differentiator source, a friend in need can take a look
maze_TD_lamda
- 时序差分学习算法TD(lamda)求解N*N方格走迷宫问题,求解每个方格的V值-Temporal difference learning algorithm for N*N maze problem
万门大学强化学习算法代码RW模型+TD模型
- 万门大学,强化学习,rw模型算法代码实现, V(CS) = V(CS) + A * ( V(US) * us - V(CS) * cs ) td模型, V(s{t}) = V(s{t}) + a[R(t+1) + rV{S(t+1)} - V{S(t)}](In the intensive learning of the University of Wan men, the RW algorithm, the python implementation, the algorithm f
RLS-TD(lamda)
- Efficient reinforcement learning using recursive least-squares methods. Jounral of AI Research, 2002, 16, 259-292
