Search resource list
WindyGridWorldQLearning
- Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively
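For readers who want to see the incremental update in concrete form, here is a minimal sketch of tabular Q-learning on a windy gridworld. It is not the code of the listed resource. The grid size, wind strengths, start and goal cells, reward of -1 per step, and all names (`step`, `train`, `greedy_path_length`, `alpha`, `gamma`, `epsilon`) are assumptions, with the layout following the usual textbook version of the task.

```python
import random

# Assumed layout: 7x10 grid, upward wind strength per column.
ROWS, COLS = 7, 10
WIND = [0, 0, 0, 1, 1, 1, 2, 2, 1, 0]
START, GOAL = (3, 0), (3, 7)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply the move plus the wind of the column being left; clip to the grid."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    new_row = min(max(row + d_row - WIND[col], 0), ROWS - 1)
    new_col = min(max(col + d_col, 0), COLS - 1)
    next_state = (new_row, new_col)
    return next_state, -1.0, next_state == GOAL


def train(episodes=500, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Epsilon-greedy tabular Q-learning; returns the learned Q table."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        state, done = START, False
        while not done:
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))  # explore
            else:
                action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
            next_state, reward, done = step(state, action)
            # Q-learning target bootstraps from the best next action.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_path_length(q, max_steps=1000):
    """Follow the greedy policy from START; stop at GOAL or after max_steps."""
    state, steps = START, 0
    while state != GOAL and steps < max_steps:
        action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
        state, _, _ = step(state, action)
        steps += 1
    return steps


if __name__ == "__main__":
    table = train()
    print("greedy path length:", greedy_path_length(table))
```

Each update touches a single table entry, which is why the computational cost per step stays small. The hyperparameters here are illustrative and may need tuning for other layouts.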
gai
- A small two-way serial communication program: it receives values and plots them as a line graph, while sending out the value of a slider.