搜索资源列表
Optimality-of-the-NVI-Adaptive-Policy-for-a-Parti
- Paper on the optimality of a non-stationary value iteration adaptive policy for a Partially Observed Markov Decision Proce-Paper on the optimality of a non-stationary value iteration adaptive policy for a Partially Observed Markov Decision Process
KLSPI论文
- Kernel-based least squares policy iteration for reinforcement learning. IEEE Transactions on Neural Networks, 2007, 18(4) 973-992