Search Resources - reward - 搜珍网

Search results

Main

Downloads: 0
AI reinforcement learning in a grid world. Java implementation of a reinforcement learning agent that searches a grid world from the start point to the goal state, printing how many times the goal cell is reached in every 1000 steps. Reward: goal -> +1, rest -> 0.
Category: AI-NN-PR
- Published: 2017-04-01
- File size: 1957
- Uploader: Sean

getPoints

Downloads: 0
Basis point, 1/100 of one percent, denoted bp, bps, and ‱. Pivot point, a price level of significance in the analysis of a financial market that is used as a predictive indicator of market movement. Point (mortgage), a percentage sometimes refe
Category: matlab
- Published: 2017-04-04
- File size: 941
- Uploader: KIM JUNG HYUN

LMin

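Downloads: 0
Russian nesting dolls: north-south and east-west roads criss-cross in a grid. A solid-gold Russian doll sits at each intersection; the dolls differ in weight and size, and every heavier doll can hold any lighter one. You may run along the roads and pick up the dolls at the intersections, on the condition that what you carry is a single nested doll at all times, and dolls cannot be taken apart again once nested. Do not travel the same road twice. Plan a route that yields the largest haul.
Category: Data structs
- Published: 2017-04-06
- File size: 192264
- Uploader: tangxing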

a

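Downloads: 0
Create an employee class that represents employee information, with the data members name, empNo, and salary for the employee's name, ID number, and monthly salary. Derive three classes from employee: worker, technician, and salesman, representing ordinary workers, research staff, and sales staff. These classes contain the data members productNum, workHours, and monthlysales respectively: the number of products a worker makes per month, the hours a technician works per month, and a salesman's monthly sales. Every class must include a member function pay that computes the employee's monthly salary, assuming: an ordinary worker's monthly salary
Category: Data structs
- Published: 2017-03-29
- File size: 1157
- Uploader: 小东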

pso

Downloads: 0
PSO example that provides the PSO global-best function, a reward function, and a reward-and-penalty function.
Category: matlab
- Published: 2017-11-30
- File size: 738
- Uploader: jin fei

java12

Downloads: 0
If Linghu Chong's Java score is greater than 90 and his C score is greater than 80, his master rewards him; alternatively, if his Java score equals 100 and his music score is greater than 70, his master also rewards him.
Category: AI-NN-PR
- Published: 2017-11-08
- File size: 678
- Uploader: 祝洪彬
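
The reward rule in this listing reduces to a single boolean expression. The sketch below is written in Python for illustration only (the listed resource is a Java exercise), and the function name `master_rewards` and its parameter names are hypothetical.

```python
def master_rewards(java: int, c: int, music: int) -> bool:
    """Return True when either reward rule from the listing is satisfied."""
    rule_one = java > 90 and c > 80        # Java above 90 AND C above 80
    rule_two = java == 100 and music > 70  # Java exactly 100 AND music above 70
    return rule_one or rule_two


print(master_rewards(java=95, c=85, music=0))    # satisfies the first rule
print(master_rewards(java=100, c=60, music=75))  # satisfies the second rule
print(master_rewards(java=100, c=60, music=70))  # satisfies neither rule
```

Both comparisons are strict, so a music score of exactly 70 or a C score of exactly 80 does not qualify.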

code-and-dataset

Downloads: 0
Implementation of image cosegmentation using a color reward strategy and active contours. I ran this code and it worked correctly. The code is made freely available by its author, Fanman Meng.
Category: matlab
- Published: 2017-05-06
- File size: 1318559
- Uploader: seyyed

MDPgridworldExample

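Downloads: 0
The world consists of free spaces (0) and obstacles (1). Each turn the robot can move in one of 8 directions or stay in place. The reward function gives one free space, the goal location, a high reward. All other free spaces carry a small penalty, and obstacles carry a large negative reward. Value iteration is used to learn an optimal policy, i.e. a function that assigns a control input to every possible location.
Category: matlab
- Published: 2017-04-14
- File size: 3371
- Uploader: 莫文杰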
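
The value-iteration setup described in this listing can be sketched as follows. This is not the listed MATLAB code: it is a minimal Python illustration, and the map, the goal position, the reward values (10, -0.1, -50), the discount factor, the deterministic moves, and every function name are assumptions made for the example.

```python
# Hypothetical map (not from the listing): 0 = free space, 1 = obstacle.
GRID = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0],
]
ROWS, COLS = len(GRID), len(GRID[0])
GOAL = (0, 4)

# Assumed reward values and discount factor.
GOAL_REWARD, FREE_REWARD, OBSTACLE_REWARD = 10.0, -0.1, -50.0
GAMMA = 0.9

# The 8 compass moves plus (0, 0), which means "stay in place".
ACTIONS = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)]
CELLS = [(r, c) for r in range(ROWS) for c in range(COLS)]


def reward(cell):
    """Reward collected on arriving at (or staying in) a cell."""
    if cell == GOAL:
        return GOAL_REWARD
    return OBSTACLE_REWARD if GRID[cell[0]][cell[1]] == 1 else FREE_REWARD


def move(cell, action):
    """Deterministic transition; a move off the map leaves the robot in place."""
    r, c = cell[0] + action[0], cell[1] + action[1]
    return (r, c) if 0 <= r < ROWS and 0 <= c < COLS else cell


def action_value(values, cell, action):
    """One-step lookahead: immediate reward plus discounted next-cell value."""
    nxt = move(cell, action)
    return reward(nxt) + GAMMA * values[nxt]


def value_iteration(tol=1e-6):
    """Apply Bellman optimality backups until the largest change is below tol."""
    values = {cell: 0.0 for cell in CELLS}
    while True:
        new_values = {
            cell: max(action_value(values, cell, a) for a in ACTIONS)
            for cell in CELLS
        }
        delta = max(abs(new_values[cell] - values[cell]) for cell in CELLS)
        values = new_values
        if delta < tol:
            break
    # The policy maps every cell to the action with the best lookahead value.
    policy = {
        cell: max(ACTIONS, key=lambda a: action_value(values, cell, a))
        for cell in CELLS
    }
    return values, policy


def follow(policy, start, max_steps=20):
    """Roll the policy forward from start until the goal or the step limit."""
    path = [start]
    while path[-1] != GOAL and len(path) <= max_steps:
        path.append(move(path[-1], policy[path[-1]]))
    return path


values, policy = value_iteration()
print(follow(policy, (3, 0)))
```

In this sketch obstacles can be entered at a large cost rather than being impassable, which matches the listing's description of obstacles as cells with a large negative reward. The goal is not treated as terminal here, so the robot keeps collecting the goal reward by staying there.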

Reward

Downloads: 0
An MFC-based Double Color Ball (双色球) lottery number picker, developed with VC++ 6.0. The program uses a dialog box to pick red-ball and blue-ball numbers at random: clicking the "Start" button begins the selection, and clicking the "Stop" button displays the ball numbers in the dialog box.
Category: Compress-Decompress algorithms
- Published: 2017-12-13
- File size: 91010
- Uploader: 王小二

Q-Learning-master

Downloads: 0
Successfully implemented Q-Learning for a simple robot navigation problem: a robot moving on a 5 x 5 grid with one arbitrary goal (reward of +10) and three arbitrary obstacles (reward of -10).
Category: matlab examples
- Published: 2018-05-02
- File size: 4096
- Uploader: YH.HO
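
A tabular Q-learning agent for the kind of problem this listing describes might look like the sketch below. It is written in Python rather than MATLAB and is not the listed code. The 5 x 5 grid, the +10 goal, and the -10 obstacles come from the listing; the goal and obstacle positions, the start cell, the four-direction moves, the zero reward elsewhere, the rule that hitting an obstacle ends the episode, and the hyperparameters are all assumptions.

```python
import random

SIZE = 5
START = (0, 0)                                # assumed start cell
GOAL = (4, 4)                                 # assumed goal position
OBSTACLES = {(1, 1), (2, 3), (3, 1)}          # assumed obstacle positions
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.1         # assumed hyperparameters


def step(state, action):
    """Move one cell (clamped to the grid); return (next_state, reward, done)."""
    r = min(max(state[0] + action[0], 0), SIZE - 1)
    c = min(max(state[1] + action[1], 0), SIZE - 1)
    nxt = (r, c)
    if nxt == GOAL:
        return nxt, 10.0, True
    if nxt in OBSTACLES:
        return nxt, -10.0, True  # assumption: an obstacle ends the episode
    return nxt, 0.0, False


def choose_action(q, state, rng):
    """Epsilon-greedy selection with random tie-breaking among the best actions."""
    if rng.random() < EPSILON:
        return rng.randrange(len(ACTIONS))
    best = max(q[(state, a)] for a in range(len(ACTIONS)))
    return rng.choice([a for a in range(len(ACTIONS)) if q[(state, a)] == best])


def train(episodes=3000, max_steps=200, seed=0):
    """Learn a Q-table with the standard one-step Q-learning update."""
    rng = random.Random(seed)
    q = {((r, c), a): 0.0
         for r in range(SIZE) for c in range(SIZE)
         for a in range(len(ACTIONS))}
    for _ in range(episodes):
        state = START
        for _ in range(max_steps):
            a = choose_action(q, state, rng)
            nxt, reward, done = step(state, ACTIONS[a])
            best_next = 0.0 if done else max(
                q[(nxt, b)] for b in range(len(ACTIONS)))
            q[(state, a)] += ALPHA * (reward + GAMMA * best_next - q[(state, a)])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the highest-valued action from START until an episode ends."""
    path, state = [START], START
    for _ in range(max_steps):
        a = max(range(len(ACTIONS)), key=lambda b: q[(state, b)])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path


q_table = train()
print(greedy_path(q_table))
```

Random tie-breaking matters early in training: with an all-zero table, always taking the first action would pin the robot against a wall and leave exploration to the epsilon steps alone.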

DGP-IRL-master

Downloads: 0
We propose a new approach to inverse reinforcement learning (IRL) based on the deep Gaussian process (deep GP) model, which is capable of learning complicated reward structures with few demonstrations.
Category: matlab examples
- Published: 2019-10-08
- File size: 933424
- Uploader: QQLogin_A608381F55732A12
