1) reinforcement learning system
增强式学习系统
2) reinforcement learning
增强式学习
1.
However,reinforcement learning will take much time using the trial and error mechanism without the certain environment,and not satisfy the real-time request.
由于增强式学习不需要精确的环境模型 ,而是采用逐次逼近的机理 ,所以微直升机需要很大的计算开销 ,难以满足实时性要求。
2.
Each agent does not know the others structures of Q function and payoff using this multi-agent reinforcement learning without rigorous condition.
提出一种多智能体增强式学习方法 ,每个智能体在学习过程中将其他智能体和环境区分开来 ,并且通过维持其他智能体的替代传导径迹来预测它们的行为 ,从而也确定了自身的行为 。
3.
To solve the robot s path planning problem in dynamic environment, this paper applies adaptive learning to path planning based on reinforcement learning.
本文结合机器人路径规划问题介绍了增强式学习方法 ,实现了动态环境中基于增强式学习的自适应路径规划 。
3) connectionist reinforcement learning
连接增强式学习
1.
The robot needs to memorize the behaviors and states, at the same time the EMS memory space is not enough, the connectionist reinforcement learning can approximate the Q function with MLPs, and generalize the state space to economize the memory space.
机器人在学习过程中需要对行为状态进行记忆,连接增强式学习利用多层感知器逼近Q函数,泛化状态空间,节约了存储容量。
4) reinforcement learning algorihm
增强式学习算法
5) reinforcement learning
增强学习
1.
Parallel machines scheduling with reinforcement learning;
基于增强学习的平行机调度研究
2.
Optimized negotiation strategy based on reinforcement learning;
一种优化的基于增强学习协商策略
3.
A survey of direct policy search methods in reinforcement learning;
增强学习中的直接策略搜索方法综述
6) Reinforcement Learning
增强型学习
1.
Hybrid Intelligent Control for Ship Steering Based on Reinforcement Learning Algorithm;
基于增强型学习算法的船舶运动混合智能控制
2.
In this paper, a hierarchical reinforcement learning algorithm is investigated for Markov Decision Process with average reward.
对平均报酬型马氏决策过程 ,本文研究了一种递阶增强型学习算法 ;并将算法应用于一个两台机器组成的闭环可重入生产系统 ,计算机仿真结果表明 ,调度结果优于熟知的两种启发式调度策略 。
3.
A set of optimised fuzzy control rules can be automatically generated through reinforcement learning based on the state variables of object system.
该控制器能根据被控对象的状态通过增强型学习自动生成模糊控制规则 。
补充资料:增强
分子式:
CAS号:
性质:在分析化学中,引起测量信号增大的一种干扰现象。它造成分析试样表观浓度或含量高于待测物质的真实浓度或含量。通称为增强。
CAS号:
性质:在分析化学中,引起测量信号增大的一种干扰现象。它造成分析试样表观浓度或含量高于待测物质的真实浓度或含量。通称为增强。
说明:补充资料仅用于学习参考,请勿用于其它任何用途。
参考词条