梯度策略 (gradient strategy): pronunciation, translation, English example sentences


1) gradient strategy

梯度策略


2) policy gradient

策略梯度

Theories, Algorithms and Applications of Policy Gradient Reinforcement Learning

策略梯度增强学习的理论、算法及应用研究

The adaptive heuristic critic (AHC) reinforcement learning framework approximates the value function and the policy function of a Markov decision process (MDP) separately; policy gradient reinforcement learning can convert stochastic MDPs into deterministic MDPs.

自适应启发评价(AHC)增强学习结构分别逼近马尔可夫决策过程的值函数和策略函数,策略梯度增强学习能够将随机不确定的马尔可夫决策过程转换为确定性的马尔可夫决策过程。

Although policy gradient reinforcement learning (PGRL) has good convergence properties, the variance of policy gradient estimation in existing PGRL algorithms is usually large, which becomes a significant problem for policy gradient algorithms in theory and in practice.

尽管策略梯度强化学习算法有较好的收敛性,但是在梯度估计的过程中方差过大,却是该方法在理论和应用上的一个主要弱点。


3) gradient searching strategy

梯度搜索策略

A steady-state optimization method for chain-grate boilers based on a gradient searching strategy is presented to improve running efficiency and reduce fuel consumption.

提出了一个基于梯度搜索策略的链条炉稳态优化方法,优化锅炉运行效率,降低能耗。

4) policy-gradient algorithm

策略梯度算法

On the basis of partially observable Markov decision processes, two finite-memory policy-gradient algorithms, namely the model-based GAMP algorithm and the model-free IState-GPOMDP algorithm, were implemented and employed in the simulation of a robot walking in a maze.

通过分析仿真结果,对这两种算法引入了基于观测的优化;并发现在所给报酬函数下,策略梯度算法中的步长参数也在一定程度上影响着优化策略的效率。

5) echelon-stock policies

梯度库存策略

6) gradient instruction strategy

梯度指导策略

Supplementary material: Gâteaux gradient

Gâteaux梯度
Gâteaux gradient

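Gâteaux gradient of a functional $f$ at a point $x_0$ of a Hilbert space $H$: the vector in $H$ equal to the Gâteaux derivative $f'_G(x_0)$ of $f$ at $x_0$. In other words, the Gâteaux gradient is defined by the formula $f(x_0 + h) = f(x_0) + (f'_G(x_0), h) + \varepsilon(h)$, where $\varepsilon(th)/t \to 0$ as $t \to 0$. In $n$-dimensional Euclidean space the Gâteaux gradient $f'_G(x_0)$ is the vector with coordinates $\left(\frac{\partial f(x_0)}{\partial x_1}, \ldots, \frac{\partial f(x_0)}{\partial x_n}\right)$, and is simply called the gradient. The concept of the Gâteaux gradient can be extended to the case where $X$ is a (finite-dimensional) Riemannian manifold or an infinite-dimensional Hilbert manifold and $f$ is a smooth real function on $X$. The growth of $f$ in the direction of its Gâteaux gradient is greater than its growth in any other direction through that point. Written by V. M. Tikhomirov (В. М. Тихомиров); translated by Zheng Weixing (郑维行); proofread by Shen Yonghuan (沈永欢) and Wang Shengwang (王声望).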
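The defining formula $f(x_0 + h) = f(x_0) + (f'_G(x_0), h) + \varepsilon(h)$ can be checked numerically in Euclidean space. The following is a minimal sketch, not part of the original entry: the test function `f`, its hand-derived gradient `grad_f`, the helper `gateaux_derivative`, and the step size `t` are all hypothetical choices made for illustration. It compares a finite-difference directional derivative with the inner product of the gradient and the direction, and then probes the steepest-growth property with random unit directions.

```python
import numpy as np

def f(x):
    # Hypothetical smooth test function on R^3 (an assumption, not from the entry).
    return float(np.sum(x ** 2) + np.sin(x[0]))

def grad_f(x):
    # Analytic gradient of f: the vector of partial derivatives.
    g = 2.0 * x
    g[0] += np.cos(x[0])
    return g

def gateaux_derivative(func, x0, h, t=1e-6):
    # Central-difference estimate of d/dt func(x0 + t*h) at t = 0.
    return (func(x0 + t * h) - func(x0 - t * h)) / (2.0 * t)

x0 = np.array([0.5, -1.0, 2.0])
h = np.array([1.0, 2.0, -0.5])

# Directional derivative along h versus the inner product (grad f(x0), h).
numeric = gateaux_derivative(f, x0, h)
analytic = float(np.dot(grad_f(x0), h))

# Growth along the normalized gradient versus growth along random unit directions.
g = grad_f(x0)
best = gateaux_derivative(f, x0, g / np.linalg.norm(g))
rng = np.random.default_rng(0)
dirs = rng.normal(size=(100, 3))
dirs /= np.linalg.norm(dirs, axis=1, keepdims=True)
others = [gateaux_derivative(f, x0, d) for d in dirs]

print(numeric, analytic)
print(best, max(others))
```

Under these assumptions the two printed pairs should agree with the entry: `numeric` matches `analytic` up to finite-difference error, and no sampled unit direction gives faster growth than the gradient direction.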


Related entries

"四度"策略, 反梯度战略

策略梯度估计, 策略梯度优化算法, 决策梯度, 调度策略, 制度策略, 策略密度


©2011 dictall.com