1) semi-Markov decision process
半Markov决策过程
1.
For the problem of value iteration(VI) optimization of the compact action set in semi-Markov decision process,a unified standard VI algorithm directly based on the equivalent infinitesimal generator under both discount and average criteria was proposed with the proof of convergence.
针对半Markov决策过程在紧致行动集上的数值迭代优化,提出了折扣和平均准则下直接基于等价无穷小生成子的统一的标准数值迭代算法,并证明了其收敛性。
2.
Discrete event dynamic systems (DEDSs) are generally existing man-made systems in the real world, and semi-Markov decision process (SMDP) is one of the major models for such systems.
离散事件动态系统(DEDS)是实际生活中广泛存在的一类人造系统,而半Markov决策过程(SMDP)是这类系统建模的主要方法之一。
2) Semi-Markov decision processes
半Markov决策过程
1.
Relations between discounted models and average models for semi-Markov decision processes;
半Markov决策过程折扣模型与平均模型之间的关系
3) Markov decision processes
Markov决策过程
1.
In the optimal design and cootrol of preparative chromatographic processes,the obstacles appear when one tries to link the Wilson's framework of chromatographic theories based on partial differential equations(PDEs) with the Eulerian presentation to optimal control approaches based on discrete time states,such as Markov decision processes(MDP) or Model predictive control(MPC).
在制备色谱的优化设计和控制过程中,若试图把基于偏微分方程(PDE)-Eulerian描述的Wilson色谱理论框架和基于离散时间状态的优化控制方法(如Markov决策过程(MDP)和模型预测控制(MPC)等)衔接在一起时,就会出现明显的障碍。
2.
Based on continuous-time Markov decision processes, a power management policy optimization approach is proposed.
提出一种基于连续时间Markov决策过程的动态电源管理策略优化方法。
3.
An algorithm to optimize policy in call admission control is presented by using the method of Markov decision processes combined with performance potentials.
应用 Markov决策过程与性能势相结合的方法 ,给出了呼叫接入控制的策略优化算法。
4) Markov decision process
Markov决策过程
1.
An algorithm of reinforcement learning for finite-horizon Markov decision processes;
一种有限时段Markov决策过程的强化学习算法
2.
Based on the theory of Markov decision processes,the policy iteration and online policy iteration algorithms for performance optimization are provided.
在Markov决策过程理论的基础上,给出了关于性能指标的策略迭代和在线策略迭代算法,并通过实例仿真说明该方法的优越性。
3.
Markov Decision Process(MDP)model is the general frame for solving reinforcement learning problems.
Markov决策过程(MDP)模型是解决激励学习问题的通用方法,而动态规划方法是Agent在具有Markov环境下与策略相关的值函数学习算法。
6) Countable semi-Markov decision processes
可数半Markov决策过程
补充资料:购买决策过程
购买决策过程
process of purchasing decision making
购买决策过程(proeess of purehasingdecision making)广义的消费者购买决策过程见消费者行为过程。就狭义而言,可理解为消费者在购买产品时,把引起注意的备择商标产品同己确立的标准进行比较,根据比较结果,消费者必决定将一个与其己经确立的标准更接近的商标产品作为他最后购买的对象,这一过程即为购买的决策过程。 (张玉峰撰马谋超审)
说明:补充资料仅用于学习参考,请勿用于其它任何用途。
参考词条