欢迎访问《石家庄铁道大学学报》（自然科学版）网站！

基于Q-Learning的智能体训练

How to Train the Agent Based on the Way of Q-Learning

修订日期：2006-11-06

中文关键词:Q-Learning 模糊控制回报函数

英文关键词:Q-Learning fuzzy control reward function

基金项目:河北师范大学青年基金（L2004Q15）

赵增荣韩提文

[1]河北师范大学职业技术学院,河北石家庄050031 [2]河北工业职业技术学院成教部,河北石家庄050030

摘要点击次数: 1142

全文下载次数: 0

中文摘要:

针对机器人足球比赛的多智能体环境下智能体的训练问题,提出了一种将模糊控制与Q-Learning相结合的学习方法,并在学习过程中自动调节回报函数以获得最优策略,此方法的有效性在中型组的仿真平台上得到了验证,并取得了较好效果,还可将它改进应用于其他多智体环境。

英文摘要:

In order to solve the problem of agent training in the multi-agent circumstances of robot soccer, a new method of agent learning is put forward, which combines the fuzzy control with the Q-Learning. During the learning process, the reward function is cont

赵增荣,韩提文.基于Q-Learning的智能体训练[J].石家庄铁道大学学报(自然科学版),2007,(2):37-.

查看全文下载PDF阅读器

关闭