• Sorted by Date • Classified by Publication Type • Sorted by First Author Last Name • Classified by Topic •
Çetin Meriçli, Tekin Meriçli, and H. Levent Akın. A Reward Function Generation Method Using Genetic Algorithms: A Robot Soccer Case Study. In 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010.
Immediate rewards play a key role in a reinforcement learning (RL) scenario as they help the system deal with the credit assignment problem. Therefore, reward function definition has a drastic effect on both how fast the system learns and to what policy it converges. It becomes even more important in case of multi-agent learning, where the state space usually gets even bigger. We propose a Genetic Algorithms (GA) based reward function shaping method for multi-robot learning problems and evaluate its performance in a robot soccer case study. A set of metrics calculated from the positions of the players and the ball on the field are used as the primitive building blocks of an immediate reward function, which is defined as a weighted combination of these metrics obtained using GA, yielding a significantly better soccer playing performance.
@InProceedings{MericliAAMAS2010, author = {Çetin Meriçli and Tekin Meriçli and H. Levent Akın}, title = {A Reward Function Generation Method Using Genetic Algorithms: A Robot Soccer Case Study}, booktitle = {9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010)}, year = {2010}, abstract = {Immediate rewards play a key role in a reinforcement learning (RL) scenario as they help the system deal with the credit assignment problem. Therefore, reward function definition has a drastic effect on both how fast the system learns and to what policy it converges. It becomes even more important in case of multi-agent learning, where the state space usually gets even bigger. We propose a Genetic Algorithms (GA) based reward function shaping method for multi-robot learning problems and evaluate its performance in a robot soccer case study. A set of metrics calculated from the positions of the players and the ball on the field are used as the primitive building blocks of an immediate reward function, which is defined as a weighted combination of these metrics obtained using GA, yielding a significantly better soccer playing performance.} bib2html_pubtype = {Refereed Conference}, bib2html_rescat = {Multi-robot Planning}, bib2html_dl_pdf = {../files/cmericliAAMAS2010RewardShapingExtendedAbstract.pdf}, }
Generated by bib2html.pl (written by Patrick Riley ) on Tue Nov 07, 2017 00:34:04