Kagan Tumer: <b>Time-Extended Policies in Multiagent Reinforcement Learning</b>

Kagan Tumer's Publications

Display Publications by [Year] [Type] [Topic]

Time-Extended Policies in Multiagent Reinforcement Learning. K. Tumer and A. Agogino. In Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, pp. 1336–1337, New York, NY, July 2004.

Abstract

Many algorithms such as Q-learning successfully address reinforcement learning in single-agent multi-time-step problems. In addition there are methods that address reinforcement learning in multi-agent single-time-step problems. However, unmodified single-agent multi-time-step methods and multi-agent single-time-step methods cannot necessarily be combined to solve multi-agent multi-time-step problems due to strong coupling between multi-agent interactions between time steps. Rewards that result in multi-agent collaboration for a single time-step may result in poor collaboration in future time-steps. This paper shows how to avoid this problem.

Download

[PDF]72.2kB

BibTeX Entry

@inproceedings{tumer-agogino_marl_aamas04,
	author = {K. Tumer and A. Agogino},
	title = {Time-Extended Policies in Multiagent Reinforcement Learning},
	booktitle = {Proceedings of the Third International Joint Conference on
		Autonomous Agents and Multiagent Systems},
	pages = {1336-1337},
	month = {July},
	address = {New York, NY},
	abstract = {Many algorithms such as Q-learning successfully address reinforcement learning in single-agent multi-time-step problems. In addition there are methods that address reinforcement learning in multi-agent single-time-step problems. However, unmodified single-agent multi-time-step methods and multi-agent single-time-step methods cannot necessarily be combined to solve multi-agent multi-time-step problems due to strong coupling between multi-agent interactions between time steps. Rewards that result in multi-agent collaboration for a single time-step may result in poor collaboration in future time-steps. This paper shows how to avoid this problem.},
	bib2html_pubtype = {Refereed Conference Papers},
	bib2html_rescat = {Multiagent Systems, Reinforcement Learning},
	year = {2004}
}

Generated by bib2html.pl (written by Patrick Riley ) on Wed Apr 01, 2020 17:39:43