Kagan Tumer: <b>Memory-Augmented Multi-Robot Teams that Learn to Adapt</b>

Kagan Tumer's Publications

Display Publications by [Year] [Type] [Topic]

Memory-Augmented Multi-Robot Teams that Learn to Adapt. S. Khadka, J. J. Chung, and K. Tumer. In Proceedings of the International Symposium on Multi-Robot and Multi-Agent Systems (MRS'17), pp. , Los Angeles, CA, December 2017.

Abstract

As multi-robot teams increasingly permeate real world environments like factories, homes and other extraterrestrial surfaces, the ability to form joint strategies that can effectively adapt to new observations and changes in teammatesÕ policy becomes more vital. Memory is a crucial part of adaptive behavior since it is the mechanism by which knowledge from past observations can be used to modify future behavior. Previous works have studied adaptive behaviors primarily with the tools of memory but has largely been limited to single robot approaches. In this paper, we formulate the Extended T-Maze domain that marries the difficulties of a task requiring explicit adaptive behavior, to the complexities introduced by concurrent actions of multiple robots acting in the same environment. We develop a memory-based learning approach using Gated Recurrent units with Memory Block as robot policies, and demonstrate its efficacy in diverse sets of experiments in the Extended T-Maze domain that vary across multiple axes of difficulty including team size and task depth. Our results show that the memory-based multi-robot controllers are able to form effective joint strategies significantly outperforming feedforward controllers trained without memory.

Download

[PDF]457.0kB

BibTeX Entry

@InProceedings{tumer-khadka_mrs17,
author = {S. Khadka and J. J. Chung and K. Tumer},
title = {Memory-Augmented Multi-Robot Teams that Learn to Adapt},
booktitle = {Proceedings of the International Symposium on Multi-Robot and Multi-Agent Systems (MRS'17)},
address = {Los Angeles, CA},
month = {December},
 pages={},
 abstract={As multi-robot teams increasingly permeate real world environments like factories, homes and other extraterrestrial surfaces, the ability to form joint strategies that can effectively adapt to new observations and changes in teammatesÕ policy becomes more vital. Memory is a crucial part of adaptive behavior since it is the mechanism by which knowledge from past observations can be used to modify future behavior. Previous works have studied adaptive behaviors primarily with the tools of memory but has largely been limited to single robot approaches. In this paper, we formulate the Extended T-Maze domain that marries the difficulties of a task requiring explicit adaptive behavior, to the complexities introduced by concurrent actions of multiple robots acting in the same environment. We develop a memory-based learning approach using Gated Recurrent units with Memory Block as robot policies, and demonstrate its efficacy in diverse sets of experiments in the Extended T-Maze domain that vary across multiple axes of difficulty including team size and task depth. Our results show that the memory-based multi-robot controllers are able to form effective joint strategies significantly outperforming feedforward controllers trained without memory.},
	bib2html_pubtype = {Refereed Conference Papers},
	bib2html_rescat = {Multiagent Systems, Robotics},
year = {2017}
}

Generated by bib2html.pl (written by Patrick Riley ) on Wed Apr 01, 2020 17:39:43