Kagan Tumer's Publications



Neuroevolution of a Modular Memory-Augmented Neural Network for Deep Memory Problems. S. Khadka, J. J. Chung, and K. Tumer. Evolutionary Computation, 2020.

Abstract

We present Modular Memory Units (MMUs), a new class of memory-augmented neural network. The MMU builds on the gated neural architectures of Gated Recurrent Units (GRUs) and Long Short-Term Memory (LSTM) networks, incorporating an external memory block similar to that of a Neural Turing Machine (NTM). The MMU interacts with the memory block through independent read and write gates that decouple the memory from the central feedforward operation. This allows for regimented memory access and update, giving the network the ability to choose when to read from memory, update it, or simply ignore it. This capacity to act independently of its memory lets the network shield the memory from noise and other distractions, while still using it to retain and propagate information over extended periods of time. We train the MMU using both neuroevolution and gradient descent, and perform experiments on two deep memory benchmarks. Results demonstrate that the MMU performs significantly faster and more accurately than traditional LSTM-based methods, and is robust to dramatic increases in the sequence depth of these memory benchmarks.
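To make the gating scheme in the abstract concrete, the following is a minimal, illustrative Python sketch of a recurrent cell with an external memory vector and independent read and write gates. The equations, parameter shapes, and names (e.g., ToyMMUCell) are assumptions chosen for exposition; they are not the architecture, equations, or code from the paper.

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class ToyMMUCell:
    """Toy recurrent cell with an external memory vector and separate
    read/write gates (illustrative sketch only, not the paper's model)."""

    def __init__(self, input_size, hidden_size, memory_size, seed=0):
        rng = np.random.default_rng(seed)
        def w(rows, cols):
            return rng.normal(0.0, 0.1, size=(rows, cols))
        self.W_h = w(hidden_size, input_size + memory_size)  # central feedforward path
        self.W_r = w(memory_size, input_size + memory_size)  # read gate
        self.W_w = w(memory_size, input_size + hidden_size)  # write gate
        self.W_c = w(memory_size, hidden_size)                # candidate memory content

    def step(self, x, memory):
        # Gate what is read from the external memory block.
        read_gate = sigmoid(self.W_r @ np.concatenate([x, memory]))
        read_vec = read_gate * memory
        # Central feedforward operation, fed by the gated read rather than raw memory.
        hidden = np.tanh(self.W_h @ np.concatenate([x, read_vec]))
        # Gate the write: memory can be overwritten, partially updated, or left untouched.
        write_gate = sigmoid(self.W_w @ np.concatenate([x, hidden]))
        candidate = np.tanh(self.W_c @ hidden)
        new_memory = (1.0 - write_gate) * memory + write_gate * candidate
        return hidden, new_memory

# Example usage (arbitrary sizes and random inputs, for illustration only):
cell = ToyMMUCell(input_size=4, hidden_size=8, memory_size=6)
mem = np.zeros(6)
for x in np.random.default_rng(1).normal(size=(5, 4)):
    h, mem = cell.step(x, mem)

The point the sketch illustrates is the decoupling described above: the hidden update sees memory only through the read gate, and memory changes only through the write gate, so on any given timestep the network can draw on, update, or simply ignore its memory.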

Download

(unavailable)

BibTeX Entry

@article{tumer-khadka_ecj20,
  author = {S. Khadka and J. J. Chung and K. Tumer},
  title = {Neuroevolution of a Modular Memory-Augmented Neural Network for Deep Memory Problems},
  journal = {Evolutionary Computation},
  bib2html_pubtype = {Journal Articles},
  bib2html_rescat = {Evolutionary Algorithms},
  abstract = {We present Modular Memory Units (MMUs), a new class of memory-augmented neural network. The MMU builds on the gated neural architectures of Gated Recurrent Units (GRUs) and Long Short-Term Memory (LSTM) networks, incorporating an external memory block similar to that of a Neural Turing Machine (NTM). The MMU interacts with the memory block through independent read and write gates that decouple the memory from the central feedforward operation. This allows for regimented memory access and update, giving the network the ability to choose when to read from memory, update it, or simply ignore it. This capacity to act independently of its memory lets the network shield the memory from noise and other distractions, while still using it to retain and propagate information over extended periods of time. We train the MMU using both neuroevolution and gradient descent, and perform experiments on two deep memory benchmarks. Results demonstrate that the MMU performs significantly faster and more accurately than traditional LSTM-based methods, and is robust to dramatic increases in the sequence depth of these memory benchmarks.},
  year = {2020}
}

Generated by bib2html.pl (written by Patrick Riley) on Wed Apr 01, 2020 17:39:43