Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks

Campos V, Jou B, Giró-i-Nieto X, Torres J, Chang S-F. Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks. In NIPS Time Series Workshop 2017. Long Beach, CA, USA; 2017.

(427.72 KB)

Abstract

Recurrent Neural Networks (RNNs) continue to show outstanding performance in sequence modeling tasks. However, training RNNs on long sequences often face challenges like slow inference, vanishing gradients and difficulty in capturing long term dependencies. In backpropagation through time settings, these issues are tightly coupled with the large, sequential computational graph resulting from unfolding the RNN in time. We introduce the Skip RNN model which extends existing RNN models by learning to skip state updates and shortens the effective size of the computational graph. This model can also be encouraged to perform fewer state updates through a budget constraint. We evaluate the proposed model on various tasks and show how it can reduce the number of required RNN updates while preserving, and sometimes even improving, the performance of the baseline RNN models.

Project page with source code
Preprint on arXiv
Short version (4 pages) presented at the NIPS Time Series Workshop 2017

Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks from Xavier Giro-i-Nieto

Projects

	Deep learning
	MALEGRA - Multimodal Signal Processing and Machine Learning on Graphs

Image Processing Group

Search form

User login

Abstract

Projects