Folded recurrent neural networks for future video prediction

Oliu Simon, Marc; Selva, Javier; Escalera, Sergio

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10609/123286

Título :	Folded recurrent neural networks for future video prediction
Autoría:	Oliu Simon, Marc Selva, Javier Escalera, Sergio
Otros:	Universitat Oberta de Catalunya (UOC) Universitat de Barcelona (UB)
Citación :	Oliu-Simón, M., Selva, J. & Escalera Guerrero, S. (2018). Folded recurrent neural networks for future video prediction. Lecture Notes in Computer Science, 11218, 745-761. doi: 10.1007/978-3-030-01264-9_44
Resumen :	This work introduces double-mapping Gated Recurrent Units (dGRU), an extension of standard GRUs where the input is considered as a recurrent state. An extra set of logic gates is added to update the input given the output. Stacking multiple such layers results in a recurrent auto-encoder: the operators updating the outputs comprise the encoder, while the ones updating the inputs form the decoder. Since the states are shared between corresponding encoder and decoder layers, the representation is stratified during learning: some information is not passed to the next layers. We test our model on future video prediction. Main challenges for this task include high variability in videos, temporal propagation of errors, and non-specificity of future frames. We show how only the encoder or decoder needs to be applied for encoding or prediction. This reduces the computational cost and avoids re-encoding predictions when generating multiple frames, mitigating error propagation. Furthermore, it is possible to remove layers from a trained model, giving an insight to the role of each layer. Our approach improves state of the art results on MMNIST and UCF101, being competitive on KTH with 2 and 3 times less memory usage and computational cost than the best scored approach.
Palabras clave :	predicción de video futuro aprendizaje no supervisado redes neuronales recurrentes
DOI:	10.1007/978-3-030-01264-9_44
Tipo de documento:	info:eu-repo/semantics/conferenceObject
Fecha de publicación :	9-oct-2018
Aparece en las colecciones:	Articles

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
1712.00311.pdf		2,05 MB	Adobe PDF	Visualizar/Abrir

Mostrar el registro completo del ítem

Comparte:

Impacto:

Google Scholar

Microsoft Academic

Exporta:

Consulta las estadísticas