Please use this identifier to cite or link to this item:

http://hdl.handle.net/10609/100446
Title: Deep learning for image captioning: an encoder-decoder architecture with soft attention
Author: Gómez Martínez, Mario
Director: Casas Roma, Jordi
Tutor: Bosch Rue, Anna
Keywords: image captioning
deep learning
artificial neural networks
Issue Date: 28-Jun-2019
Publisher: Universitat Oberta de Catalunya (UOC)
Abstract: Automatic image captioning, the task of automatically producing a natural-language description for an image, has the potential to assist those with visual impairments by explaining images using text-to-speech systems. However, accurate image captioning is a challenging task that requires integrating and pushing further the latest improvements at the intersection of computer vision and natural language processing fields This work aims at building an advanced model based on neural networks and deep learning for the automated generation of image captions.
Language: English
URI: http://hdl.handle.net/10609/100446
Appears in Collections:Bachelor thesis, research projects, etc.

Share:
Export:
Files in This Item:
File Description SizeFormat 
magomarTFM0619memory.pdfMemory of TFM7.85 MBAdobe PDFView/Open

This item is licensed under a Creative Commons License Creative Commons