Implementation of a spoken language system

Perez Guijarro, Jessica

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10609/81437

Registro completo de metadatos

Campo DC	Valor	Lengua/Idioma
dc.contributor.author	Perez Guijarro, Jessica	-
dc.contributor.other	Universitat Oberta de Catalunya	-
dc.contributor.other	Ventura, Carles	-
dc.date.accessioned	2018-06-27T19:52:25Z	-
dc.date.available	2018-06-27T19:52:25Z	-
dc.date.issued	2018-06-05	-
dc.identifier.uri	http://hdl.handle.net/10609/81437	-
dc.description.abstract	This project consists of a implementation of a Spoken Language System that is part of the dialogue systems like Siri. The system is constituted by two independent blocks: Automatic Speech Recognition, in charge of identifying who is verbalizing the user and transforming a text, and the Spoken Language System, in charge of reading the text, identifying the significant parts of said text. Each of the compo- nents has been trained with different techniques and datasets since it does not share a common goal. In particular, for the development of the ASR module we have worked with a previously selected subset of the VoxForge English dataset, whose data has been trained using Hidden Markov Models for gener- ate the Acoustic Model. On the other hand, for the development of the SLU module we have worked with Recurrent Neural Networks and a variant of the ATIS dataset previously trained with the Word Embedding method. Although the precision obtained in all the components is more than acceptable, the performance of the integration of both components results unstable.	en
dc.description.abstract	Este proyecto consiste en la implementación de un sistema de lenguaje que forma parte de los sistemas de diálogo como Siri. El sistema está constituido por dos bloques independientes: Automatic Speech Recognition, encargado de identificar aquello que está verbalizando el usuario y transformalo a texto, y el Spoken Language System, encargado de dotar comprensión al texto, identificando las partes significativas de dicho texto. Cada uno de los componentes se ha entrenado con técnicas y datasets distintos ya que, no comparten un objetivo común. En concreto, para el desarrollo del módulo ASR se ha trabajado con un subset previamente seleccionado del dataset VoxForge English, cuyos datos han sido entrenados mediante Hidden Markov Models. Por otro lado, para el desarrollo del módulo SLU se ha trabajado con Redes Neuronales Recurrentes y un variante del dataset ATIS previamente entrenado con el método Word Embedding. Pese a que la precisión obtenida en los cada uno de los es más que aceptable, el funcionamiento de la integración de ambos resulta inestable.	es
dc.description.abstract	Aquest projecte consisteix en la implementació d'un sistema de llenguatge que forma part dels sistemes de diàleg com Siri. El sistema està constituït per dos blocs independents: Automatic Speech Recognition, encarregat d'identificar allò que està verbalizando l'usuari i transformalo a text, i el Spoken Language System, encarregat de dotar comprensió al text, identificant les parts significatives d'aquest text. Cadascun dels components s'ha entrenat amb tècniques i datasets diferents ja que, no comparteixen un objectiu comú. En concret, per al desenvolupament del mòdul ASR s'ha treballat amb un subset prèviament seleccionat del dataset VoxForge English, les dades del qual han estat entrenats mitjançant Hidden Markov Models. D'altra banda, per al desenvolupament del mòdul SLU s'ha treballat amb Xarxes Neuronals Recurrents i un variant del dataset ATIS prèviament entrenat amb el mètode Word Embedding. Malgrat que la precisió obtinguda en els cadascun dels és més que acceptable, el funcionament de la integració de tots dos resulta inestable.	ca
dc.language.iso	eng	-
dc.publisher	Universitat Oberta de Catalunya	-
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/	-
dc.subject	automatic speech recognition	en
dc.subject	spoken language understanding	en
dc.subject	recurrent neural network	en
dc.subject	redes neuronales recurrentes	es
dc.subject	xarxes neuronals recurrents	ca
dc.subject	reconocimiento automático del habla	es
dc.subject	reconeixement automàtic de la parla	ca
dc.subject	comprensió de la llengua oral	ca
dc.subject.lcsh	Artificial intelligence -- TFG	en
dc.title	Implementation of a spoken language system	-
dc.type	info:eu-repo/semantics/bachelorThesis	-
dc.audience.educationlevel	Estudis de Grau	ca
dc.audience.educationlevel	Estudios de Grado	es
dc.audience.educationlevel	University degrees	en
dc.subject.lemac	Intel·ligència artificial -- TFG	ca
dc.subject.lcshes	Inteligencia artificial -- TFG	es
dc.contributor.tutor	Isern, David	-
Aparece en las colecciones:	Bachelor thesis, research projects, etc.

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
TFG Presentacio¿n.mov		576,73 MB	Video Quicktime	Visualizar/Abrir
jperezguijaTFG0618memoria.pdf	Memòria del TFG	913,56 kB	Adobe PDF	Visualizar/Abrir
jperezguijaTFG0618presentación.pdf	Presentación del TFG	462,72 kB	Adobe PDF	Visualizar/Abrir