Reducción de ruido en señales de audio basada en una red neuronal convolucional

Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/97227

Title:	Reducción de ruido en señales de audio basada en una red neuronal convolucional
Author:	López Mora, Adrián
Tutor:	Meler Corretjé, Lourdes
Others:	García-Solórzano, David
Abstract:	This project describes a speech enhancement system implementation based on a Convolutional Neural Network (CNN). A feature transform module computes the STFT and extracts spectral phase and magnitude from the speech signal. The CNN maps the spectrum magnitude of an input noisy speech signal to an output enhanced spectrum. A reconstruction module computes inverse STFT to recover the speech enhanced audio signal. Mozilla Common Voice database, in its Catalan corpus version, is used to perform training and testing. Noisy audio samples are obtained adding AWGN with 0 dB SNR to clean speech signals. PESQ and STOI objective metrics are used to measure system performance. System evaluation shows positive results when using SNR levels as in training, while overall intelligibility deteriorates when using higher SNR levels due to phase distortion.
Keywords:	speech enhancement audio CNN
Document type:	info:eu-repo/semantics/bachelorThesis
Issue Date:	Jun-2019
Publication license:	http://creativecommons.org/licenses/by-nc-nd/3.0/es/
Appears in Collections:	Bachelor thesis, research projects, etc.

Files in This Item:

File	Description	Size	Format
alopezmoraPresentaciónTFG0619.mp4	Presentación del TFG	26,12 MB	MP4	View/Open
alopezmoraCódigoTFG0619.zip	Código implementado	268,23 kB	ZIP	View/Open
alopezmoraTFG0619memoria.pdf		1,35 MB	Adobe PDF	View/Open

Show full item record

Share:

Impact:

Microsoft Academic

Export:

View statistics

This item is licensed under a Creative Commons License