Empreu aquest identificador per citar o enllaçar aquest ítem: http://hdl.handle.net/10609/148916
Títol: Using deep learning for sound classification in citizen science: a practical approach with soundless
Autoria: Castelló Tejera, David
Tutor: Garcia Lopez, Pedro  
Resum: The field of Deep Learning has experienced tremendous growth in recent years, sparking interest among users and researchers. However, deploying Deep Learning models in real-world projects presents significant technical challenges. This Master's Thesis provides a practical approach to designing and constructing a custom Deep Learning model for audio classification, intended for use within the Soundless project—a citizen science platform investigating noise pollution and its impact on human health. The primary objective is to construct a custom model deployable within the Android application of the Soundless project. Different model architectures are explored considering the complexity constraints of deploying Deep Learning models on the edge. The model is built using the TensorFlow framework. Evaluated against the ESC-50 benchmark, the model demonstrates prediction accuracies of over 86%. The model is then integrated into an Android app prototype for testing. A custom dataset is constructed, termed NBAC, comprising 780 audio samples covering 13 distinct classes. NBAC is designed to be aligned with the acoustic context of the Soundless project. The model's performance on NBAC achieves over 90% accuracy. Further, this work investigates various implementation alternatives for utilizing and enhancing the model in a production environment. A centralized improvement approach is proposed, which entails locally storing labeled feature representations of audio samples and training a classifier. Alternatively, a decentralized improvement approach is formulated using Federated Learning. Both strategies, leveraging the custom-designed models, yield promising outcomes. They not only preserve the anticipated accuracies but also facilitate the desired enhancements.
Paraules clau: deep learning
federated learning
TensorFlow
ESC-50
deployment on Android
citizen science
audio classification
Tipus de document: info:eu-repo/semantics/masterThesis
Data de publicació: 1-set-2023
Llicència de publicació: http://creativecommons.org/licenses/by-nc-nd/3.0/es/  
Apareix a les col·leccions:Bachelor thesis, research projects, etc.

Arxius per aquest ítem:
Arxiu Descripció MidaFormat 
dcastellotFMDPreport.pdfFMDP report3,19 MBAdobe PDFThumbnail
Veure/Obrir
Comparteix:
Exporta:
Consulta les estadístiques

Aquest ítem està subjecte a una llicència de Creative CommonsLlicència Creative Commons Creative Commons