Please use this identifier to cite or link to this item:
http://hdl.handle.net/10609/150950
Title: | Tècniques de Deep Learning per reconeixement i classificació d'àudio |
Author: | Garriga Muñoz, Jordi |
Tutor: | Sanchez, Friman Moya-Alcover, Gabriel |
Others: | Acedo Nadal, Susana |
Abstract: | The world of Artificial Intelligence is advancing faster and every day we see innovations in all its facets, especially in the processing and generation of images and video. Current deep learning techniques such as Convolutional Neural Networks or transformer models have enabled great advancements never thought of before. This work aims to introduce the reader to the use of these techniques adapted to sound and audio processing. To do this, a specific field such as sound recognition and classification has been chosen. The work is divided in two parts: on one hand, the basic theoretical concepts of this discipline are introduced: sound characteristics and analog-to-digital conversion methods, processing, and subsequent treatment. It also explains the applications currently available on the market and the different studies and research carried out by various researchers in this field. On the other hand, it aims to show a specific practical case of the use of deep learning for sound classification. Through the creation of a properly trained model from a dataset with a large number of sound references, it should be able to identify and classify sound fragments of the same type with the highest level of accuracy possible. To do this, a combination of various techniques will be used, encompassed within a theoretical concept known as CLAP (Contrastive Language Audio Processing), which uses CNNs to process sound fragments from the training set along with text labels describing the sound contained in the fragment. |
Keywords: | deep learning classificació d'àudio CNN |
Document type: | info:eu-repo/semantics/bachelorThesis |
Issue Date: | 28-Jun-2024 |
Publication license: | http://creativecommons.org/licenses/by/3.0/es/ |
Appears in Collections: | Bachelor thesis, research projects, etc. |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
jgarrigamoonTFG0624memoria.pdf | Memòria del TFG | 2,17 MB | Adobe PDF | View/Open |
jgarrigamoonTFG0624presentacio.mp4 | Vídeo de la presentació del TFG | 541,67 MB | MP4 | View/Open |
Share:
This item is licensed under aCreative Commons License