Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/106366
Title: Generació i anàlisi d'un model per relacionar el microbioma humà i dades clíniques amb malalties autoimmunitàries
Author: Canet Carbó, Joan
Tutor: Paytuví Gallart, Andreu
Others: Prados Carrasco, Ferran  
Abstract: Different taxonomic compositions of the gut microbiome have been related to some diseases, such as diabetes or Crohn's disease. In this project, the microbiological composition of fecal samples and clinical variables -such as age or body mass index- associated to a big number of subjects have been described. Different models have been generated using Machine Learning algorithms, such as Random Forest, Support Vector Machine and XGBoost, to predict whether a subject has developed, or not, any autoimmune disease, using its gut taxonomic composition and some clinical variables. The obtained results in the taxonomic description do not show very differentiated enterotypes between the samples. Most of the categorical clinical variables do not follow a balanced distribution of their levels. The analyzed numerical clinical variables¿ distribution does follow approximately a normal distribution. The best classifier model has been obtained using a sampling method called SMOTE to generate the training set and using the XGBoost algorithm, obtaining a Kappa statistic value of 0.6612. This value is considered to have a substantial adequacy to the real data. The Bifidobacterium genus has been the one that has contributed the most to the model performance. In conclusion, the samples could not be classified into very differentiated enterotypes; however, a model to predict whether a subject has developed, or not, an autoimmune disease has been generated, using gut microbiome data and clinical variables, giving a substantial adequacy to the real data.
Keywords: machine learning
autoimmune diseases
human microbiome
Document type: info:eu-repo/semantics/masterThesis
Issue Date: 3-Jan-2020
Publication license: http://creativecommons.org/licenses/by-nc-nd/3.0/es/  
Appears in Collections:Trabajos finales de carrera, trabajos de investigación, etc.

Files in This Item:
File Description SizeFormat 
jcanetcarboTFM0120memòria.pdfMemòria del TFM2,19 MBAdobe PDFThumbnail
View/Open