Please use this identifier to cite or link to this item:
http://hdl.handle.net/10609/97387
Title: | Evaluación y comparación de métodos de ensamblaje y binning a partir de datos metagenómicos reales |
Author: | Vergara Gómez, Andrea |
Tutor: | Guillén Montalbán, Yolanda |
Others: | Canovas Izquierdo, Javier Luis |
Abstract: | Thanks to the next generation sequencing it is possible to analyze the genes of all the microorganisms in a sample (metagenomics), without the need to cultivate them. The analysis of shotgun data represents a great challenge. Grouping sequences from different metagenomic species based on external references means that many sequences will remain unassigned, so it seems more appropriate to use the reference independent methods (binning). The objective of this study was to compare two assemblers and two binning methods with real metagenomic data. The de-novo assembly of trimmed reads was performed with two assemblers: MEGAHIT and MetaSPAdes. The performance of these assemblies was analyzed with QUAST. A catalog of unique genes was generated from the contigs and binning with Canopy and MetaBAT2 was carried out. The performance of the binning was evaluated with CheckM. A cluster of supercomputers was used and, whenever possible, jobs were executed in parallel, in order to optimize time of analysis. Regarding the assembly, better results were obtained using MetaSPAdes than MEGAHIT. Regarding the binning, Canopy generated many more bins than MetaBAT2, but the visualization of the bins showed that the results were suboptimal for both. Working in a cluster of PCs allows you to save analysis time and optimize resources. According to these data, new approaches are necessary to achieve better results: the single-sample strategy based on contigs, using complete contigs instead of genes and testing the result of multiple co-assembly for several samples. |
Keywords: | metagenomics assembly binning |
Document type: | info:eu-repo/semantics/masterThesis |
Issue Date: | 4-Jun-2019 |
Publication license: | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
Appears in Collections: | Trabajos finales de carrera, trabajos de investigación, etc. |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
avergaragoTFM0619memoria.pdf | Memoria del TFM | 2,12 MB | Unknown | View/Open |
Share:
This item is licensed under a Creative Commons License