TY - GEN
T1 - Comparative Analysis of de Bruijn Graph Parallel Genome Assemblers
AU - Gamboa-Venegas, Carlos
AU - Meneses, Esteban
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/9/12
Y1 - 2018/9/12
N2 - Finding the genome of new species remains as one of the most crucial tasks in molecular biology. To achieve that end, de novo sequence assembly feeds from the vast amount of data provided by Next-Generation Sequencing technology. Therefore, genome assemblers demand a high amount of computational resources, and parallel implementations of those assemblers are readily available. This paper presents a comparison of three well-known de novo genome assemblers: Velvet, ABySS and SOAPdenovo, all of them using de Bruijn graphs and having a parallel implementation. We based our analysis on parallel execution time, scalability, quality of assembly, and sensitivity to the choice of a critical parameter (k- mer size). We found one of the tools clearly stands out for providing faster execution time and better quality in the output. Also, all assemblers are mildly sensitive to the choice of k-mer size and they all show limited scalability. We expect the findings of this paper provide a guide to the development of new algorithms and tools for scalable parallel genome sequence assemblers.
AB - Finding the genome of new species remains as one of the most crucial tasks in molecular biology. To achieve that end, de novo sequence assembly feeds from the vast amount of data provided by Next-Generation Sequencing technology. Therefore, genome assemblers demand a high amount of computational resources, and parallel implementations of those assemblers are readily available. This paper presents a comparison of three well-known de novo genome assemblers: Velvet, ABySS and SOAPdenovo, all of them using de Bruijn graphs and having a parallel implementation. We based our analysis on parallel execution time, scalability, quality of assembly, and sensitivity to the choice of a critical parameter (k- mer size). We found one of the tools clearly stands out for providing faster execution time and better quality in the output. Also, all assemblers are mildly sensitive to the choice of k-mer size and they all show limited scalability. We expect the findings of this paper provide a guide to the development of new algorithms and tools for scalable parallel genome sequence assemblers.
KW - De Bruijn graph
KW - De novo
KW - Genome assembly
KW - Parallel performance
KW - Scalability
UR - http://www.scopus.com/inward/record.url?scp=85054500574&partnerID=8YFLogxK
U2 - 10.1109/IWOBI.2018.8464194
DO - 10.1109/IWOBI.2018.8464194
M3 - Contribución a la conferencia
AN - SCOPUS:85054500574
SN - 9781538675069
T3 - 2018 IEEE International Work Conference on Bioinspired Intelligence, IWOBI 2018 - Proceedings
BT - 2018 IEEE International Work Conference on Bioinspired Intelligence, IWOBI 2018 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 IEEE International Work Conference on Bioinspired Intelligence, IWOBI 2018
Y2 - 18 July 2018 through 20 July 2018
ER -