Overlap Graph Reduction for Genome Assembly using Apache Spark

Published: 01 Jan 2017, Last Modified: 22 Jul 2024BCB 2017EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: The advent of third-generation long-range DNA sequencing and mapping techniques has permitted nearly perfect or very high quality de novo assemblies of genomes. However, most overlap graph de novo assemblers still require large amounts of computer memory to resolve the large genome graphs. Here, we apply string graph reduction algorithms for genome assembly using Apache Spark on a distributed cloud computing platform.
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview