Abstract: Highlights
• The hash index is memory efficient.
• We designed and implemented asynchronous I/O and computation.
• Dynamic task scheduling and asynchronous data transfer achieve a better offload balance than static scheduling.
• A vectorized version of the banded Myers algorithm was implemented on the SW26010 processor.
• A distributed version of FMapper was developed for heterogeneous compute nodes.