Abstract: Highlights•A micro-benchmark suite is developed to illuminate uncharted areas of the SW26010.•The micro-architecture of the SW26010 is revealed by the benchmark results.•The key programming challenge of the SW26010 is identified with the roofline model.•A systematic guideline for performance optimizations on the SW26010 is proposed.