---
name: Bug report
about: Create a report to help us improve
title: ''
labels: ''
assignees: ''

---

**Describe the bug**
A clear and concise description of what the bug is.

**To Reproduce**
Steps to reproduce the behavior:
1. Compile with "..."
2. Run "..." with "..." processes on "..." nodes

**Expected behavior**
A clear and concise description of what you expected to happen.

**Logs**
If applicable, attach logs that help explain the problem (for example, output captured with `NCCL_DEBUG=INFO`).

**Platform**
 - Device: [e.g. NVIDIA V100]
 - OS: [e.g. Debian 10.2 buster]
 - CUDA version: [e.g. 11.1]
 - NCCL version: [e.g. 2.7.8-1]
 - PyTorch version: [e.g. 1.8.0]
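
If it helps, the fields above can be gathered with a short script. This is a minimal sketch that assumes PyTorch is installed in the affected environment; `collect_platform_info` is a hypothetical helper name, and the NCCL version it reports is the one PyTorch exposes, which may differ from a system-wide install.

```python
import platform


def collect_platform_info():
    """Return a dict with the fields requested in the Platform section."""
    info = {"OS": platform.platform()}
    try:
        import torch  # assumption: PyTorch is the framework in use
    except ImportError:
        info["PyTorch version"] = "not installed"
        return info

    info["PyTorch version"] = torch.__version__
    info["CUDA version"] = torch.version.cuda  # None on CPU-only builds
    if torch.cuda.is_available():
        info["Device"] = torch.cuda.get_device_name(0)
        try:
            # NCCL version as exposed by PyTorch; not present on every build
            info["NCCL version"] = torch.cuda.nccl.version()
        except Exception:
            info["NCCL version"] = "unavailable"
    return info


if __name__ == "__main__":
    for key, value in collect_platform_info().items():
        print(f"{key}: {value}")
```

Paste the printed lines into the list above and correct anything the script could not detect.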

**Additional context**
Add any other context about the problem here.
