# Note

This code is still under development and we are working on the paper. **DON'T SHARE ANY DETAILS OF this code with others without the agreement of all the authors.**

# Introduction to DeBERTa 
Disentangled Attention BERT


# Requirements

The code has been tested on an NVIDIA DGX-2 node running Ubuntu 18.04 LTS with:
- CUDA 10.0
- PyTorch 1.3.0
- Python 3.6
- Bash 4.0
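
To compare a local environment against the versions listed above, a small check along these lines may help. This is only a sketch and not part of the repository: the names `TESTED`, `parse_version`, and `report_environment` are made up for illustration, and it assumes PyTorch exposes its version through `torch.__version__` and `torch.version.cuda`. If PyTorch is not installed, the corresponding entries are reported as `None`.

```python
import importlib.util
import sys

# Versions from the requirements list above, as (major, minor) pairs.
TESTED = {"python": (3, 6), "torch": (1, 3), "cuda": (10, 0)}


def parse_version(text):
    """Return the leading (major, minor) of a version string such as '1.3.0+cu100'."""
    parts = text.split("+")[0].split(".")
    return tuple(int(p) for p in parts[:2])


def report_environment():
    """Collect (major, minor) for each component; None when it is not available."""
    found = {"python": tuple(sys.version_info[:2]), "torch": None, "cuda": None}
    # Only import torch if it is installed, so the check also runs without it.
    if importlib.util.find_spec("torch") is not None:
        import torch

        found["torch"] = parse_version(torch.__version__)
        # torch.version.cuda is None for CPU-only builds.
        if torch.version.cuda is not None:
            found["cuda"] = parse_version(torch.version.cuda)
    return found


if __name__ == "__main__":
    for name, version in report_environment().items():
        status = "matches" if version == TESTED[name] else "differs from"
        print(f"{name}: {version} {status} tested version {TESTED[name]}")
```

A mismatch does not necessarily mean the code will fail; it only means the combination differs from the one that was tested.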


# Try the code

- Install the Python dependencies: `pip install -r requirement`

- To run in Docker, use `./run_docker.sh`

Run `applications/glue/mnli_base.sh` to test the MNLI base model.

