To use this code:
1) Place this code in the fairseq/modules directory of your fairseq checkout.
2) MHBiS4Layer is the main module described in the paper; MHBiS4EncoderLayer is a module for use in fairseq's RoBERTa architecture.
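
Step 1 can be done by hand; the snippet below is only a minimal sketch of it, not part of the released code. The helper name install_into_fairseq and both paths in the usage note are hypothetical placeholders, since the actual file name and the location of your fairseq checkout are not stated here.

```python
import shutil
from pathlib import Path


def install_into_fairseq(src_file, fairseq_root):
    """Copy src_file into <fairseq_root>/fairseq/modules and return the new path.

    Hypothetical helper for illustration only; copying the file manually
    has the same effect.
    """
    src = Path(src_file)
    modules_dir = Path(fairseq_root) / "fairseq" / "modules"
    # Refuse to guess: the target directory must already exist in the checkout.
    if not modules_dir.is_dir():
        raise FileNotFoundError(f"{modules_dir} not found; is this a fairseq checkout?")
    # copy2 preserves file metadata and returns the destination path.
    return Path(shutil.copy2(src, modules_dir / src.name))
```

For example, install_into_fairseq("mhbis4.py", "/path/to/fairseq") would copy a file saved under the assumed name mhbis4.py. With that assumed name, the classes would then be importable as fairseq.modules.mhbis4.MHBiS4Layer and fairseq.modules.mhbis4.MHBiS4EncoderLayer; adjust the module path to match the real file name.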