BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance

Abstract: R. Thomas McCoy, Junghyun Min, Tal Linzen. Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP. 2020.
0 Replies
Loading