Exploring the impact of dependency length on learnability
Keywords: language acquisition, length generalization, data complexity, LSTM
TL;DR: LSTMs fail to show computational support for the "less is more" hypothesis, likely due to fundamental architectural limitations that challenge the analogy between network training and human language acquisition.
Submission Number: 32
Loading