A New Alchemy: Language Model Development as a Subfield?

Published: 16 Feb 2024, Last Modified: 28 Mar 2024BT@ICLR2024EveryoneRevisionsBibTeXCC BY 4.0
Keywords: Large language models
Blogpost Url: https://iclr-blogposts.github.io/2024/blog/language-model-development-as-a-new-subfield/
Abstract: This blog post makes the case that the body of research on language models become sufficiently large and mature that we can start thinking about “language model development” as a new subfield. To support this claim, we sketch out the focuses and methodologies of this new subfield. In addition, we provide some personal reflections on what to do when your field of study gives birth to a new one.
Ref Papers: https://arxiv.org/abs/2005.14165, https://arxiv.org/abs/2201.11903, https://arxiv.org/abs/2205.14135, https://arxiv.org/abs/2211.17192, https://arxiv.org/abs/2306.14048, https://arxiv.org/abs/2208.07339, https://arxiv.org/abs/2102.11972, https://arxiv.org/abs/2205.05638
Id Of The Authors Of The Papers: ~Colin_Raffel1
Conflict Of Interest: I have an institutional conflict with anyone from Google, UNC Chapel Hill, University of Toronto, the Vector Institute, or Hugging Face.
Submission Number: 26
Loading