Zelda Rose: a tool for hassle-free training of transformer models

Published: 09 Oct 2023, Last Modified: 27 Oct 2023, NLP-OSS 2023
Keywords: transformers, pretraining
TL;DR: Zelda Rose is a command line interface for pretraining transformer-based models.
Abstract: Zelda Rose is a command line interface for pretraining transformer-based models. Its purpose is to enable an easy start for users who are interested in training these ubiquitous models but are unable or unwilling to engage with more comprehensive, but more complex, frameworks and with the intricate interactions between libraries for managing models, datasets and computations. Training a model requires no code on the user's part and produces models directly compatible with the HuggingFace ecosystem, allowing quick and easy distribution and reuse. Particular care is given to lowering the cost of maintenance and to future-proofing, by making the code as modular as possible and by relying on third-party libraries to keep ad-hoc code to a strict minimum.
Submission Number: 10