# README

This repository provides the core modeling implementations for the **GPT-2**, **Pythia**, and **LLaMA** architectures used in our experiments.  
The implementations are compatible with the Hugging Face **Transformers** library, making them easy to integrate into existing workflows.

## Pretraining
We rely on Hugging Face Transformers together with **DeepSpeed** to conduct efficient large-scale pretraining.
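
As a rough illustration of how the two libraries fit together, the sketch below builds a small DeepSpeed configuration of the kind the Transformers `Trainer` accepts. The ZeRO stage, the use of bf16, and the helper name `build_deepspeed_config` are assumptions made for this example and are not the settings used in our experiments.

```python
import json


def build_deepspeed_config(zero_stage: int = 2) -> dict:
    """Return a DeepSpeed config dict (illustrative values, not the authors' settings)."""
    if zero_stage not in (0, 1, 2, 3):
        raise ValueError(f"ZeRO stage must be 0-3, got {zero_stage}")
    return {
        # "auto" lets the Transformers Trainer fill these in from TrainingArguments.
        "train_batch_size": "auto",
        "train_micro_batch_size_per_gpu": "auto",
        "gradient_accumulation_steps": "auto",
        "gradient_clipping": "auto",
        "bf16": {"enabled": "auto"},
        # ZeRO partitions optimizer state (and gradients at stage 2) across GPUs.
        "zero_optimization": {"stage": zero_stage},
    }


if __name__ == "__main__":
    # Print the config as JSON so it can be saved to a file.
    print(json.dumps(build_deepspeed_config(), indent=2))
```

The printed JSON can be saved to a file (for example a hypothetical `ds_config.json`) and passed to the trainer through `TrainingArguments(deepspeed="ds_config.json")`; the dict can also be passed directly.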

## Evaluation
For downstream evaluation, we use the **lm-evaluation-harness** framework to ensure standardized benchmarking.
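
A minimal sketch of how an evaluation run might be launched is shown below. It assembles an `lm_eval` command line for a Transformers-compatible checkpoint. The checkpoint path, task names, batch size, and the helper `build_lm_eval_command` are placeholders chosen for illustration; this README does not specify which tasks were evaluated.

```python
import shlex


def build_lm_eval_command(model_path, tasks, batch_size=8, output_path="results"):
    """Assemble an lm_eval invocation as an argument list (hypothetical helper)."""
    if not tasks:
        raise ValueError("at least one task name is required")
    return [
        "lm_eval",
        "--model", "hf",  # load the checkpoint through Hugging Face Transformers
        "--model_args", f"pretrained={model_path}",
        "--tasks", ",".join(tasks),  # the harness expects a comma-separated list
        "--batch_size", str(batch_size),
        "--output_path", output_path,
    ]


if __name__ == "__main__":
    # Placeholder path and tasks; substitute a real checkpoint and task list.
    cmd = build_lm_eval_command("path/to/checkpoint", ["hellaswag", "arc_easy"])
    print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` once the harness is installed, or the printed string can be pasted into a shell.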

## Release Plan
All source code and pretrained checkpoints will be released alongside the **camera-ready version** of our paper.
