# From Acceleration to Saturation: Scaling Behavior of Bootstrapped Language Model Pretraining

- This repository contains source codes to reproduce the results in **From Acceleration to Saturation: Scaling Behavior of Bootstrapped Language Model Pretraining**.

## Structure
  - `data`: contains the data obtained from our scaling law experiments. 
  - `analysis`:
    - `analysis/joint_scaling.ipynb`: contains example fitting the joint scaling law.
    - `analysis/fit_functions.ipynb`: contains example fitting various functional forms
  - `experiments`: contains example slurm scripts for running our scaling law experiments.

## Requirements
  - Megatron-LM: core_v0.8.0

## License
This implementation is licensed under the Apache License 2.0.