# preprocessing data
# filtering the data based on best-of-n correctness with the base model, and the generalizability on SQL complexity

# train_original.jsonl: all training data in distribution in all dimensions, answers sampled from the original model
# dev_original.jsonl: all dev data in distribution in all dimensions, answers sampled from the original model