To run the LongBench evaluation, use the following command (the default model is ``meta-llama/Llama-3.1-8B-Instruct''):

python run_longbench.py --kv_type weighedbw2 --datasets qasper --e
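
If the evaluation needs to be launched from another script, the same invocation can be assembled programmatically. The following is a minimal sketch: the helper name `build_longbench_command` and its parameters are hypothetical, and it uses only the flags shown in the command above. It assumes `run_longbench.py` is in the current working directory.

```python
import shlex
import sys


def build_longbench_command(kv_type="weighedbw2", datasets="qasper", use_e=True):
    """Assemble the argv list for run_longbench.py (hypothetical helper).

    Only the flags that appear in the command above are used; no other
    options of the script are assumed.
    """
    cmd = [
        sys.executable,          # the current Python interpreter
        "run_longbench.py",      # assumed to be in the working directory
        "--kv_type", kv_type,
        "--datasets", datasets,
    ]
    if use_e:
        cmd.append("--e")
    return cmd


if __name__ == "__main__":
    cmd = build_longbench_command()
    # Print the command in shell-quoted form for inspection.
    print(shlex.join(cmd))
    # To actually launch the evaluation, uncomment the next two lines:
    # import subprocess
    # subprocess.run(cmd, check=True)
```

Building the arguments as a list rather than a single string avoids shell-quoting problems if a value ever contains spaces.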