# Run tests with different long-context and multi-document models,
# taking the full text of each article as input and outputting only the final answer.