========================Generation for [dolly, mpt] for instance 0 ============================
---------RAW GENERATION--------
Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [falcon, llama] for instance 0 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [koala, gpt4] for instance 1 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [baize, cohere] for instance 1 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [chatgpt, cohere] for instance 2 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [alpaca, llama] for instance 2 ============================
---------RAW GENERATION--------
Model_alpaca is better
---------PATTERN MATCHED-------
Model_alpaca is better
========================Generation for [mpt, falcon] for instance 3 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [koala, falcon] for instance 3 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [openassist, redpajama] for instance 4 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [wizardlm, gpt4] for instance 4 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [mpt, falcon] for instance 5 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [vicuna, gpt4] for instance 5 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [redpajama, mpt] for instance 6 ============================
---------RAW GENERATION--------
Model redpajama is better
---------PATTERN MATCHED-------
Model redpajama is better
========================Generation for [wizardlm, alpaca] for instance 6 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [wizardlm, dolly] for instance 7 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [chatgpt, alpaca] for instance 7 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [openassist, mpt] for instance 8 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [baize, gpt4] for instance 8 ============================
---------RAW GENERATION--------
Model baize
---------PATTERN MATCHED-------
Model baize
========================Generation for [baize, falcon] for instance 9 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [wizardlm, cohere] for instance 9 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [vicuna, llama] for instance 9 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [openassist, mpt] for instance 10 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [alpaca, llama] for instance 10 ============================
---------RAW GENERATION--------
Model llama is better
---------PATTERN MATCHED-------
Model llama is better
========================Generation for [wizardlm, llama] for instance 11 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [vicuna, openassist] for instance 11 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [mpt, falcon] for instance 12 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [instructgpt, cohere] for instance 12 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [mpt, falcon] for instance 13 ============================
---------RAW GENERATION--------
Model falcon is better.
---------PATTERN MATCHED-------
Model falcon is better.
========================Generation for [cohere, mpt] for instance 13 ============================
---------RAW GENERATION--------
Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [cohere, alpaca] for instance 14 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [falcon, llama] for instance 14 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [wizardlm, openassist] for instance 15 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [vicuna, cohere] for instance 15 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [instructgpt, llama] for instance 16 ============================
---------RAW GENERATION--------
Model instructgpt is better.
---------PATTERN MATCHED-------
Model instructgpt is better.
========================Generation for [baize, mpt] for instance 16 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [gpt4, falcon] for instance 17 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [vicuna, mpt] for instance 17 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [vicuna, cohere] for instance 18 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [wizardlm, cohere] for instance 18 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [cohere, redpajama] for instance 19 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [koala, falcon] for instance 19 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [wizardlm, instructgpt] for instance 19 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [baize, llama] for instance 20 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [vicuna, mpt] for instance 20 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [baize, redpajama] for instance 21 ============================
---------RAW GENERATION--------
Model baize is better.
---------PATTERN MATCHED-------
Model baize is better.
========================Generation for [vicuna, gpt4] for instance 21 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [chatgpt, mpt] for instance 22 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [baize, cohere] for instance 22 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [vicuna, baize] for instance 23 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [instructgpt, openassist] for instance 23 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [openassist, falcon] for instance 24 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [koala, redpajama] for instance 24 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [openassist, redpajama] for instance 25 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [koala, alpaca] for instance 25 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [mpt, llama] for instance 26 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [openassist, mpt] for instance 26 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [chatgpt, alpaca] for instance 27 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [wizardlm, instructgpt] for instance 27 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [mpt, llama] for instance 28 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [openassist, redpajama] for instance 28 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [dolly, openassist] for instance 29 ============================
---------RAW GENERATION--------
Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [chatgpt, falcon] for instance 29 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [baize, instructgpt] for instance 29 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [openassist, redpajama] for instance 30 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [openassist, alpaca] for instance 30 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [koala, instructgpt] for instance 31 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [baize, dolly] for instance 31 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [cohere, redpajama] for instance 32 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [cohere, mpt] for instance 32 ============================
---------RAW GENERATION--------
Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [dolly, falcon] for instance 33 ============================
---------RAW GENERATION--------
Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [chatgpt, alpaca] for instance 33 ============================
---------RAW GENERATION--------
Model chatgpt is better.
---------PATTERN MATCHED-------
Model chatgpt is better.
========================Generation for [openassist, mpt] for instance 34 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [vicuna, mpt] for instance 34 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [koala, dolly] for instance 35 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [chatgpt, falcon] for instance 35 ============================
---------RAW GENERATION--------
Model falcon is better.
---------PATTERN MATCHED-------
Model falcon is better.
========================Generation for [vicuna, wizardlm] for instance 36 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [baize, redpajama] for instance 36 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [wizardlm, llama] for instance 37 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [chatgpt, alpaca] for instance 37 ============================
---------RAW GENERATION--------
Model chatgpt is better.
---------PATTERN MATCHED-------
Model chatgpt is better.
========================Generation for [gpt4, dolly] for instance 38 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [vicuna, instructgpt] for instance 38 ============================
---------RAW GENERATION--------
Model vicuna is better.
---------PATTERN MATCHED-------
Model vicuna is better.
========================Generation for [chatgpt, redpajama] for instance 39 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [baize, openassist] for instance 39 ============================
---------RAW GENERATION--------
Model baize is better.
---------PATTERN MATCHED-------
Model baize is better.
========================Generation for [gpt4, cohere] for instance 39 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [baize, cohere] for instance 40 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [vicuna, cohere] for instance 40 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [chatgpt, dolly] for instance 41 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [vicuna, dolly] for instance 41 ============================
---------RAW GENERATION--------
Model vicuna
---------PATTERN MATCHED-------
Model vicuna
========================Generation for [baize, alpaca] for instance 42 ============================
---------RAW GENERATION--------
Model baize
---------PATTERN MATCHED-------
Model baize
========================Generation for [gpt4, mpt] for instance 42 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [gpt4, dolly] for instance 43 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [instructgpt, redpajama] for instance 43 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [gpt4, llama] for instance 44 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [redpajama, falcon] for instance 44 ============================
---------RAW GENERATION--------
Model falcon is better.
---------PATTERN MATCHED-------
Model falcon is better.
========================Generation for [baize, redpajama] for instance 45 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [koala, dolly] for instance 45 ============================
---------RAW GENERATION--------
Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [gpt4, cohere] for instance 46 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [vicuna, llama] for instance 46 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [baize, mpt] for instance 47 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [cohere, mpt] for instance 47 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [chatgpt, openassist] for instance 48 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [koala, gpt4] for instance 48 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [koala, alpaca] for instance 49 ============================
---------RAW GENERATION--------
Model alpaca is better
---------PATTERN MATCHED-------
Model alpaca is better
========================Generation for [vicuna, falcon] for instance 49 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [chatgpt, gpt4] for instance 49 ============================
---------RAW GENERATION--------
Model chatgpt is better.
---------PATTERN MATCHED-------
Model chatgpt is better.
