========================Generation for [chatgpt, instructgpt] for instance 0 ============================
---------RAW GENERATION--------
Model instructgpt is better.
---------PATTERN MATCHED-------
Model instructgpt is better.
========================Generation for [chatgpt, falcon] for instance 0 ============================
---------RAW GENERATION--------

Model falcon is better.
---------PATTERN MATCHED-------
Model falcon is better.
========================Generation for [redpajama, mpt] for instance 1 ============================
---------RAW GENERATION--------

Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [openassist, redpajama] for instance 1 ============================
---------RAW GENERATION--------

Model redpajama is better.
---------PATTERN MATCHED-------
Model redpajama is better.
========================Generation for [mpt, alpaca] for instance 2 ============================
---------RAW GENERATION--------

Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [vicuna, alpaca] for instance 2 ============================
---------RAW GENERATION--------

Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [dolly, mpt] for instance 3 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [chatgpt, redpajama] for instance 3 ============================
---------RAW GENERATION--------

Model Redpajama is better.
---------PATTERN MATCHED-------
Model Redpajama is better.
========================Generation for [vicuna, baize] for instance 4 ============================
---------RAW GENERATION--------

Model baize is better.
---------PATTERN MATCHED-------
Model baize is better.
========================Generation for [gpt4, alpaca] for instance 4 ============================
---------RAW GENERATION--------
Model alpaca is better.
---------PATTERN MATCHED-------
Model alpaca is better.
========================Generation for [baize, wizardlm] for instance 5 ============================
---------RAW GENERATION--------

Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [chatgpt, openassist] for instance 5 ============================
---------RAW GENERATION--------

Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [wizardlm, alpaca] for instance 6 ============================
---------RAW GENERATION--------

Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [vicuna, openassist] for instance 6 ============================
---------RAW GENERATION--------
Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [openassist, alpaca] for instance 7 ============================
---------RAW GENERATION--------
Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [instructgpt, cohere] for instance 7 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [wizardlm, instructgpt] for instance 8 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [baize, falcon] for instance 8 ============================
---------RAW GENERATION--------

Model falcon is better.
---------PATTERN MATCHED-------
Model falcon is better.
========================Generation for [instructgpt, alpaca] for instance 9 ============================
---------RAW GENERATION--------
Model alpaca is better.
---------PATTERN MATCHED-------
Model alpaca is better.
========================Generation for [baize, mpt] for instance 9 ============================
---------RAW GENERATION--------
Model mpt is better.
---------PATTERN MATCHED-------
Model mpt is better.
========================Generation for [redpajama, llama] for instance 9 ============================
---------RAW GENERATION--------

Model llama is better
---------PATTERN MATCHED-------
Model llama is better
========================Generation for [baize, alpaca] for instance 10 ============================
---------RAW GENERATION--------

Model alpaca is better.
---------PATTERN MATCHED-------
Model alpaca is better.
========================Generation for [wizardlm, redpajama] for instance 10 ============================
---------RAW GENERATION--------

Model wizardlm is better.
---------PATTERN MATCHED-------
Model wizardlm is better.
========================Generation for [vicuna, dolly] for instance 11 ============================
---------RAW GENERATION--------

Model vicuna is better.
---------PATTERN MATCHED-------
Model vicuna is better.
========================Generation for [cohere, alpaca] for instance 11 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [wizardlm, redpajama] for instance 12 ============================
---------RAW GENERATION--------

Model wizardlm is better.
---------PATTERN MATCHED-------
Model wizardlm is better.
========================Generation for [baize, koala] for instance 12 ============================
---------RAW GENERATION--------

Model koala is better.
---------PATTERN MATCHED-------
Model koala is better.
========================Generation for [cohere, alpaca] for instance 13 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [baize, chatgpt] for instance 13 ============================
---------RAW GENERATION--------

Model chatgpt is better.
---------PATTERN MATCHED-------
Model chatgpt is better.
========================Generation for [baize, chatgpt] for instance 14 ============================
---------RAW GENERATION--------

Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [baize, cohere] for instance 14 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [cohere, alpaca] for instance 15 ============================
---------RAW GENERATION--------

Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [wizardlm, dolly] for instance 15 ============================
---------RAW GENERATION--------

Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [koala, falcon] for instance 16 ============================
---------RAW GENERATION--------

Model falcon is better.
---------PATTERN MATCHED-------
Model falcon is better.
========================Generation for [mpt, alpaca] for instance 16 ============================
---------RAW GENERATION--------

Model alpaca is better.
---------PATTERN MATCHED-------
Model alpaca is better.
========================Generation for [alpaca, falcon] for instance 17 ============================
---------RAW GENERATION--------

Model falcon is better.
---------PATTERN MATCHED-------
Model falcon is better.
========================Generation for [wizardlm, openassist] for instance 17 ============================
---------RAW GENERATION--------
Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [koala, wizardlm] for instance 18 ============================
---------RAW GENERATION--------

Model wizardlm is better.
---------PATTERN MATCHED-------
Model wizardlm is better.
========================Generation for [cohere, openassist] for instance 18 ============================
---------RAW GENERATION--------
Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [vicuna, baize] for instance 19 ============================
---------RAW GENERATION--------

Model vicuna is better.
---------PATTERN MATCHED-------
Model vicuna is better.
========================Generation for [chatgpt, openassist] for instance 19 ============================
---------RAW GENERATION--------
Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [gpt4, alpaca] for instance 19 ============================
---------RAW GENERATION--------

Model alpaca is better
---------PATTERN MATCHED-------
Model alpaca is better
========================Generation for [baize, dolly] for instance 20 ============================
---------RAW GENERATION--------

Model dolly is better.
---------PATTERN MATCHED-------
Model dolly is better.
========================Generation for [chatgpt, gpt4] for instance 20 ============================
---------RAW GENERATION--------
Model GPT4 is better.
---------PATTERN MATCHED-------
Model GPT4 is better.
========================Generation for [openassist, llama] for instance 21 ============================
---------RAW GENERATION--------
Model llama is better.
---------PATTERN MATCHED-------
Model llama is better.
========================Generation for [cohere, alpaca] for instance 21 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [gpt4, openassist] for instance 22 ============================
---------RAW GENERATION--------
Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [openassist, alpaca] for instance 22 ============================
---------RAW GENERATION--------

Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [wizardlm, llama] for instance 23 ============================
---------RAW GENERATION--------

Model llama is better.
---------PATTERN MATCHED-------
Model llama is better.
========================Generation for [chatgpt, gpt4] for instance 23 ============================
---------RAW GENERATION--------
Model gpt4 is better.
---------PATTERN MATCHED-------
Model gpt4 is better.
========================Generation for [instructgpt, cohere] for instance 24 ============================
---------RAW GENERATION--------

Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [cohere, falcon] for instance 24 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [wizardlm, openassist] for instance 25 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [koala, gpt4] for instance 25 ============================
---------RAW GENERATION--------

Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [chatgpt, dolly] for instance 26 ============================
---------RAW GENERATION--------

Model dolly is better.
---------PATTERN MATCHED-------
Model dolly is better.
========================Generation for [instructgpt, gpt4] for instance 26 ============================
---------RAW GENERATION--------

Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [chatgpt, mpt] for instance 27 ============================
---------RAW GENERATION--------

Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [baize, koala] for instance 27 ============================
---------RAW GENERATION--------

Model baize is better.
---------PATTERN MATCHED-------
Model baize is better.
========================Generation for [vicuna, gpt4] for instance 28 ============================
---------RAW GENERATION--------
Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [vicuna, cohere] for instance 28 ============================
---------RAW GENERATION--------

Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [instructgpt, dolly] for instance 29 ============================
---------RAW GENERATION--------

Model dolly is better.
---------PATTERN MATCHED-------
Model dolly is better.
========================Generation for [baize, dolly] for instance 29 ============================
---------RAW GENERATION--------

Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [koala, falcon] for instance 29 ============================
---------RAW GENERATION--------

Model falcon is better.
---------PATTERN MATCHED-------
Model falcon is better.
========================Generation for [wizardlm, alpaca] for instance 30 ============================
---------RAW GENERATION--------

Model alpaca is better.
---------PATTERN MATCHED-------
Model alpaca is better.
========================Generation for [cohere, alpaca] for instance 30 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [instructgpt, alpaca] for instance 31 ============================
---------RAW GENERATION--------

Model alpaca is better
---------PATTERN MATCHED-------
Model alpaca is better
========================Generation for [vicuna, mpt] for instance 31 ============================
---------RAW GENERATION--------

Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [vicuna, koala] for instance 32 ============================
---------RAW GENERATION--------
Model vicuna is better.
---------PATTERN MATCHED-------
Model vicuna is better.
========================Generation for [vicuna, gpt4] for instance 32 ============================
---------RAW GENERATION--------
Model gpt4 is better.
---------PATTERN MATCHED-------
Model gpt4 is better.
========================Generation for [gpt4, redpajama] for instance 33 ============================
---------RAW GENERATION--------
Model redpajama is better
---------PATTERN MATCHED-------
Model redpajama is better
========================Generation for [instructgpt, llama] for instance 33 ============================
---------RAW GENERATION--------

Model instructgpt is better.
---------PATTERN MATCHED-------
Model instructgpt is better.
========================Generation for [openassist, llama] for instance 34 ============================
---------RAW GENERATION--------

Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [cohere, llama] for instance 34 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [koala, wizardlm] for instance 35 ============================
---------RAW GENERATION--------

Model wizardlm is better.
---------PATTERN MATCHED-------
Model wizardlm is better.
========================Generation for [vicuna, chatgpt] for instance 35 ============================
---------RAW GENERATION--------

Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [dolly, falcon] for instance 36 ============================
---------RAW GENERATION--------

Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [openassist, redpajama] for instance 36 ============================
---------RAW GENERATION--------

Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [baize, dolly] for instance 37 ============================
---------RAW GENERATION--------

Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [wizardlm, alpaca] for instance 37 ============================
---------RAW GENERATION--------

Model alpaca is better.
---------PATTERN MATCHED-------
Model alpaca is better.
========================Generation for [vicuna, wizardlm] for instance 38 ============================
---------RAW GENERATION--------

Model vicuna is better.
---------PATTERN MATCHED-------
Model vicuna is better.
========================Generation for [cohere, redpajama] for instance 38 ============================
---------RAW GENERATION--------

Model redpajama is better.
---------PATTERN MATCHED-------
Model redpajama is better.
========================Generation for [cohere, falcon] for instance 39 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [mpt, falcon] for instance 39 ============================
---------RAW GENERATION--------

Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [wizardlm, cohere] for instance 39 ============================
---------RAW GENERATION--------
Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [redpajama, mpt] for instance 40 ============================
---------RAW GENERATION--------

Model mpt is better.
---------PATTERN MATCHED-------
Model mpt is better.
========================Generation for [chatgpt, redpajama] for instance 40 ============================
---------RAW GENERATION--------
Model redpajama is better.
---------PATTERN MATCHED-------
Model redpajama is better.
========================Generation for [openassist, mpt] for instance 41 ============================
---------RAW GENERATION--------

Model mpt is better.
---------PATTERN MATCHED-------
Model mpt is better.
========================Generation for [koala, gpt4] for instance 41 ============================
---------RAW GENERATION--------

Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [redpajama, mpt] for instance 42 ============================
---------RAW GENERATION--------

Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [chatgpt, gpt4] for instance 42 ============================
---------RAW GENERATION--------

Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [koala, llama] for instance 43 ============================
---------RAW GENERATION--------

Model llama is better
---------PATTERN MATCHED-------
Model llama is better
========================Generation for [cohere, alpaca] for instance 43 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [instructgpt, redpajama] for instance 44 ============================
---------RAW GENERATION--------

Model redpajama is better.
---------PATTERN MATCHED-------
Model redpajama is better.
========================Generation for [cohere, alpaca] for instance 44 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [chatgpt, mpt] for instance 45 ============================
---------RAW GENERATION--------

Model mpt is better.
---------PATTERN MATCHED-------
Model mpt is better.
========================Generation for [koala, llama] for instance 45 ============================
---------RAW GENERATION--------

Model koala is better.
---------PATTERN MATCHED-------
Model koala is better.
========================Generation for [wizardlm, gpt4] for instance 46 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [vicuna, falcon] for instance 46 ============================
---------RAW GENERATION--------

Model vicuna is better
---------PATTERN MATCHED-------
Model vicuna is better
========================Generation for [dolly, openassist] for instance 47 ============================
---------RAW GENERATION--------
Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [chatgpt, cohere] for instance 47 ============================
---------RAW GENERATION--------

Model cohere is better.
---------PATTERN MATCHED-------
Model cohere is better.
========================Generation for [wizardlm, alpaca] for instance 48 ============================
---------RAW GENERATION--------

Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [baize, openassist] for instance 48 ============================
---------RAW GENERATION--------
Model openassist is better.
---------PATTERN MATCHED-------
Model openassist is better.
========================Generation for [baize, koala] for instance 49 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [cohere, mpt] for instance 49 ============================
---------RAW GENERATION--------

Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [wizardlm, chatgpt] for instance 49 ============================
---------RAW GENERATION--------
Model chatgpt is better.
---------PATTERN MATCHED-------
Model chatgpt is better.
