========================Generation for [chatgpt, instructgpt] for instance 0 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [koala, mpt] for instance 0 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [instructgpt, redpajama] for instance 1 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [koala, instructgpt] for instance 1 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [dolly, llama] for instance 2 ============================
---------RAW GENERATION--------
Model llama is better
---------PATTERN MATCHED-------
Model llama is better
========================Generation for [gpt4, dolly] for instance 2 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [vicuna, alpaca] for instance 3 ============================
---------RAW GENERATION--------
Model alpaca is better
---------PATTERN MATCHED-------
Model alpaca is better
========================Generation for [vicuna, baize] for instance 3 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [dolly, falcon] for instance 4 ============================
---------RAW GENERATION--------
Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [wizardlm, openassist] for instance 4 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [wizardlm, cohere] for instance 5 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [instructgpt, falcon] for instance 5 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [vicuna, koala] for instance 6 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [vicuna, falcon] for instance 6 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [baize, dolly] for instance 7 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [koala, dolly] for instance 7 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [chatgpt, mpt] for instance 8 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [instructgpt, cohere] for instance 8 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [gpt4, openassist] for instance 9 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [chatgpt, openassist] for instance 9 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [mpt, llama] for instance 9 ============================
---------RAW GENERATION--------
Model llama is better
---------PATTERN MATCHED-------
Model llama is better
========================Generation for [cohere, mpt] for instance 10 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [koala, gpt4] for instance 10 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [chatgpt, openassist] for instance 11 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [koala, chatgpt] for instance 11 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [instructgpt, falcon] for instance 12 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [wizardlm, gpt4] for instance 12 ============================
---------RAW GENERATION--------
Model gpt4 (You) is better
---------PATTERN MATCHED-------
Model gpt4  is better
========================Generation for [chatgpt, instructgpt] for instance 13 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [instructgpt, falcon] for instance 13 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [chatgpt, falcon] for instance 14 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [baize, redpajama] for instance 14 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [chatgpt, mpt] for instance 15 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [gpt4, alpaca] for instance 15 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [dolly, llama] for instance 16 ============================
---------RAW GENERATION--------
Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [wizardlm, falcon] for instance 16 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [wizardlm, alpaca] for instance 17 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [instructgpt, dolly] for instance 17 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [instructgpt, redpajama] for instance 18 ============================
---------RAW GENERATION--------
Model redpajama is better
---------PATTERN MATCHED-------
Model redpajama is better
========================Generation for [cohere, openassist] for instance 18 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [koala, openassist] for instance 19 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [vicuna, koala] for instance 19 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [chatgpt, mpt] for instance 19 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [wizardlm, falcon] for instance 20 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [wizardlm, gpt4] for instance 20 ============================
---------RAW GENERATION--------
Model gpt4 (You) is better
---------PATTERN MATCHED-------
Model gpt4  is better
========================Generation for [chatgpt, cohere] for instance 21 ============================
---------RAW GENERATION--------
Neither
---------PATTERN MATCHED-------
Neither
========================Generation for [wizardlm, falcon] for instance 21 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [cohere, llama] for instance 22 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [vicuna, falcon] for instance 22 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [cohere, mpt] for instance 23 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [dolly, mpt] for instance 23 ============================
---------RAW GENERATION--------
Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [chatgpt, falcon] for instance 24 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [chatgpt, openassist] for instance 24 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [chatgpt, mpt] for instance 25 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [wizardlm, llama] for instance 25 ============================
---------RAW GENERATION--------
Model llama is better
---------PATTERN MATCHED-------
Model llama is better
========================Generation for [koala, falcon] for instance 26 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [redpajama, mpt] for instance 26 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [koala, chatgpt] for instance 27 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [instructgpt, mpt] for instance 27 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [gpt4, mpt] for instance 28 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [openassist, redpajama] for instance 28 ============================
---------RAW GENERATION--------
Model redpajama is better
---------PATTERN MATCHED-------
Model redpajama is better
========================Generation for [alpaca, falcon] for instance 29 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [openassist, llama] for instance 29 ============================
---------RAW GENERATION--------
Model llama is better
---------PATTERN MATCHED-------
Model llama is better
========================Generation for [dolly, redpajama] for instance 29 ============================
---------RAW GENERATION--------
Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [dolly, redpajama] for instance 30 ============================
---------RAW GENERATION--------
Model redpajama is better
---------PATTERN MATCHED-------
Model redpajama is better
========================Generation for [baize, llama] for instance 30 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [chatgpt, instructgpt] for instance 31 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [chatgpt, dolly] for instance 31 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [vicuna, instructgpt] for instance 32 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [chatgpt, dolly] for instance 32 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [gpt4, llama] for instance 33 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [openassist, falcon] for instance 33 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [gpt4, openassist] for instance 34 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [cohere, mpt] for instance 34 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [gpt4, cohere] for instance 35 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [koala, alpaca] for instance 35 ============================
---------RAW GENERATION--------
Model alpaca is better
---------PATTERN MATCHED-------
Model alpaca is better
========================Generation for [koala, redpajama] for instance 36 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [instructgpt, dolly] for instance 36 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [mpt, llama] for instance 37 ============================
---------RAW GENERATION--------
Model llama is better
---------PATTERN MATCHED-------
Model llama is better
========================Generation for [chatgpt, instructgpt] for instance 37 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [redpajama, mpt] for instance 38 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [chatgpt, alpaca] for instance 38 ============================
---------RAW GENERATION--------
Model alpaca is better
---------PATTERN MATCHED-------
Model alpaca is better
========================Generation for [chatgpt, dolly] for instance 39 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [baize, chatgpt] for instance 39 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [koala, mpt] for instance 39 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [baize, falcon] for instance 40 ============================
---------RAW GENERATION--------
Model falcon is better
---------PATTERN MATCHED-------
Model falcon is better
========================Generation for [koala, alpaca] for instance 40 ============================
---------RAW GENERATION--------
Model koala is better
---------PATTERN MATCHED-------
Model koala is better
========================Generation for [openassist, alpaca] for instance 41 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [koala, dolly] for instance 41 ============================
---------RAW GENERATION--------
Model dolly is better
---------PATTERN MATCHED-------
Model dolly is better
========================Generation for [wizardlm, gpt4] for instance 42 ============================
---------RAW GENERATION--------
Model wizardlm is better
---------PATTERN MATCHED-------
Model wizardlm is better
========================Generation for [koala, redpajama] for instance 42 ============================
---------RAW GENERATION--------
Model redpajama is better
---------PATTERN MATCHED-------
Model redpajama is better
========================Generation for [chatgpt, cohere] for instance 43 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [koala, gpt4] for instance 43 ============================
---------RAW GENERATION--------
Model gpt4 (You) is better
---------PATTERN MATCHED-------
Model gpt4  is better
========================Generation for [alpaca, llama] for instance 44 ============================
---------RAW GENERATION--------
Model alpaca is better
---------PATTERN MATCHED-------
Model alpaca is better
========================Generation for [koala, mpt] for instance 44 ============================
---------RAW GENERATION--------
Model mpt is better
---------PATTERN MATCHED-------
Model mpt is better
========================Generation for [vicuna, dolly] for instance 45 ============================
---------RAW GENERATION--------
Both are equal
---------PATTERN MATCHED-------
Both are equal
========================Generation for [chatgpt, llama] for instance 45 ============================
---------RAW GENERATION--------
Model chatgpt is better
---------PATTERN MATCHED-------
Model chatgpt is better
========================Generation for [koala, llama] for instance 46 ============================
---------RAW GENERATION--------
Both are the same
---------PATTERN MATCHED-------
Both are the same
========================Generation for [koala, instructgpt] for instance 46 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [chatgpt, instructgpt] for instance 47 ============================
---------RAW GENERATION--------
Model instructgpt is better
---------PATTERN MATCHED-------
Model instructgpt is better
========================Generation for [alpaca, llama] for instance 47 ============================
---------RAW GENERATION--------
Model alpaca is better
---------PATTERN MATCHED-------
Model alpaca is better
========================Generation for [cohere, mpt] for instance 48 ============================
---------RAW GENERATION--------
Model cohere is better
---------PATTERN MATCHED-------
Model cohere is better
========================Generation for [gpt4, alpaca] for instance 48 ============================
---------RAW GENERATION--------
Model gpt4 is better
---------PATTERN MATCHED-------
Model gpt4 is better
========================Generation for [baize, falcon] for instance 49 ============================
---------RAW GENERATION--------
Model baize is better
---------PATTERN MATCHED-------
Model baize is better
========================Generation for [vicuna, openassist] for instance 49 ============================
---------RAW GENERATION--------
Model openassist is better
---------PATTERN MATCHED-------
Model openassist is better
========================Generation for [redpajama, falcon] for instance 49 ============================
---------RAW GENERATION--------
Model redpajama is better
---------PATTERN MATCHED-------
Model redpajama is better
