# Qualitative Side-by-Side Review Inputs

Run directory: `full_gpt2_small_20260430_174439`
Validation examples reviewed: 20

This file contains raw model generations for manual review. Scores are intentionally left for the reviewer rather than inferred automatically.

## Sample 1: `check_all_pairs_equal`

Ground truth: time=O(n^2), space=O(1), correct=1

Reference explanation: Checks if all elements are equal via pairwise comparison. O(n^2) time, O(1) space.

Mismatched training explanation: Searches a list sequentially for a target value. O(n) time, O(1) space.

```python
def check_all_pairs_equal(lst):
    for i in range(len(lst)):
        for j in range(i + 1, len(lst)):
            if lst[i] != lst[j]:
                return False
    return True

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 2: `generate_all_pairs`

Ground truth: time=O(n^2), space=O(n^2), correct=1

Reference explanation: Generates all ordered pairs. O(n^2) time and O(n^2) space.

Mismatched training explanation: Builds a frequency map in O(n) time and O(n) space.

```python
def generate_all_pairs(lst):
    pairs = []
    for i in range(len(lst)):
        for j in range(len(lst)):
            pairs.append((lst[i], lst[j]))
    return pairs

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Re turnsuniqueelementspre se rv in gord er. O(n) time and O(n) space. | Re turnsuniqueelementspre se rv in gord er. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(n^2) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(n^2) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Re turnsuniqueelementspre se rv in gord er. O(n) time and O(n) space. | Re turnsuniqueelementspre se rv in gord er. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(n^2) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computestheabsolutevalue of anumb er in constant time and space. | Computestheabsolutevalue of anumb er in constant time and space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(n^2) </claim> <claim> correctness=1 </claim> |

## Sample 3: `selection_sort`

Ground truth: time=O(n^2), space=O(1), correct=1

Reference explanation: Selection sort: selects the minimum in O(n^2) nested passes, O(1) space.

Mismatched training explanation: Detects duplicates with nested loops. O(n^2) time, O(1) space.

```python
def selection_sort(lst):
    for i in range(len(lst)):
        min_idx = i
        for j in range(i + 1, len(lst)):
            if lst[j] < lst[min_idx]:
                min_idx = j
        lst[i], lst[min_idx] = lst[min_idx], lst[i]
    return lst

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 4: `selection_sort`

Ground truth: time=O(n^2), space=O(1), correct=1

Reference explanation: Selection sort: selects the minimum in O(n^2) nested passes, O(1) space.

Mismatched training explanation: Detects duplicates with nested loops. O(n^2) time, O(1) space.

```python
def selection_sort(lst):
    for i in range(len(lst)):
        min_idx = i
        for j in range(i + 1, len(lst)):
            if lst[j] < lst[min_idx]:
                min_idx = j
        lst[i], lst[min_idx] = lst[min_idx], lst[i]
    return lst

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 5: `matrix_multiply_buggy`

Ground truth: time=O(n^2), space=O(1), correct=0

Reference explanation: Attempts 2x2 matrix multiplication but is buggy (missing accumulation). O(n^2) time, O(1) auxiliary space.

Mismatched training explanation: Builds a frequency map in O(n) time and O(n) space.

```python
def matrix_multiply_2x2(A, B):
    C = [[0, 0], [0, 0]]
    for i in range(2):
        for j in range(2):
            C[i][j] = A[i][0] * B[0][j]  # bug: missing second term
    return C

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Checks if astr in gconta in sa ch aract er vial in earscan. O(n) time, O(1) space. | Checks if astr in gconta in sa ch aract er vial in earscan. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |
| `no_consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |
| `random_label_consistency` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |

## Sample 6: `matrix_multiply_buggy`

Ground truth: time=O(n^2), space=O(1), correct=0

Reference explanation: Attempts 2x2 matrix multiplication but is buggy (missing accumulation). O(n^2) time, O(1) auxiliary space.

Mismatched training explanation: Attempts to swap two values but contains a logic bug, returning incorrect results.

```python
def matrix_multiply_2x2(A, B):
    C = [[0, 0], [0, 0]]
    for i in range(2):
        for j in range(2):
            C[i][j] = A[i][0] * B[0][j]  # bug: missing second term
    return C

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Checks if astr in gconta in sa ch aract er vial in earscan. O(n) time, O(1) space. | Checks if astr in gconta in sa ch aract er vial in earscan. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |
| `no_consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |
| `random_label_consistency` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |

## Sample 7: `is_sorted`

Ground truth: time=O(n), space=O(1), correct=1

Reference explanation: Checks whether a list is sorted in ascending order. O(n) time, O(1) space.

Mismatched training explanation: Returns unique elements preserving order. O(n) time and O(n) space.

```python
def is_sorted(lst):
    for i in range(len(lst) - 1):
        if lst[i] > lst[i + 1]:
            return False
    return True

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 8: `insertion_sort`

Ground truth: time=O(n^2), space=O(1), correct=1

Reference explanation: Insertion sort: O(n^2) worst-case time, O(1) space.

Mismatched training explanation: Searches a list sequentially for a target value. O(n) time, O(1) space.

```python
def insertion_sort(lst):
    for i in range(1, len(lst)):
        key = lst[i]
        j = i - 1
        while j >= 0 and lst[j] > key:
            lst[j + 1] = lst[j]
            j -= 1
        lst[j + 1] = key
    return lst

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 9: `selection_sort`

Ground truth: time=O(n^2), space=O(1), correct=1

Reference explanation: Selection sort: selects the minimum in O(n^2) nested passes, O(1) space.

Mismatched training explanation: Insertion sort: O(n^2) worst-case time, O(1) space.

```python
def selection_sort(lst):
    for i in range(len(lst)):
        min_idx = i
        for j in range(i + 1, len(lst)):
            if lst[j] < lst[min_idx]:
                min_idx = j
        lst[i], lst[min_idx] = lst[min_idx], lst[i]
    return lst

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). | Computesthesum of a list withas in glepass. Timecom pl exity O(n), sp a ce O(1). <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 10: `matrix_multiply_buggy`

Ground truth: time=O(n^2), space=O(1), correct=0

Reference explanation: Attempts 2x2 matrix multiplication but is buggy (missing accumulation). O(n^2) time, O(1) auxiliary space.

Mismatched training explanation: Attempts to reverse a list in-place but has a bug: it swaps an element with itself.

```python
def matrix_multiply_2x2(A, B):
    C = [[0, 0], [0, 0]]
    for i in range(2):
        for j in range(2):
            C[i][j] = A[i][0] * B[0][j]  # bug: missing second term
    return C

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Checks if astr in gconta in sa ch aract er vial in earscan. O(n) time, O(1) space. | Checks if astr in gconta in sa ch aract er vial in earscan. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |
| `no_consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |
| `random_label_consistency` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=0 </claim> |

## Sample 11: `sign`

Ground truth: time=O(1), space=O(1), correct=1

Reference explanation: Returns the sign of a number (-1, 0, or 1) in O(1) time and space.

Mismatched training explanation: Checks if all elements are equal via pairwise comparison. O(n^2) time, O(1) space.

```python
def sign(x):
    if x > 0: return 1
    if x < 0: return -1
    return 0

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 12: `string_contains`

Ground truth: time=O(n), space=O(1), correct=1

Reference explanation: Checks if a string contains a character via linear scan. O(n) time, O(1) space.

Mismatched training explanation: Reverses a string character by character. O(n) time, O(n) space due to string accumulation.

```python
def string_contains(s, ch):
    for c in s:
        if c == ch:
            return True
    return False

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 13: `all_pairs_sum`

Ground truth: time=O(n^2), space=O(1), correct=1

Reference explanation: Sums all pairs from the list. Double nested loop gives O(n^2) time, O(1) space.

Mismatched training explanation: Checks whether a list is sorted in ascending order. O(n) time, O(1) space.

```python
def all_pairs_sum(lst):
    total = 0
    for i in range(len(lst)):
        for j in range(len(lst)):
            total += lst[i] + lst[j]
    return total

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 14: `max_of_two`

Ground truth: time=O(1), space=O(1), correct=1

Reference explanation: Returns the larger of two numbers using a conditional expression. O(1) time and space.

Mismatched training explanation: Naive polynomial/array convolution. O(n^2) time, O(n) output space.

```python
def max_of_two(a, b):
    return a if a >= b else b

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Checks if astr in gconta in sa ch aract er vial in earscan. O(n) time, O(1) space. | Checks if astr in gconta in sa ch aract er vial in earscan. O(n) time, O(1) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 15: `string_reverse`

Ground truth: time=O(n), space=O(n), correct=1

Reference explanation: Reverses a string character by character. O(n) time, O(n) space due to string accumulation.

Mismatched training explanation: Insertion sort: O(n^2) worst-case time, O(1) space.

```python
def string_reverse(s):
    result = ''
    for ch in s:
        result = ch + result
    return result

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Checks if allelementsareequalviapairwi se comparison. O( n ^ 2 ) time, O(1) space. | Checks if allelementsareequalviapairwi se comparison. O( n ^ 2 ) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |

## Sample 16: `clamp`

Ground truth: time=O(1), space=O(1), correct=1

Reference explanation: Clamps a value between lo and hi bounds in constant time and space.

Mismatched training explanation: Flattens one level of nesting in a list of lists. O(n) time and O(n) space.

```python
def clamp(x, lo, hi):
    return max(lo, min(x, hi))

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(1) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 17: `compute_mean`

Ground truth: time=O(n), space=O(1), correct=1

Reference explanation: Computes arithmetic mean. The built-in sum is O(n); overall O(n) time, O(1) space.

Mismatched training explanation: Naive polynomial/array convolution. O(n^2) time, O(n) output space.

```python
def compute_mean(lst):
    if not lst:
        return 0.0
    return sum(lst) / len(lst)

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computestheabsolutevalue of anumb er in constant time and space. | Computestheabsolutevalue of anumb er in constant time and space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computestheabsolutevalue of anumb er in constant time and space. | Computestheabsolutevalue of anumb er in constant time and space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computestheabsolutevalue of anumb er in constant time and space. | Computestheabsolutevalue of anumb er in constant time and space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computestheabsolutevalue of anumb er in constant time and space. | Computestheabsolutevalue of anumb er in constant time and space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 18: `prefix_sums`

Ground truth: time=O(n), space=O(n), correct=1

Reference explanation: Computes prefix sum array. O(n) time and O(n) space.

Mismatched training explanation: Converts Celsius to Fahrenheit in constant time and space.

```python
def prefix_sums(lst):
    ps = [0] * (len(lst) + 1)
    for i, x in enumerate(lst):
        ps[i + 1] = ps[i] + x
    return ps

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computestheabsolutevalue of anumb er in constant time and space. | Computestheabsolutevalue of anumb er in constant time and space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |

## Sample 19: `string_contains`

Ground truth: time=O(n), space=O(1), correct=1

Reference explanation: Checks if a string contains a character via linear scan. O(n) time, O(1) space.

Mismatched training explanation: Detects duplicates with nested loops. O(n^2) time, O(1) space.

```python
def string_contains(s, ch):
    for c in s:
        if c == ch:
            return True
    return False

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Computesprefixsumarray. O(n) time and O(n) space. | Computesprefixsumarray. O(n) time and O(n) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n) </claim> <claim> space_complexity=O(1) </claim> <claim> correctness=1 </claim> |

## Sample 20: `naive_convolution`

Ground truth: time=O(n^2), space=O(n), correct=1

Reference explanation: Naive polynomial/array convolution. O(n^2) time, O(n) output space.

Mismatched training explanation: Computes prefix sum array. O(n) time and O(n) space.

```python
def naive_convolution(a, b):
    n = len(a) + len(b) - 1
    result = [0] * n
    for i in range(len(a)):
        for j in range(len(b)):
            result[i + j] += a[i] * b[j]
    return result

```

| Variant | Generated prose | Full generation |
|---|---|---|
| `consistency_loss` | Checks if allelementsareequalviapairwi se comparison. O( n ^ 2 ) time, O(1) space. | Checks if allelementsareequalviapairwi se comparison. O( n ^ 2 ) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
| `no_consistency_loss` | Checkswheth er an in teg er isevenus in gthemoduloop er ator. Runs in O(1) time and O(1) space. | Checkswheth er an in teg er isevenus in gthemoduloop er ator. Runs in O(1) time and O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
| `claim_only_pooling` | Computesn! it er atively. O(n) time, O(1) space. | Computesn! it er atively. O(n) time, O(1) space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
| `random_label_consistency` | Computestheabsolutevalue of anumb er in constant time and space. | Computestheabsolutevalue of anumb er in constant time and space. <claim> time_complexity=O(n^2) </claim> <claim> space_complexity=O(n) </claim> <claim> correctness=1 </claim> |
