# World Representation Visualizations (Small)

Interactive 3D visualizations for the paper "Convergent World Representations and Divergent Tasks".

**Full dataset:** All runs, seeds, and visualization variants are available at:
https://osf.io/jb8an/?view_only=da001f31c0534dc0b6476141f30db90d

## Overview

This folder contains a subset of HTML visualizations showing how neural network representations of cities evolve under different training conditions. The paper studies how autoregressive transformers trained on geometric tasks (distance, angle, etc.) over 5,075 real-world city coordinates develop internal "world representations."

## Folder Structure

```
world_representation_vis_small/
├── README.md
├── well_integrated/              (3 files)
├── ill_integrated/               (3 files)
└── single_task_pretraining_seed0/ (7 files - raw only)
```

## Task Reference

| Task # | Task Name     | Description                                      |
|--------|---------------|--------------------------------------------------|
| 1      | distance      | Euclidean distance between 2 cities              |
| 2      | trianglearea  | Area of triangle formed by 3 cities              |
| 3      | angle         | Angle at vertex city given 3 cities              |
| 4      | compass       | Compass direction (N/NE/E/etc.) between 2 cities |
| 5      | inside        | Whether a city is inside convex hull of others   |
| 6      | perimeter     | Perimeter of polygon formed by n cities          |
| 7      | crossing      | Whether two line segments intersect              |
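
For readers who want a concrete picture of what these tasks compute, here is a minimal sketch of five of them on plain 2D points. It is illustrative only: the function names, the use of degrees, the 8-way compass binning, and the "y up = north" convention are assumptions, not the paper's actual task format or tokenization. The `inside` and `crossing` tasks are omitted for brevity.

```python
import math

def distance(a, b):
    """Euclidean distance between two (x, y) points."""
    return math.hypot(b[0] - a[0], b[1] - a[1])

def triangle_area(a, b, c):
    """Area of the triangle a-b-c via the cross product (shoelace) formula."""
    return abs((b[0] - a[0]) * (c[1] - a[1]) - (c[0] - a[0]) * (b[1] - a[1])) / 2

def angle(a, vertex, c):
    """Unsigned angle at `vertex`, in degrees (assumed unit)."""
    v1 = (a[0] - vertex[0], a[1] - vertex[1])
    v2 = (c[0] - vertex[0], c[1] - vertex[1])
    cross = v1[0] * v2[1] - v1[1] * v2[0]
    dot = v1[0] * v2[0] + v1[1] * v2[1]
    return math.degrees(math.atan2(abs(cross), dot))

def compass(a, b):
    """8-way direction from a to b, assuming +y is north and +x is east."""
    names = ["E", "NE", "N", "NW", "W", "SW", "S", "SE"]
    theta = math.degrees(math.atan2(b[1] - a[1], b[0] - a[0])) % 360
    # Each direction owns a 45-degree sector centered on its heading.
    return names[int((theta + 22.5) // 45) % 8]

def perimeter(points):
    """Perimeter of the closed polygon visiting `points` in order."""
    n = len(points)
    return sum(distance(points[i], points[(i + 1) % n]) for i in range(n))
```

For example, `distance((0, 0), (3, 4))` gives `5.0` and `compass((0, 0), (1, 1))` gives `"NE"` under these conventions.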

## well_integrated/ & ill_integrated/

Atlantis integration comparison from Figure 6 of the paper:

| Folder | Fine-tuning Tasks | Notes |
|--------|-------------------|-------|
| `well_integrated/` | inside + perimeter | No divergent task |
| `ill_integrated/` | perimeter + distance | Includes divergent distance task |

Each folder contains 3 files, with names ending in `_mixed.html`, `_raw.html`, and `_na.html` (see Visualization Types).

## single_task_pretraining_seed0/

Seven files, one per task, each showing a model trained from scratch on a single task (seed 0, raw PCA only).

**Naming:** `pt1-{task}_seed0_raw.html`
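
To find these files programmatically, a small helper like the one below can glob for the naming pattern. This is a sketch: the function name `list_single_task_files` and the `root` argument are hypothetical, and the helper matches the pattern with a wildcard rather than assuming what `{task}` expands to.

```python
from pathlib import Path

def list_single_task_files(root, folder="single_task_pretraining_seed0"):
    """Return sorted file names matching pt1-{task}_seed0_raw.html.

    `root` is the path to this visualization folder (an assumption about
    where you unpacked it). The {task} part is matched with a wildcard.
    """
    pattern = "pt1-*_seed0_raw.html"
    return sorted(p.name for p in (Path(root) / folder).glob(pattern))
```

Each returned file is a standalone HTML page, so it can be opened directly in a browser, for instance with Python's `webbrowser.open` on the file's URI.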

## Visualization Types

| Type | Description |
|------|-------------|
| raw | Raw PCA of all activations |
| na | First two dimensions are linearly probed x, y coordinates; the third is a residual principal component. Atlantis is NOT included in probe training |
| mixed | First two dimensions are linearly probed x, y coordinates; the third is a residual principal component. Atlantis is included in probe training |
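
The sketch below illustrates one plausible way to compute the two kinds of 3D coordinates described above with NumPy. It is not the paper's code. The function names, the least-squares probe, and the definition of "residual PC" (top principal component after projecting out the probe directions) are all assumptions. The `train_mask` argument marks which points the probe is fit on, which is how the `na` and `mixed` variants would differ.

```python
import numpy as np

def raw_pca(H, k=3):
    """Project activations H (n, d) onto their top-k principal components."""
    Hc = H - H.mean(axis=0)
    _, _, Vt = np.linalg.svd(Hc, full_matrices=False)  # rows of Vt: PC directions
    return Hc @ Vt[:k].T

def probe_plus_residual(H, xy, train_mask):
    """Return (n, 3): probed x, probed y, and one residual PC (assumed recipe).

    H: (n, d) activations; xy: (n, 2) true coordinates;
    train_mask: boolean (n,) selecting the points used to fit the probe.
    """
    Hc = H - H[train_mask].mean(axis=0)
    # Fit an affine linear probe by least squares on the training points only.
    A = np.hstack([Hc[train_mask], np.ones((int(train_mask.sum()), 1))])
    coef, *_ = np.linalg.lstsq(A, xy[train_mask], rcond=None)
    W, b = coef[:-1], coef[-1]
    xy_hat = Hc @ W + b  # probe applied to every point, trained on or not

    # Remove the probe subspace, then take the top PC of what remains.
    Q, _ = np.linalg.qr(W)  # (d, 2) orthonormal basis of the probe directions
    R = Hc - (Hc @ Q) @ Q.T
    R = R - R.mean(axis=0)
    _, _, Vt = np.linalg.svd(R, full_matrices=False)
    return np.column_stack([xy_hat, R @ Vt[0]])
```

Under this reading, `na` would correspond to a `train_mask` that is `False` for Atlantis points, and `mixed` to a mask that includes them. In both cases the probe is still applied to all points for plotting.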
