## Code for ICLR 2026 Submission: 
# Targeted perturbations reveal brain-like local coding axes in robustified, but not standard, ANN-based brain models
### Anonymous authors
_______

## Install 
Note: For the purposes of anonymity and to keep this supplementary material under the file size limit, many of the necessary larger files to run this code are not attached (datasets, network activations, etc.). We will release the code present here, along with the datasets and necessary files to run it, upon acceptance of the paper. 

### Setup
To build the environment, run 
```
conda env create -f environment.yml
conda activate attack-brain
```

### Adversarial Attacks (Part 1)

To reproduce the results from Section 1, run the script `python attacks/attack.py --model_name`, where `--model_name` is a valid architecture as defined in `brainmodels/models.py`.

A short sample output for a base case is provided in `attacks/sample_results_output`. 

To reproduce the epsilon curve (that sensitivity increases as epsilon increases), run `python epsilon_curve/epsilon_curve.py`. See the related short sample output in `sample_output_alexnet_s1_FFA_attack_results.csv`.

### Model Discrimination (Part 2)

To reproduce the model discrimination results in Part 2, run the code `python attacks/model_discrimination.py` to visualize the difference between the sensitivity metric and predictivity (correlation) at discriminating between models.

### Transfer Attacks and Perturbation Subspaces (Part 3)

Code for Section 3 is mostly in the `transfer_attacks` folder. Here, the script `transfer_attacks.py` performs all transfer attacks and saves the corresponding matrices. An example visualization of the saved transfer matrices is in the same folder, titled `example_l2_transfer_matrix_ranked.png`. The script `FIM.py`, which takes arguments `--subject` and `--region`, will use the three metrics described in Section 3 to compare perturbation subspaces. An example output is provided in `example_subspace_similarity_img_284.pdf`.


### Shared Attacks (Part 4)

The visualizations for the last section of the paper are provided in the folder `feature_visualization`. The script `sensitivity_threshold_matrix.py` generates the attack transfer matrix where the diagonal (sensitivity) is fixed, highlighting the ability of robustified encoding models to transfer attacks to other models. This is illustrated in sample outpout `l2_transfer_thresh_ranked.pdf`. Last, the script `feature_visualization.py` can be used to take any arbitrary image as an input, and will output a map of its perturbations ranging from minimizing to maximizing a selected brain region.



