# Functional Equivalence in Attention: A Comprehensive Study with Applications to  Linear Mode Connectivity


[![Documentation](https://img.shields.io/badge/docs-passing-brightgreen)](https://github.com/repo/docs)
[![Paper](https://img.shields.io/badge/arXiv-XXXX.XXXXX-blue)](https://arxiv.org/abs/XXXX.XXXXX)

This repository accompanies the paper:
***“Functional Equivalence in Attention: A Comprehensive Study with Applications to  Linear Mode Connectivity”*** (ICLR 2026 Submission)
<p align="center"><strong>Linear Mode Connectivity</strong></p>
<p align="center">
  <img src="plots/lmc_visualization.png" width="500px"/>
</p>


## Installation

```bash
git clone https://github.com/repo/lmc-.git
cd moe-lmc
pip install -e .
pip install -r requirements.txt
```

## Repository Structure

```bash
src/
├── agnews
├── cifar10
├── cifar100
├── datasets.py
├── dbpedia
├── imagenet
├── imdbreview
├── lgmodeling
├── matching_utils.py
├── mnist
├── online_stats.py
├── run
├── utils.py
└── vision_transformer
```

## Citation

If you find this work helpful, please consider citing:

```bibtex
@article{our2025tranformerlmc,
  title={Functional Equivalence in Attention: A Comprehensive Study with Applications to  Linear Mode Connectivity},
  author={Coauthors},
  journal={arXiv:XXXX.XXXXX},
  year={2025}
}
```


## Acknowledgements

We thank contributors and maintainers of open-source libraries including PyTorch, JAX, Flax, and HuggingFace Transformers. Special thanks to the authors of recent works on LMC and MoE architectures for foundational insights.


## Contributing

We welcome pull requests and suggestions. Please ensure new features or bug fixes include tests where appropriate and follow existing code style.


## License

This project is licensed under the MIT License.

