# MAS<sup>3</sup>AC: Multi-Agent Safe and Stable Soft Actor-Critic

This repository contains the **anonymized implementation of MAS<sup>3</sup>AC for one cooperative multi-agent task**, provided as supplementary material for *ICLR review*. It includes codes for one **cooperative task only**. General-sum tasks (with different task structures and training code) and more codes for additional cooperative tasks will be released in the full public repository after acceptance.

---

## Overview

**MAS<sup>3</sup>AC (Multi-Agent Safe and Stable Soft Actor-Critic)** is an algorithm developed for **safe and stable multi-agent reinforcement learning in general settings**, *without strictly requiring that the task is cooperative, or relying on tricks like parameter sharing*.

---

## Installation

Install required packages listed in `requirements.txt`.

Then navigate to the repository root (`MAS3AC-main`), and install it by running:

```
pip install -e .
```

Ensure **MuJoCo** is correctly installed, and set the library path (adjusting the path to your local installation; `mujoco200` is used as an example).
```
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/path/to/.mujoco/mujoco200/bin
```

---

## Running Experiments

To run MAS<sup>3</sup>AC on cooperative MuJoCo tasks, first navigate to the directory `examples`, and then use the following commands (*replace* `/path/to/the/repository/` *with your actual path*). You may modify the `--seed` argument to run with different random seeds.



### Coupled HalfCheetah 

```
python train.py --load_config /path/to/the/repository/tuned_configs/mamujoco/Coupled_HalfCheetah-1p1/mas3ac/config.json --seed 0
```


---

> ⚠️ **Note for Reviewers**
>
> - The code is provided to reproduce one **cooperative MARL** result for ICLR review.  
> - A complete repository (including **general-sum** tasks and more cooperative tasks) will be released publicly **after acceptance**.  
> - This repository is developed based on **[Heterogeneous-Agent Reinforcement Learning (HARL)](https://github.com/PKU-MARL/HARL)**.