# NanoAssemblyGym

This is the README for the supplementary material of the paper "Building Molecular Assemblies via SMT-Based Planning and Reinforcement Learning".

The supplementary material contains an extended version of the paper with appendix (`paper_appendix.pdf`), videos showing the execution of various assemblies (`VIDEOS`), the source code for `NanoAssemblyGym`, and policies used to construct assemblies for both of the molecules that are discussed in the paper.

In order to access the files, please unzip the zip archive:

```
unzip supplementary_material_icml.zip
```

In the following, we describe how the library can be installed and used to construct assemblies.

## Videos

If you do not wish to install the library, we provide several videos showing the execution of our framework for different assemblies.

## Installation

NanoAssemblyGym is implemented as a python libary and can be installed using the package manager pip.
It requires python version >= 3.10.
We recommend installing the library and all dependencies in an individual virtual environment.
Such a virtual environment can be created by issuing the following commands:

```
cd NanoAssemblyGym
python -m venv env
```

This will create the virtual environment `env` in the directory `NanoAssemblyGym`.

Before any script using `NanoAssemblyGym` can be executed, the virtual environent needs to be activated once, by issuing:

```
source env/bin/activate
```

You can then install `NanoAssemblyGym` and all other requirements to reproduce the experiments via:

```
pip install -r requirements.txt
pip install -e .
```

### Trained Policies

The policies that were used to evaluate the assembly construction are in the directory `NanoAssemblyGym/policies`. All policies were exported during training and represent the models that achieved the best rewards during training.

The policy for the circular shaped molecule can be found in `NanoAssemblyGym/policies/circ` and the policy for the cross-shaped molecules in `NanoAssemblyGym/policies/cross` and `NanoAssemblyGym/policies/cross_high_variance`.

#### Environments

The following environments were used to evaluate our complete framework. The naming scheme for these environments follows the following structure:

Each name encodes the type of molecule, surface, the initial distribution assumed for the molecules, and the identifier of the assembly. We note that the the quantum corral environment is the only exception that does not follow this naming scheme, but only encodes which type of molecule is simulated.


 - MoleculeMovement/FePc-Au111-MockUp-Uniform-Square40N36-90x90-v0
 - MoleculeMovement/FePc-Au111-MockUp-Uniform-Square110N96-180x180-v0
 - MoleculeMovement/FePc-Au111-MockUp-Uniform-Square220N180-310x310-v0
 - MoleculeMovement/FePc-Au111-MockUp-Uniform-Square360N288-470x470-v0
 - MoleculeMovement/FePc-Au111-MockUp-Uniform-Square520N420-640x640-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-Square40N36-90x90-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-Square110N96-180x180-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-Square220N180-310x310-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-Square360N288-470x470-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-Square520N420-640x640-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-Square40N36-90x90-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-Square110N96-180x180-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-Square220N180-310x310-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-Square360N288-470x470-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-Square520N420-640x640-v0
 - MoleculeMovement/FePc-Au111-MockUp-Uniform-HoneyComb-1-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-HoneyComb-1-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-HoneyComb-1-v0
 - MoleculeMovement/FePc-Au111-MockUp-Uniform-HoneyComb-2-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-HoneyComb-2-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-HoneyComb-2-v0
 - MoleculeMovement/FePc-Au111-MockUp-Uniform-HoneyComb-3-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-HoneyComb-3-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-HoneyComb-3-v0
 - MoleculeMovement/FePc-Au111-MockUp-Uniform-HoneyComb-4-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-HoneyComb-4-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-HoneyComb-4-v0
 - MoleculeMovement/FePc-Au111-MockUp-Uniform-HoneyComb-5-v0
 - MoleculeMovement/FePc-Au111-HighVar-MockUp-Uniform-HoneyComb-5-v0
 - MoleculeMovement/CiRc-Au111-MockUp-Uniform-HoneyComb-5-v0
 - MoleculeMovement/CiRc-Quantum-Corral-250x250
 - MoleculeMovement/FePc-Quantum-Corral-250x250
 - MoleculeMovement/FePc-HighVar-Quantum-Corral-250x250


### Running an Experiment


python evaluate_CrossQ.py --env <env> --render-mode human --scale <int> --max-steps 1000 --model-path policies/cross --corridor-width 6.0

Some examples are:
```
python evaluate_CrossQ.py --env "MoleculeMovement/FePc-Au111-MockUp-Uniform-HoneyComb-2-v0" --render-mode human --scale 7 --max-steps 1000 --model-path policies/cross --corridor-width 6.0
python evaluate_CrossQ.py --env "MoleculeMovement/FePc-Au111-MockUp-Uniform-HoneyComb-2-v0" --render-mode human --scale 7 --max-steps 1000 --model-path policies/cross_high_variance --corridor-width 6.0
python evaluate_TQC.py --env "MoleculeMovement/CiRc-Au111-MockUp-Uniform-HoneyComb-2-v0" --render-mode human --scale 7 --max-steps 1000 --model-path policies/cross --corridor-width 6.0
```
The results will be stored as individual csv-files in the directory `evaluation`.


### Evaluating the Results

Each run of an experiment stores the complete log of the execution as a csv-file in the directory `evaluation`. This directory contains two scripts that allow you to analyze each run.

You first have to change into that directory by calling:

```
cd evaluation
```

You can then check the results for the construction plan by calling:

```
./analyze_construction_plan.py --pattern <csv-file>
```

(Note that the result will output the median, quantiles, etc. as this script also allows multile files to be passed via a regular expression.)

In order to check the results of the assembly construction, you can run:

```
python analyze_assembly_run.py <csv-file>
```

To exit the directory, call

```
cd ..
```


### Training of RL Agents

You can also retrain agents using the environment from Figure 6b:

The environments for the circular and cross-shaped molecules are the following:
 - MoleculeMovement/CiRc-MockUp-Star-40x40-v0
 - MoleculeMovement/FePc-MockUp-Star-40x40-v0
 - MoleculeMovement/FePc-MockUp-HighVar-Star-40x40-v0

In order to train an agent you can run:

```
python train.py --render-mode none --env "MoleculeMovement/FePc-MockUp-Star-40x40-v0" --corridor-width 6.0 --algo CrossQ --total-timesteps 500000
```
to train an agent to manipulate the cross-shaped molecules with the lower covariance.

The policies can then be found in the directory `trained_policies` (this can be changed by passing a directory via the `--dir` parameter).
