# Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

[Website](https://projectwebsite7.github.io/gene-dex-manip/) | [Videos](https://projectwebsite7.github.io/gene-dex-manip/static/videos-lowres/video_7.mp4)



## Environment Steup & Installation

The project is developed using Python 3.8. Please install Isaac Gym Preview 4 from the [website](https://developer.nvidia.com/isaac-gym). We use PyTorch 2.4 with cuda 12.1. Install `rl_games`, `gym`, `scipy`. 

Install `torch_cluster`: 
```bash
cd IsaacGymEnvs2/isaacgymenvs
cd whls
pip install torch_cluster-1.6.3+pt24cu121-cp38-cp38-linux_x86_64.whl
```


## Key Implementations

### Training the tracking controller using RL and IL
**RL based training scheme**

Please check `IsaacGymEnvs2/isaacgymenvs/tasks/allegro_hand_tracking_generalist.py` for details. 

For the observation space design, please refer the `compute_observations` and `comput_full_state` functions of the task class. 

For reward computation, please refer to `compute_hand_reward_tracking` functon for detailed implementations. 

**IL based training scheme**

Please check `IsaacGymEnvs2/isaacgymenvs/learning/a2c_supervised.py` for details. The imitation loss is calculated in the function `actor_loss_supervised`. 

### Homotopy optimization scheme 

**Homotopy optimization path searching** 

Please check `IsaacGymEnvs2/isaacgymenvs/train_selfexp_pool.py` for details. 

**Homotopy path generator training**

For the kinematic trajectory distribution modeling and the conditional diffusion model fine-tuning, please check `softzoo/softzoo/diffusion/main_3d_pc.py` for details. 
- Main function: `softzoo/softzoo/diffusion/main_3d_pc.py`; 
- Script: `scripts/K2_run_main_3d_pc.sh`. 


### Iterative optimization strategy

Currently this process is not fully automatic. So one need to alternate between the above two stages. No key implementations are introduced in this stage. 

### Additional detailed implementations

**Interpolating TACO sequences with GRAB trajectories**

Please check the function `interp_grab_traj_w_taco_traj` in `IsaacGymEnvs2/isaacgymenvs/utils/taco_data_utils.py` for details. 


## Usage

### Training the tracking controller using RL and IL

```bash
cd IsaacGymEnvs2/isaacgymenvs
bash scripts/run_tracking_headless_single_grab.sh
```

Note that one need to specifiy the dataset type and the path to robot tracking demonstrations in the `.sh` file. 

### Homotopy optimization scheme 

**Homotopy optimization path searching** 

Currently this process is not fully automatic. One need to first optimize per-trajectory tracking results, identify neighbours for each trajectory, set `obj_type_to_base_traj_fn` in the script file. After that, the script for the searching part should be launched with a correct search type specified. 

For single iteration search, 
```bash
cd IsaacGymEnvs2/isaacgymenvs
bash scripts/run_tracking_selfexp_headless_pool_grab.sh
```

After that, one need to update `obj_type_to_base_traj_fn`, which should be set to the best result acheived currently. 

Iteratively conducting the above two steps, we can complete the homotopy path searching process. 

**Homotopy path generator training**

Currently this process is not fully automatic either.  One need to summarize the above serached results into (child task, parent task) pairs. 

After that, to train the homotopy path generator, one need to first train a kinematic trajectory diffusion model and fine tune it into a conditional diffusion model as the homotopy path generator. 

Commands to run both the kinematics trajectory distribution modeling and the conditional diffusion model training are as follows:
```bash
cd softzoo/softzoo/diffusion
bash scripts/K2_run_main_3d_pc.sh
```

One need to specify arguments in the `.sh` file. 

For kinematic trajectory distribution modeling, we need first train a autoencoder using the following arguments: 

```bash
export train_AE="--train_AE"
export train_Diff=""
export task_cond=""
```

We then need to train the latnet diffusion model using the following arguments: 
<!-- the following modifications: -->

```bash
export train_AE=""
export train_Diff="--train_Diff"
export task_cond=""
```


For conditional diffusion model tuning, we need the following modifications: 


```bash
export train_AE=""
export train_Diff="--train_Diff"
export task_cond="--task_cond"
```

**Generating a homotopy path**

### Iterative optimization strategy


This step involves alternating between the above two stages. 


Currently, the code is a little bit unstructured and unorganized. We will devote more time in organizing the code and the training pipeline. 


### Ack 

The repo is built upon [IsaacGymEnvs](https://github.com/isaac-sim/IsaacGymEnvs). We thank authors for their great project. 



