# Supplementary Material for "KARMA: Knowledge-Aware Reward Mechanism Adjustment via Causal AI"

This supplementary material supports the main paper with algorithm descriptions, full proofs, extended experimental results, and source code.

## Directory Structure

The supplementary material is organized into the following directories:

- `algorithms/`: Contains detailed pseudocode and algorithmic descriptions for the KARMA framework components, split into individual files for clarity.
- `proofs/`: Contains comprehensive theoretical proofs for the theorems stated in the main paper.
- `experiments/`: Provides additional experimental details, extended results, and in-depth analysis of the evaluation.
- `code/`: Contains the full source code for the KARMA framework, organized into a clear project structure.

## Content Overview

### 1. Detailed Algorithms and Pseudocode (`algorithms/`)

This directory provides a step-by-step breakdown of the algorithms underpinning the KARMA framework, with each major algorithm in its own dedicated Markdown file:
- **`knowledge_graph_algorithm.md`**: Details how domain knowledge is structured and embedded (Algorithm 1).
- **`causal_structure_learning_algorithm.md`**: Explains the modified PC algorithm for discovering causal relationships (Algorithm 2).
- **`reward_adjustment_algorithm.md`**: Describes the mechanism for computing and combining adjusted rewards (Algorithm 3).
- **`karma_training_loop.md`**: Integrates all modules into the overall reinforcement learning process (Algorithm 4).
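
For orientation, the idea of a knowledge-constrained PC algorithm can be sketched as follows. This is a minimal illustration of the skeleton phase only and is not the procedure specified in Algorithm 2. It assumes linear-Gaussian data, a Fisher z-test for conditional independence, and prior knowledge given as lists of required and forbidden edges. The names `constrained_skeleton`, `required`, and `forbidden` are hypothetical and chosen for this example.

```python
import itertools
import math

import numpy as np


def partial_corr(data, i, j, cond):
    """Partial correlation of columns i and j given the columns in cond."""
    idx = [i, j, *cond]
    corr = np.corrcoef(data[:, idx], rowvar=False)
    prec = np.linalg.pinv(corr)  # precision matrix of the selected variables
    return -prec[0, 1] / math.sqrt(prec[0, 0] * prec[1, 1])


def is_independent(data, i, j, cond, alpha):
    """Fisher z-test: True if zero partial correlation is not rejected."""
    n = data.shape[0]
    r = float(np.clip(partial_corr(data, i, j, cond), -0.999999, 0.999999))
    z = 0.5 * math.log((1 + r) / (1 - r))
    stat = math.sqrt(n - len(cond) - 3) * abs(z)
    p_value = math.erfc(stat / math.sqrt(2))  # two-sided normal p-value
    return p_value > alpha


def adjacent(edges, node, exclude):
    """Current neighbours of node, leaving out the other endpoint."""
    return sorted({k for e in edges if node in e for k in e} - {node, exclude})


def constrained_skeleton(data, required=(), forbidden=(), alpha=0.01, max_cond=2):
    """PC skeleton search that never tests required or forbidden edges."""
    d = data.shape[1]
    required = {frozenset(e) for e in required}
    forbidden = {frozenset(e) for e in forbidden}
    # Start from the complete graph, drop forbidden edges, keep required ones.
    edges = {frozenset(p) for p in itertools.combinations(range(d), 2)} - forbidden
    edges |= required
    for size in range(max_cond + 1):
        for edge in sorted(edges, key=sorted):
            if edge in required:
                continue  # prior knowledge overrides the statistical test
            i, j = sorted(edge)
            cond_sets = itertools.chain(
                itertools.combinations(adjacent(edges, i, j), size),
                itertools.combinations(adjacent(edges, j, i), size),
            )
            if any(is_independent(data, i, j, c, alpha) for c in cond_sets):
                edges.discard(edge)
    return {tuple(sorted(e)) for e in edges}


# Synthetic chain X0 -> X1 -> X2 (assumed data, for illustration only).
rng = np.random.default_rng(0)
x0 = rng.normal(size=2000)
x1 = x0 + rng.normal(size=2000)
x2 = x1 + rng.normal(size=2000)
data = np.column_stack([x0, x1, x2])

unconstrained = constrained_skeleton(data)
constrained = constrained_skeleton(data, required=[(0, 2)], forbidden=[(0, 1)])
print(unconstrained)
print(constrained)
```

In this sketch the constraints are hard: a required edge is never tested, and a forbidden edge is removed before any test runs. A softer variant could instead adjust the significance level per edge.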

### 2. Theoretical Proofs (`proofs/theoretical_proofs.md`)

This file gives full proofs of the theorems stated in the main paper. It covers:
- **Proof of Theorem 1:** Convergence of Causal Structure Learning.
- **Proof of Theorem 2:** Policy Invariance for Knowledge-Based Shaping.
- **Proof of Theorem 3:** Sample Efficiency Improvement.
- **Proof of Theorem 4:** Convergence of KARMA-RL.
- **Additional Theoretical Results:** Lemmas on knowledge graph embedding stability, identifiability of causal effects, and robustness to knowledge inconsistencies.
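
As background for Theorem 2, the classical policy-invariance result for potential-based shaping can be checked numerically. The sketch below does not reproduce the theorem or its proof. It assumes a shaping term of the standard form F(s, a, s') = γΦ(s') − Φ(s), with a random potential `phi` standing in for whatever knowledge-derived potential the paper uses, and a small random MDP invented for this example.

```python
import numpy as np


def value_iteration(P, R, gamma, iters=1000):
    """Optimal Q-values for transitions P[s, a, s'] and rewards R[s, a, s']."""
    n_states, n_actions, _ = P.shape
    Q = np.zeros((n_states, n_actions))
    for _ in range(iters):
        V = Q.max(axis=1)
        # Bellman optimality backup: expectation over next states s'.
        Q = np.einsum("sat,sat->sa", P, R + gamma * V[None, None, :])
    return Q


# Hypothetical random MDP and potential function (assumptions for illustration).
rng = np.random.default_rng(0)
n_states, n_actions, gamma = 5, 2, 0.9
P = rng.random((n_states, n_actions, n_states))
P /= P.sum(axis=2, keepdims=True)  # normalise rows into probabilities
R = rng.normal(size=(n_states, n_actions, n_states))
phi = rng.normal(size=n_states)

# Potential-based shaping term F(s, a, s') = gamma * phi(s') - phi(s).
F = gamma * phi[None, None, :] - phi[:, None, None]

Q = value_iteration(P, R, gamma)
Q_shaped = value_iteration(P, R + F, gamma)

# Shaping shifts Q-values by -phi(s) per state, so the greedy policy is unchanged.
print(np.allclose(Q_shaped, Q - phi[:, None], atol=1e-8))
print(Q.argmax(axis=1), Q_shaped.argmax(axis=1))
```

Because the shift −Φ(s) is the same for every action in a state, the ranking of actions is preserved, which is the invariance property in the classical setting.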

### 3. Supplementary Experimental Details and Additional Results (`experiments/experimental_details.md`)

This document expands on the experimental methodology and presents further results. It includes:
- **Detailed Environment Descriptions:** In-depth explanations of the GridWorld Causal Interference (GCD), Robot Skill Acquisition (RSA), and Traffic Signal Control (TSC) environments.
- **Additional Experimental Results:** Comprehensive statistical significance analyses, detailed ablation study breakdowns, and extended generalization and robustness analyses with additional plots.
- **Implementation Details and Hyperparameters:** A complete list of software, hardware, and hyperparameter configurations for reproducibility.
- **Qualitative Analysis and Case Studies:** In-depth case studies illustrating KARMA's operational mechanisms in specific scenarios.
- **Broader Impact and Ethical Considerations:** An expanded discussion on the potential positive impacts, risks, and mitigation strategies.
- **Limitations and Future Work:** A detailed overview of current limitations and promising research directions.

### 4. Full Code Implementation (`code/`)

This directory contains the complete source code for the KARMA framework, designed for clarity, modularity, and reproducibility. Key components include:
- **`src/karma/`**: Core Python modules implementing the KARMA framework.
    - `knowledge_representation.py`: Handles knowledge graph embedding and state integration.
    - `causal_learning.py`: Implements knowledge-constrained causal discovery and SCM building.
    - `reward_adjustment.py`: Manages counterfactual-based reward computation and dynamic combination.
    - `karma_agent.py`: The main KARMA agent and training loop.
- **`requirements.txt`**: Lists all necessary Python dependencies.
- **`setup.py`**: Setup script for easy installation of the `karma-rl` package.
- **`README.md` (in `code/`)**: Provides detailed instructions on installation, usage, configuration, and testing of the code.

## Anonymization

All files have been carefully reviewed to ensure that no identifying information (e.g., author names, affiliations, specific project names) is present. The code examples are conceptual and do not contain any proprietary or sensitive data.

## Usage

To view the supplementary material, open the Markdown files in each directory with any Markdown viewer. Start with this `README.md` for an overview, and refer to `code/README.md` for code usage.

