# Knowledge Graph of Thoughts (KGoT)

<div align="center">

**[Architecture Overview](#architecture-overview)** |
**[Setup Guide](#setup-guide)** |
**[Quick Start](#quick-start)** |
**[Citations](#citations)**
</div>

<p align="center">
  <img src="paper/pics/task_transformation.png" width="80%">
</p>

This is the official implementation of Affordable AI Assistants with Knowledge Graph of Thoughts.

Knowledge Graph of Thoughts (KGoT) is an innovative AI assistant architecture that integrates LLM reasoning with dynamically constructed knowledge graphs (KGs).
KGoT extracts and structures task-relevant knowledge into a dynamic KG representation, iteratively enhanced through external tools such as math solvers, web crawlers, and Python scripts.
Such structured representation of task-relevant knowledge enables low-cost models to solve complex tasks effectively.

## Architecture Overview

The KGoT system is designed as a modular and flexible framework that consists of three main components: the **Controller**, the **Graph Store**, and the **Integrated Tools**, each playing a critical role in the task-solving process.

<p align="center">
  <img src="paper/pics/architecture.svg" width="80%">
</p>

- The [Controller](kgot/controller/README.md) component offers fine-grained control over the customizable parameters in the KGoT pipeline and orchestrates the KG-based reasoning procedure.
- The [Graph Store](kgot/knowledge_graph/README.md) component provides a modular interface for supporting various *Knowledge Graph Backends*. We initially support Neo4j, NetworkX and RDF4J.
- The [Integrated Tools](kgot/tools/README.md) component allows for flexible and extensible *Tool Usage* and enables the multi-modal reasoning capabilities of the framework.

## Setup Guide

In order to use this framework, you need to have a working installation of Python 3.10 or newer.

### Installing KGoT

Before running the installation, make sure to activate your Python environment (if any) beforehand.

```bash
git clone https://github.com/{anonymized}/knowledge-graph-of-thoughts.git
cd knowledge-graph-of-thoughts/
pip install -e .
playwright install
```

### Configuring API Keys and Models

To get started make a copy of the following template files inside the `kgot` directory:

- `kgot/config_llms.template.json`
- `kgot/config_tools.template.json`

Then rename them as follows:

- `config_llms.template.json` → `config_llms.json`
- `config_tools.template.json` → `config_tools.json`

Please update the API keys, if necessary, for the language models you intend to use in the `kgot/config_llms.json` file.
You can also add new models by incorporating their information into the JSON file.
The object key is the language model identifier used in KGoT, and the various attributes contain the information needed to run the model.

Local models are expected to be hosted using **[Ollama](https://ollama.com/)**. KGoT assumes that the model is accessible at the default Ollama API endpoint (`http://localhost:11434`) and integrates with it through **ChatOllama** via the [LangChain](https://www.langchain.com/) framework.

> [!NOTE]
> **Please be aware** that the values for `num_ctx`, `num_predict`, and `num_batch` in the configuration are based on the specific GPU type and VRAM capacity used during our experiments. You may need to adjust these parameters based on your own hardware setup to avoid out-of-memory errors or suboptimal performance.

For the `SurferAgent` tool we rely on SerpAPI for browsing necessary external information from the Internet.
To use this tool, please set the API key within the `kgot/config_tools.json` file.

### Setting Up the Containerized Environment

In order to provide a secure & consistent execution environment, we containerize critical modules such as the **Neo4j graph database**, the **RDF4J database** and the **Python Code Tool**. This allows the safe execution of LLM-generated code without security concerns.

#### Running the Container Instances

We provide a Docker and Sarus setup for the KGoT framework. The Docker setup is recommended for local development, while Sarus is intended for HPC environments.

```bash
cd containers/

# Docker
chmod -R 777 ../kgot/knowledge_graph/_snapshots # grant permission for snapshots logging
docker compose up

# Sarus
chmod +x sarus_launcher.sh # grant permission for execution
./sarus_launcher.sh
```

This will build and start the default container images for KGoT, which include:

- Neo4j image
- Python image

```bash
cd containers/kgot/

# Docker
docker compose up

# Sarus
chmod +x sarus_launcher.sh # grant permission for execution
./sarus_launcher.sh
```

This will build and start:

- KGoT image

> [!NOTE]
> Further instructions on RDF4J and on customizing the container images can be found under [Container Image Setup](containers/README.md).

> [!WARNING]
> The initial building phase of the KGoT container image can take a while (15 minutes), so be patient.
> If you need to make adjustments simply stop the instances and restart them with the following command `docker compose up --build` from the `containers/kgot` directory.
> Changes to `README.md`, `LICENSE`, `pyproject.toml`, `kgot/__init__.py` and `kgot/__main__.py` will cause the Docker instance to be rebuilt from scratch.

## Quick Start

We primarily evaluated the Knowledge Graph of Thoughts framework with the GAIA and SimpleQA benchmarks, which we discuss first and subsequently alternative ways to run KGoT.

### Datasets Download

To avoid sharing the GAIA and SimpleQA datasets in a crawlable format, we do not directly provide the datasets inside the repository. Instead, we offer a download script to help you acquire the datasets.
Please refer to the [download guide](benchmarks/datasets/README.md) inside the `benchmarks` directory for further instructions.

#### Evaluating KGoT on the Datasets

We provide two run scripts for evaluating KGoT on the GAIA and SimpleQA datasets.

```bash
chmod +x ./run_multiple_gaia.sh # grant permission for logging etc.
./run_multiple_gaia.sh          # perform the actual run with default parameters

chmod +x ./run_multiple_simpleqa.sh # grant permission for logging etc.
./run_multiple_simpleqa.sh          # perform the actual run with default parameters
```

The following instructions apply to the `run_multiple_gaia.sh` script, but are also applicable to the `run_multiple_simpleqa.sh` script; for more detailed instructions please refer to the complete guide [here](benchmarks/README.md).

You can run `./run_multiple_gaia.sh --help` to check the supported arguments, that generally match the options found [here](benchmarks/README.md) for the `gaia.py` Python script.
For optimal results, we recommend enabling the '--gaia_formatter' option, which will format the output in a GAIA-compatible format.

The following are the most commonly used arguments:

```
Arguments:
  --log_base_folder     - Directory where logs will be stored [path/to/log_folder]
  --controller_choice   - Type of solver to use               [directRetrieve/queryRetrieve]
  --backend_choice      - Backend database type               [neo4j/networkX/rdf4j]
  --tool_choice         - Tool configuration                  [tools_v2_3]
  --max_iterations      - Max iterations for KGoT             [integers > 0]
  --gaia_formatter      - Use GAIA formatter for the output   [True/False]

Example: ./run_multiple_gaia.sh --log_base_folder logs/test_1 --controller_choice directRetrieve --backend_choice networkX --tools "tools_v2_3" --max_iterations 5 --gaia_formatter
```

We offer three choices for storing the [knowledge graph](kgot/knowledge_graph/README.md) (Neo4j, NetworkX and RDF4J) as well as two choices for the [retrieval type](kgot/controller#knowledge-extraction) (direct and query-based retrieval).

### Using Knowledge Graph of Thoughts

We offer two ways to evaluate KGoT on the datasets as well as a way to use KGoT directly with any task description.

As discussed above, you can use the `run_multiple_gaia.sh` or `run_multiple_simpleqa.sh` scripts to evaluate KGoT on the GAIA and SimpleQA datasets respectively, which act as frontends for the `gaia.py` and `simpleqa.py` Python scripts.
They allow to evaluate multiple subsets of the datasets or to do multiple runs on these subsets, while also transfering the knowledge graph snapshots as well as plotting the results with various metrics.
We further discuss the use of the scripts [here](benchmarks#evaluating-multiple-subsets).
Please note, that if you use your own Neo4j or RDF4J server instead of the one inside the Docker container, the transfer of the knowledge graph snapshots will fail or needs to be adapted.

You can also directly run the Python script [gaia.py](benchmarks/gaia.py), which we further discuss [here](benchmarks#evaluate-kgot-on-gaia).
This Python script will however not plot the resulting data nor move the snapshots of the knowledge graph.

You can also directly use the command `kgot`, which is fully configurable and can be used to solve a single task:

```bash
kgot single -p "What is a knowledge graph?"
```

You can also, for example, select a desirable backend and pass files via the command line:

```bash
kgot --db_choice neo4j --controller_choice directRetrieve single --p "Could you summarize the content of these files for me?" --files [path/to/file1] [path/to/file2]
```

## Citations

If you find this repository useful, please consider giving it a star! If you have any questions or feedback, don't hesitate to reach out and open an issue.

When using this in your work, please reference us with the citation provided below:
