# KompeteAI

An autonomous multi-agent system for structured, multistage pipeline generation, which automates the development of a complete machine learning pipeline.

## ⚙ Getting Started
0. Installing dependencies

```bash
pip install -r requirements.txt
```

1. Download data
   * Kompete-bench -> `KompeteBench/README.md`
   * MLE-bench -> `github.com/openai/mle-bench`
   * Your custom data...
2. Ensure that the `competition` folder contains a folder with the competition data. For example:

    ```
    competition
    |____titanic
    |______train.csv
    |______test.csv
    |______sample_submission.csv
    |______overview.txt   <-- file with a competition description. It may contain information about the target, evaluation metric, data, backstory, etc.
    ```

    ❗**Important**❗

      * The folder must be named after the competition (i.e., the folder name is significant).
      * The competition folder must contain the following four files: `train.csv`, `test.csv`, `sample_submission.csv`, and `overview.txt`.
      * The competition folder ***may also contain other data*** (e.g., a folder with images, etc.).

3. Configure `config.yaml`. You need to specify the name of the competition, which must match the folder name (more details in `doc_en/config.md`). For example:
    ```yaml
    model_name: 'gemini-2.5-flash'
    model_type: "gemini"
    
    competition: "titanic"
    log_dir: logs
    ```

4. Run proxy server (more details in `server/README.md`)
    ```bash
    uvicorn main:app --host 0.0.0.0 --port 8000
    ```

5. Ensure your API keys in `.env` are correct. For example:
    ```
    # --- API KEYS ---
    API_KEY_COUNT=3 # Number of API keys

    # API keys must have the following structure API_KEY_NUMBER:
    API_KEY_1=xxx
    API_KEY_2=xxx
    API_KEY_3=xxx

    url="URL of the proxy server"
    ```

6. Run KompeteAI 

* To run the system on a single competition, execute the following command:
    ```bash
    python3 run.py
    ```

* To run it on several competitions, you need to list them in the `several_run.py` file, for example:
    ```python
    competition_names = [
        "CONLL",
        "Creep Image Dataset",
        "DUTS_image_dataset",
        "HTRU2"
    ]
    ```

Then, execute:
```bash
python3 several_run.py
```