# Kaggle Descriptions Database

This folder contains a **Kaggle database** that stores descriptions of some of the best solutions from Kaggle competitions.  
It is intended for idea generation and retrieval-augmented generation (RAG).

## Folder Structure

- **`competitions_ideas.json`**  
  Contains `solution_texts` — textual descriptions of top solutions for various Kaggle competitions.

- **`competitions.yml`**  
  A YAML file with curated links to the best solution writeups for each competition.

- **`create_kaggle_database.ipynb`**  
  A Jupyter Notebook example demonstrating how to build the Kaggle descriptions database.

- **`ideas_metadata.json`**  
  Metadata used to match each description to its corresponding competition link.

- **`ideas_vectorbase.faiss`**  
  The FAISS vector store containing embeddings of the solution descriptions.

- **`kaggle_data_class.py`**  
  Python script containing the logic for building and managing the Kaggle database.
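
The files above are meant to work together: a query is matched against the embedded descriptions, and the position of each hit is used to look up its text and competition link. The sketch below illustrates that lookup pattern only. It makes several assumptions: the records in `solution_texts`, `metadata`, and `vectors` are invented toy data (the real JSON schemas may differ), the `example.com` links are placeholders, and a brute-force cosine search in pure Python stands in for the FAISS index so the example runs without extra dependencies.

```python
import math

# Hypothetical toy data; the real files may use a different schema.
# Position i in each list refers to the same solution description.
solution_texts = [
    "Gradient boosting with target encoding and feature engineering.",
    "Ensemble of convolutional networks with test-time augmentation.",
    "Transformer fine-tuning with pseudo-labeling.",
]
metadata = [
    {"competition": "example-tabular", "link": "https://example.com/tabular"},
    {"competition": "example-vision", "link": "https://example.com/vision"},
    {"competition": "example-nlp", "link": "https://example.com/nlp"},
]
# Toy 3-dimensional embeddings standing in for the stored vectors.
vectors = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query_vector, k=2):
    """Return the k most similar descriptions with their metadata."""
    scores = [cosine(query_vector, v) for v in vectors]
    # Rank positions by similarity, highest first.
    ranked = sorted(range(len(vectors)), key=lambda i: scores[i], reverse=True)
    return [
        {"score": scores[i], "text": solution_texts[i], **metadata[i]}
        for i in ranked[:k]
    ]


if __name__ == "__main__":
    # A query vector that is closest to the second toy embedding.
    for hit in search([0.1, 0.9, 0.2], k=2):
        print(hit["competition"], hit["link"], round(hit["score"], 3))
```

In a real setup, the brute-force loop would be replaced by a search against `ideas_vectorbase.faiss`, and the query vector would come from the same embedding model used to build the index.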