# PromptGNN-sim: Deep Fusion and Alignment of GNN and LLMs for Text-Attributed Graph Learning

This repository contains the code for reproducing the experimental results on the Cora dataset from our paper, "PromptGNN-sim: Deep Fusion and Alignment of GNN and LLMs for Text-Attributed Graph Learning".

**Note:** The entire code will be made public upon paper acceptance. It is currently available for review purposes only.

## Overview

This project includes the core code used in our ICLR 2026 submission. The primary focus is to demonstrate the performance of our proposed method on the Cora dataset, which can be accessed \href{https://drive.google.com/file/d/1hxE0OPR7VLEHesr48WisynuoNMhXJbpl/view}{here}. The repository contains the following:

* Data preprocessing scripts
* The implementation of our model
* Training and evaluation scripts
* Utilities for generating the figures and tables presented in the paper

---

## Setup

We recommend using a `conda` or `venv` environment for a clean setup.

1. **Install Dependencies**
    All necessary dependencies are listed in the `environment.yml` file. You can install them with a single command:
    ```bash
    conda env create -f environment.yml
    ```
    **Note:** Ensure you have the correct version of PyTorch installed for your system.

---

## Dataset

We have preprocessed the Cora dataset and placed it in the `dataset` directory. Additionally, we have provided LLM-generated data.

## How to Reproduce the Experiments

To reproduce the main results from the paper, run the primary script with the appropriate configuration file.


```bash
python -m core.main --warmup 20 --train_task node --test_task node --node_type gat --node_embed_type tfidf --edge_type gcn --edge_embed_type tfidf --lr 1e-5 --weight_decay 5e-8 --batch_size 128 --reg 0.1 --seqlen 128 --temperature 0.07 --neg_k 1 --fixed_length 20 --epochs 30 --source_gnn_in_channels 1433 --target_gnn_in_channels 1433 --gnn_hidden_channels 512 --gnn_out_channels 384 --source_dataset_name cora --target_dataset_name cora --seed 0 --source_label_num 7 --target_label_num 7 --source_gcn_in_channels 1433 --target_gcn_in_channels 1433 --source_save_dir core/llm_llama3.2_response/cora --target_save_dir core/llm_llama3.2_response/cora --drop_edge_ratio 0.0 --drop_node_ratio 0.0 --text_mask_ratio 0.0 --user_cross --num_layers 4
