# PQ-Net: A Unified Framework for Time Series Forecasting

PQ-Net is a unified framework for training and evaluating time series forecasting models. The default model is PQNet (Periodic Query Network), but the framework also supports various state-of-the-art models such as Informer, Autoformer, Transformer, PatchTST, DLinear, Linear, NLinear, SegRNN, CycleNet, QuLTSF, iTransformer, and TimeXer. The framework provides a unified command-line interface via `run.py` for training, testing, and prediction.

Key features of PQNet include:
- **LPV (Periodic Query) Temporal Representation**: Uses `cycle` indices to query temporal context from a learnable table.
- **Channel Aggregation**: Employs quantum attention layers (DRQC in `layers/QANLayer.py`) for channel-wise aggregation.
- **RevIN (Optional)**: Instance normalization and denormalization to improve cross-distribution generalization.
- **Simple MLP Head**: For representation and output.

### Key Directory Structure
- `run.py`: Unified entry point for command-line argument parsing and experiment control.
- `exp/exp_main.py`: Core training, validation, testing logic, and model selection.
- `models/`: Model definitions (PQNet is in `models/PQNet.py`).
- `layers/`: Model components and quantum layers (e.g., `QANLayer.py`).
- `data_provider/`: Dataset definitions and loaders (ETT, Solar, PEMS, custom datasets).
- `utils/`: Metrics, visualization, and utility functions.
- `scripts/PQNet/`: Example shell scripts for running experiments on common datasets.

---

## Environment Setup

It is recommended to use a virtual environment. Install dependencies listed in `requirements.txt` (including PyTorch, PennyLane, Qiskit, etc.):

```bash
pip install -r requirements.txt
```

Notes:
- The dependencies include the GPU version of PyTorch (e.g., `torch==2.4.1+cu121`). Ensure your CUDA environment matches.
- For CPU-only or different CUDA versions, adjust the PyTorch installation accordingly.
- PQNet includes configurations for remote quantum computation via Qiskit (see `models/PQNet.py`). These are not exposed via CLI and can be left as default.

---

## Data Preparation

The framework supports the following datasets (defined in `data_provider/data_factory.py`):
- `ETTh1`, `ETTh2` (hourly-level ETT datasets)
- `ETTm1`, `ETTm2` (minute-level ETT datasets)
- `Solar`
- `PEMS`
- `custom` (custom datasets)

Dataset requirements (see dataset classes in `data_provider/data_loader.py`):
- CSV files must include a `date` column and feature columns. For `features=M`, all features are used for multivariate prediction. For `features=S`, only the target column is used for univariate prediction. For `features=MS`, multivariate input predicts a single target.
- The default target column is `OT` (can be changed via `--target`).
- Custom datasets (`custom`) reorganize columns based on `date` and `target` and split data into 70%/10%/20% for training/validation/testing.
- The `cycle` parameter specifies the periodicity for LPV queries (e.g., `24` for `ETTh1`, `96` for `ETTm1`, `168` for `electricity`).

Place dataset CSV files in the specified `--root_path` directory and use `--data_path` to specify the file name.

---

## Quick Start

To train a model, set `--is_training 1`. After training, testing will be performed automatically, and results will be saved in `./results/` and `./test_results/`.

### Example 1: ETTh1 (Hourly ETT Dataset)

```bash
CUDA_VISIBLE_DEVICES=0 python -u run.py --is_training 1 --root_path ./dataset/ --data_path ETTh1.csv --model_id ETTh1_96_96 --model PQNet --data ETTh1 --features M --seq_len 96 --pred_len 96 --enc_in 7 --cycle 24 --train_epochs 30 --patience 5 --dropout 0.5 --itr 1 --batch_size 256 --learning_rate 0.001 --random_seed 2025
```

### Example 2: ETTm1 (Minute-Level ETT Dataset)

```bash
CUDA_VISIBLE_DEVICES=0 python -u run.py --is_training 1 --root_path ./dataset/ --data_path ETTm1.csv --model_id ETTm1_96_96 --model PQNet --data ETTm1 --features M --seq_len 96 --pred_len 96 --enc_in 7 --cycle 96 --train_epochs 30 --patience 5 --dropout 0.5 --itr 1 --batch_size 256 --learning_rate 0.001 --random_seed 2024
```

### Example 3: Electricity (Custom Dataset)

```bash
CUDA_VISIBLE_DEVICES=0 python -u run.py --is_training 1 --root_path ./dataset/ --data_path electricity.csv --model_id Electricity_96_96 --model PQNet --data custom --features M --seq_len 96 --pred_len 96 --enc_in 321 --cycle 168 --train_epochs 30 --patience 5 --itr 1 --batch_size 32 --learning_rate 0.003 --random_seed 2025
```

Refer to the `.sh` scripts in `scripts/PQNet/` for batch runs with different prediction lengths (`pred_len`).

---

## Key Command-Line Arguments

Some important arguments for `run.py`:

- **Basic Settings**
  - `--is_training`: Training mode (1 for training + testing, 0 for testing only).
  - `--model_id`: Experiment name (used for result directories).
  - `--model`: Model name (e.g., `PQNet`, `Informer`, `Autoformer`, etc.).
  - `--random_seed`: Random seed (default: 2024).

- **Data Settings**
  - `--data`: Dataset type (`ETTh1`, `ETTm1`, `custom`, etc.).
  - `--root_path`: Root directory for datasets (e.g., `./dataset/`).
  - `--data_path`: CSV file name (e.g., `ETTh1.csv`).
  - `--features`: Task type (`M`/`S`/`MS`).
  - `--target`: Target column for `S` or `MS` tasks (default: `OT`).
  - `--freq`: Frequency for time encoding (`h`/`t`/`d`/`w`/`m`, etc.).
  - `--seq_len` / `--label_len` / `--pred_len`: Input/label/prediction lengths (`label_len` is usually 0).

- **PQNet / CycleNet Specific**
  - `--cycle`: Cycle length for LPV queries.
  - `--model_type`: `linear` or `mlp` (default: `mlp` for PQNet).
  - `--use_revin`: Enable RevIN (1/0).
  - `--num_qubits` / `--num_layers`: Hidden dimensions and layers for quantum layers (DRQC).
  - `--QML_device`: Pennylane device name (e.g., `default.qubit`, `lightning.gpu`). Note: PQNet primarily uses Qiskit remote configurations defined in `models/PQNet.py`.

- **Optimization and Training**
  - `--train_epochs` / `--batch_size` / `--learning_rate` / `--patience`.
  - `--use_amp`: Mixed precision training (default: False).
  - `--lradj` / `--pct_start`: Learning rate adjustment strategies (OneCycleLR/TST).

- **GPU Settings**
  - `--use_gpu`: Use GPU (default: True, depends on `torch.cuda.is_available()`).
  - `--gpu`: GPU index.
  - `--use_multi_gpu` / `--devices`: Multi-GPU training (e.g., `--devices "0,1"`).

- **Others**
  - `--output_attention`: Output attention for some Transformer models.
  - `--do_predict`: Perform future data prediction after training.
  - `--checkpoints`: Directory for saving model weights (default: `./checkpoints/`).

---

## Outputs and Evaluation

After execution, the framework will:
- Print and append `mse` and `mae` to `result.txt`.
- Save results:
  - `./results/<setting>/`: Metrics and (in prediction mode) `real_prediction.npy`.
  - `./test_results/<setting>/`: Visualizations of input-prediction-ground truth as PDFs.
- Save model weights:
  - `./checkpoints/<setting>/checkpoint.pth`: Best weights saved during early stopping.
- `<setting>` naming convention:
  ```
  {model_id}_{model}_{data}_ft{features}_sl{seq_len}_pl{pred_len}_cycle{cycle}_seed{random_seed}
  ```

---

## Multi-Model Support

Switch the `--model` argument to any of the following to use the corresponding model (see `model_dict` in `exp/exp_main.py`):
- `PQNet`, `CycleNet`
- `Informer`, `Autoformer`, `Transformer`
- `PatchTST`, `SegRNN`, `QuLTSF`, `iTransformer`, `TimeXer`
- `DLinear`, `Linear`, `NLinear`

Different models may have slightly different input requirements (e.g., time markers, decoder inputs). These are handled in `exp_main.py`.

---

## Custom Dataset Usage (`data=custom`)

Custom dataset CSV requirements:
- Must include a `date` column (for timestamps).
- Other columns are features and target (specified via `--target`, default: `OT`).
- Split into 70%/10%/20% for training/validation/testing.
- Set `--enc_in` to the number of feature channels (excluding the `date` column).

Example:
```bash
python -u run.py --is_training 1 --root_path ./dataset/ --data_path mydata.csv --model_id MyData_96_96 --model PQNet --data custom --features M --seq_len 96 --pred_len 96 --enc_in <number_of_features> --cycle <cycle_length> --train_epochs 30 --batch_size 64 --learning_rate 0.001
```

---

## Reproducibility Scripts

Example scripts for various datasets are provided in `scripts/PQNet/` (e.g., `etth1.sh`, `ettm1.sh`, `electricity.sh`, `traffic.sh`, `weather.sh`). Adjust and run as needed. Create a `logs/` directory for redirected log outputs before running.

---

## Notes and Recommendations

- Ensure CUDA/cuDNN environments match the GPU version of PyTorch.
- Quantum channel aggregation (DRQC) is enabled by default in PQNet. Control its scale via `num_qubits` and `num_layers`.
- For classical model comparisons, switch `--model` to the desired classical model.
- For testing pre-trained weights, set `--is_training 0` and ensure `checkpoints/<setting>/checkpoint.pth` exists.

For further customization or extension, refer to:
- Model definitions: `models/PQNet.py`
- Training logic: `exp/exp_main.py`
- Dataset implementations: `data_provider/data_loader.py`

Feel free to experiment and improve upon this framework!