# Frankenstein Optimizer

**Frankenstein_optimizer** is an advanced optimization framework designed to integrate seamlessly with TensorFlow and PyTorch environments. This project demonstrates various experimental applications and showcases optimization techniques across multiple domains.

---

## Project Structure

### 1. `1_PIP`
This folder provides 'pip install' to installing **Frankenstein_optimizer** in both TensorFlow and PyTorch frameworks.

---

### 2. `2_Experiment_Pytorch`
This folder contains experiments implemented in PyTorch, focusing on high-performance training and model optimization:

- **ImageNet Image Classification**:
  - Train ResNet models from scratch using **Frankenstein_optimizer**:
    - **ResNet18**: Use the script `run_resnet18.sh` to start training.
    - **ResNet50**: Use the script `run_resnet50.sh` to start training.
  - Both scripts are configured to perform distributed training across **4 GPUs**.

- **Alpaca-LoRA Fine-tuning (`llama-7b` model)**:
  - The `finetune.py` script, located in the `alpaca-lora` folder, is used to fine-tune the `llama-7b` model on the Alpaca dataset.
  - This fine-tuning process employs Parameter-Efficient Fine-Tuning (PEFT) techniques.
  - **Reference**: The implementation is based on the work found at [https://github.com/tloen/alpaca-lora](https://github.com/tloen/alpaca-lora).
  - **Optimizer Trials**:
    - The `finetune.py` script was tested with the following optimizers:
      - Sophia
      - Frankenstein
      - AdamW
  - **Hardware Note**:
    - Performing PEFT may require up to 4 GPUs.

- **Neural Network Quantum States**:
  - This directory explores the use of Neural Network Quantum States (NQS) to find the ground state of quantum many-body systems. The primary implementation is in `main.py`.
  - **`main.py` - Ground State Solver using NQS**:
    - **Objective**: To determine the ground state and corresponding energy for a 2D $J_1-J_2$ Heisenberg model defined on a configurable lattice (e.g., `A` x `B` sites).
    - **Methodology**:
      - Employs a variational approach where a neural network models the amplitudes of the quantum state.
      - The network's parameters are optimized by minimizing the expectation value of the Hamiltonian, $E = \langle \Psi | H | \Psi \rangle$.
    - **Key Features & Components**:
      - **Hamiltonian Construction (`J1J2` function)**:
        - Programmatically builds the sparse matrix for the $J_1-J_2$ Heisenberg model. This involves creating Pauli string representations for interactions (XX, YY, ZZ) on a 2D lattice with nearest-neighbor ($J_1$) and next-nearest-neighbor ($J_2$) couplings.
        - Utilizes Qiskit's `SparsePauliOp` for efficient representation of Hamiltonian terms.
        - The exact ground state energy is computed using SciPy's sparse eigenvalue solver (`scipy.sparse.linalg.eigs`) for benchmarking purposes.
      - **Neural Network Architecture (`RealLinear` class)**:
        - A Multi-Layer Perceptron (MLP) using `torch.nn.Linear` layers and `torch.relu` activation functions.
        - Designed to output a real-valued state vector, which is normalized.
        - It takes computational basis states (e.g., all possible spin configurations for $Q$ qubits) as input and outputs their respective amplitudes in the wave function.
      - **Training & Optimization**:
        - The script iteratively refines the neural network's weights to find the set of parameters that minimize the calculated energy of the system.
        - The loss function is the expectation value of the Hamiltonian: $E = \langle \Psi | H | \Psi \rangle$ (assuming the state $\Psi$ is normalized, i.e., $\langle \Psi | \Psi \rangle = 1$, which is handled by the network's output normalization).
        - A `CosineAnnealingLR` learning rate scheduler is used to adjust the learning rate during training.
        - The training process is repeated for multiple random seeds (e.g., 32 runs) to assess the stability and consistency of the results.
      - **Optimizer Implementations**:
        - **`Frankenstein`**
        - **`Adam`**
        - **`SGD`**
      - **Output & Logging**:
        - During training, it prints the difference between the NQS energy and the pre-calculated exact ground state energy at regular intervals.
        - The history of these energy differences for each training run (across multiple seeds) is collected and saved into a NumPy file (e.g., `adam1.npy`).
    - **Core Technologies Used**:
      - `PyTorch`: For building and training the neural network, and for tensor operations (including sparse tensors).
      - `Qiskit`: Primarily for its `quantum_info` module to handle Pauli operators (`Pauli`, `SparsePauliOp`).
      - `NumPy`: For numerical computations and data handling.
      - `SciPy`: For sparse matrix operations and exact diagonalization (`eigs`).

- **Transfer Learning**:
  - Uses EfficientNet for transfer learning on the CIFAR-10 and CIFAR-100 datasets.

- **MAML (Model-Agnostic Meta-Learning)**:
  - A comprehensive implementation of MAML for rapid adaptation to new tasks.
  - To reproduce results, simply run the provided script `main.py`
  
- **Sentiment Analysis**:
  - Employs the BERT model for sentiment analysis based on the IMDB dataset.
  - Two training approaches are provided:
    - **`initial.py`**: Demonstrates fine-tuning starting from the initial weights of the BERT model.
    - **`transfer.py`**: Demonstrates fine-tuning starting from pre-trained BERT model parameters.
---

### 3. `3_Experiment_Tensorflow`
This folder provides experiments implemented in TensorFlow, demonstrating state-of-the-art techniques:



- **Simple Overfitting Problem**:
- This experiment addresses a simple overfitting problem and demonstrates the performance of **Frankenstein_optimizer**.
- To reproduce the results, simply run: "python main.py"

---

### 4. `4_Toy_loss_function`
This folder showcases optimization on a variety of toy problems, illustrating the performance of **Frankenstein_optimizer** and other optimizers on common convex and non-convex loss functions:

- **Beale Function**
- **Saddle Point Problem**
- **Rosenbrock Function**

#### Features
- Use `example.ipynb` to visualize the trajectories of different optimizers on specified optimization problems.
- Use `main.py` to generate animations including:
  - Each optimizer's learning preferences.
  - The convergence speed on various problems.

---