# OpenAI Finetuning with training.py

This version of `training.py` has been reimplemented to use OpenAI's API for finetuning GPT-4.1 instead of local training with unsloth.

## Key Changes

- **No local GPU requirements**: All training happens on OpenAI's infrastructure
- **Simplified setup**: No need for CUDA, PyTorch, or large model downloads
- **Automatic model management**: Fine-tuned models are automatically available through OpenAI's API
- **Cost-based**: Training costs are based on OpenAI's pricing for finetuning

## Prerequisites

1. **OpenAI API Key**: Set your OpenAI API key as an environment variable:
   ```bash
   export OPENAI_API_KEY="your-api-key-here"
   ```

2. **Install dependencies**:
   ```bash
   pip install -r requirements_openai.txt
   ```

## Usage

### Basic Usage

```bash
python training_openai.py train_openai.json
```

### Configuration

The configuration file (`train_openai.json`) supports these key parameters:

- `training_file`: Path to your training data (JSONL format)
- `openai_model`: Base model to finetune (e.g., "gpt-4o-mini", "gpt-4o")
- `epochs`: Number of training epochs
- `per_device_train_batch_size`: Batch size (note: OpenAI has limited batch size options)
- `output_dir`: Directory to save job information and temporary files

### Data Format

Your training data should be in JSONL format with one of these structures:

**Option 1: Messages format (recommended)**
```json
{"messages": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}]}
```

**Option 2: User/Assistant format**
```json
{"user": "...", "assistant": "..."}
```

**Option 3: Instruction/Response format**
```json
{"instruction": "...", "response": "..."}
```

**Option 4: Question/Answer format**
```json
{"question": "...", "answer": "..."}
```

## What Happens During Training

1. **Dataset Preparation**: Your data is converted to OpenAI's required format
2. **File Upload**: Training data is uploaded to OpenAI's servers
3. **Job Creation**: A finetuning job is created with your specified parameters
4. **Monitoring**: The script monitors the job progress and reports status
5. **Completion**: When done, you get the fine-tuned model ID

## Output

- **Job Information**: Saved to `{output_dir}/job_info.json`
- **Fine-tuned Model ID**: Available through OpenAI's API
- **Status Updates**: Real-time progress monitoring in the console

## Evaluation and Testing

After successful finetuning, you can evaluate and test your model in several ways:

### 1. Interactive Testing

Test your model interactively:
```bash
python test_openai_model.py ./tmp/job_info.json
```

This opens an interactive chat interface where you can test prompts and see responses.

### 2. Batch Evaluation

Evaluate your model on test data:
```bash
python evaluate_openai.py \
    --job_info ./tmp/job_info.json \
    --test_data ../data/darktriad/survey_messages.jsonl \
    --output evaluation_results.json \
    --temperature 0.7 \
    --max_tokens 500
```

### 3. Programmatic Usage

Use your fine-tuned model in your own code:
```python
from openai import OpenAI

client = OpenAI()

# Get your model ID from job_info.json
with open("./tmp/job_info.json", "r") as f:
    job_info = json.load(f)
    model_id = job_info["finetuned_model"]

# Use the model
response = client.chat.completions.create(
    model=model_id,
    messages=[{"role": "user", "content": "Your prompt here"}],
    temperature=0.7,
    max_tokens=500
)

print(response.choices[0].message.content)
```

### 4. Quick API Test

Test a single prompt:
```bash
curl -X POST "https://api.openai.com/v1/chat/completions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "ft:gpt-4o-mini:your-org:model-name:abc123",
    "messages": [{"role": "user", "content": "Test prompt"}],
    "temperature": 0.7,
    "max_tokens": 100
  }'
```

## Important Notes

- **Cost**: OpenAI finetuning has associated costs. Check [OpenAI's pricing](https://openai.com/pricing) for details
- **Model Availability**: Fine-tuned models are only available through OpenAI's API, not downloadable
- **Data Privacy**: Your training data is processed on OpenAI's servers
- **Limitations**: Some hyperparameters (like custom learning rates) are not configurable

## Example Output

```
Loading dataset from local file: ../data/darktriad/survey_messages.jsonl
Preparing dataset for OpenAI finetuning: ./tmp/training_data.jsonl
Dataset prepared: 199 examples saved to ./tmp/training_data.jsonl
Uploading file to OpenAI: ./tmp/training_data.jsonl
File uploaded successfully. File ID: file-abc123
File processing completed
Creating finetuning job for model: gpt-4o-mini
Finetuning job created successfully. Job ID: ftjob-xyz789
Monitoring finetuning job: ftjob-xyz789
Status: running
Status: running
Status: succeeded
Finetuning completed successfully!
Fine-tuned model: ft:gpt-4o-mini:your-org:model-name:abc123
Job information saved to: ./tmp/job_info.json
Training file cleaned up
```

## Troubleshooting

- **API Key Issues**: Ensure `OPENAI_API_KEY` is set correctly
- **File Format**: Check that your training data is valid JSONL
- **Rate Limits**: OpenAI has rate limits; the script includes backoff retry logic
- **Job Status**: Use the job ID to check status manually if needed

## Next Steps

After successful finetuning, you can use your fine-tuned model through OpenAI's API:

```python
from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="ft:gpt-4o-mini:your-org:model-name:abc123",
    messages=[{"role": "user", "content": "Your prompt here"}]
)
print(response.choices[0].message.content)
```
