A lightweight CLI tool for running batches of prompts through OpenAI's GPT models using Python. This repo contains three versions of a script that:
- Reads prompts from a file (`prompts.txt`)
- Sends them one by one to the OpenAI API
- Logs each response
- Estimates token usage and cost
- Optionally saves results to `.txt` and `.csv` files
Before running, create a `.env` file in the project folder:

```
OPENAI_API_KEY=sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
```
Make sure to:
- Use your own OpenAI key
- Never commit `.env` to public repos
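To sanity-check your setup, the core pattern the scripts rely on looks roughly like this. This is a minimal sketch using the `openai` v1 client and `python-dotenv` (both listed under dependencies below); the model name and `max_tokens` value are illustrative, not taken from the scripts.

```python
import os

from dotenv import load_dotenv
from openai import OpenAI

# Load OPENAI_API_KEY from the local .env file into the environment.
load_dotenv()

# The client can also pick up OPENAI_API_KEY from the environment automatically.
client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

response = client.chat.completions.create(
    model="gpt-3.5-turbo",   # illustrative; v2 uses a GPT-4 Turbo model
    messages=[{"role": "user", "content": "Say hello in three languages."}],
    max_tokens=150,          # v1's token limit
)

print(response.choices[0].message.content)
print("Tokens used:", response.usage.total_tokens)
```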
📂 Script Versions

batch_from_file.py
- Model: GPT-3.5 Turbo
- Token Limit: 150
- Output: Individual `.txt` files
- No CSV logging
- Simplest version – great for quick testing.
batch_from_file_v2.py
- Model: GPT-4 Turbo
- Token Limit: 600
- Output: `.txt` file per prompt with usage summary
- Central CSV log (`summary_log.csv`) with tokens, cost, and truncation info
- Added rate limit protection
batch_from_file_v3.py
- Model: GPT-3.5 Turbo
- Token Limit: 1000
- Output: `.txt` file per response
- CSV log (`summary_log_v3.csv`)
- Includes cost calculation per prompt
- Detects and logs truncation
- Clean terminal summaries
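The rate-limit protection, per-prompt cost calculation, and truncation detection mentioned above can be sketched roughly as follows. The backoff logic, the per-1K-token prices, and the helper name `ask_with_retry` are illustrative assumptions, not code lifted from the scripts; check current OpenAI pricing before relying on the numbers.

```python
import time

from openai import OpenAI, RateLimitError

client = OpenAI()

# Illustrative GPT-3.5 Turbo prices in USD per 1K tokens -- verify against current pricing.
PRICE_PER_1K_INPUT = 0.0005
PRICE_PER_1K_OUTPUT = 0.0015


def ask_with_retry(prompt: str, max_tokens: int = 1000, retries: int = 3):
    """Send one prompt, backing off and retrying when the API rate-limits us."""
    for attempt in range(retries):
        try:
            return client.chat.completions.create(
                model="gpt-3.5-turbo",
                messages=[{"role": "user", "content": prompt}],
                max_tokens=max_tokens,
            )
        except RateLimitError:
            time.sleep(2 ** attempt)  # simple exponential backoff
    raise RuntimeError("Rate limit retries exhausted")


response = ask_with_retry("Summarize the plot of Hamlet.")
usage = response.usage

# Cost estimate for this single prompt.
cost = (usage.prompt_tokens / 1000) * PRICE_PER_1K_INPUT + \
       (usage.completion_tokens / 1000) * PRICE_PER_1K_OUTPUT

# finish_reason == "length" means the response hit the token limit (truncated).
truncated = response.choices[0].finish_reason == "length"
print(f"cost=${cost:.6f}  truncated={truncated}")
```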
📊 Helper Scripts
This repo includes two report builders that generate visual summaries from OpenAI batch CSV logs.
report_builder.py

Purpose:
Generate a lightweight summary report (HTML or Markdown) from a GPT batch CSV.
Requires standard columns: `prompt`, `response`, `prompt_tokens`, `completion_tokens`, `total_tokens`, `cost`
Features:
- Summary statistics (min, mean, median, max)
- Optional charts:
  - Scatter: Total Tokens vs. Cost
  - Histogram: Response Length
  - Bar chart: Top-N costly prompts
- Exports either:
  - HTML with embedded Base64 images
  - Markdown with PNG charts saved to disk
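For reference, embedding a chart as a Base64 image inside a single HTML file can be done with `matplotlib` like this. This is a sketch of the general technique, not the actual `report_builder.py` implementation.

```python
import base64
import io

import matplotlib
matplotlib.use("Agg")  # render without a display
import matplotlib.pyplot as plt
import pandas as pd

df = pd.read_csv("results/summary_log_v3.csv")

# Scatter: total tokens vs. cost.
fig, ax = plt.subplots()
ax.scatter(df["total_tokens"], df["cost"])
ax.set_xlabel("Total tokens")
ax.set_ylabel("Cost (USD)")

# Encode the figure as Base64 so it can be inlined in a self-contained HTML file.
buf = io.BytesIO()
fig.savefig(buf, format="png")
encoded = base64.b64encode(buf.getvalue()).decode("ascii")
html = f'<img src="data:image/png;base64,{encoded}" alt="Tokens vs. cost">'

with open("batch_report_snippet.html", "w") as fh:
    fh.write(html)
```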
Example Commands:

```
# Generate HTML report
python report_builder.py -i results/summary_log_v3.csv -f html -o batch_report.html

# Generate Markdown report with PNGs
python report_builder.py -i results/summary_log_v3.csv -f md -o batch_report.md --top-n 10
```

report_builder_v2.py

Purpose:
An advanced version with extra metrics, filtering, and comparison features.
Key Features:
- Cost-per-token, prompt/completion ratios
- Correlation matrix + heatmap
- Percentile tables
- Before/after comparison (e.g., for optimization)
- Near-duplicate prompt detection
- Flexible CLI filters: `--min-cost`, `--keyword`, `--opt-flag-col`
- Multiple chart types via `--charts`
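Near-duplicate prompt detection can be approximated with Python's standard `difflib`; the sketch below illustrates the idea only (the 0.9 threshold is an arbitrary assumption, not the script's setting). The pairwise comparison is quadratic in the number of prompts, which is fine for small batch logs.

```python
from difflib import SequenceMatcher

import pandas as pd

df = pd.read_csv("results/summary_log_v3.csv")
prompts = df["prompt"].astype(str).tolist()

# Flag pairs whose similarity ratio exceeds a threshold (0.9 is an arbitrary choice).
THRESHOLD = 0.9
for i in range(len(prompts)):
    for j in range(i + 1, len(prompts)):
        ratio = SequenceMatcher(None, prompts[i], prompts[j]).ratio()
        if ratio >= THRESHOLD:
            print(f"Prompts {i + 1} and {j + 1} look similar (ratio={ratio:.2f})")
```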
Example Commands:

```
# Full-feature HTML report
python report_builder_v2.py -i results/summary_log_v3.csv -f html -o batch_report_v2.html \
    --title "GPT-3.5 Batch Report v2" \
    --charts scatter_cost_tokens hist_response_length heatmap_corr \
    --top-n 5 --min-cost 0.0002 --keyword "error" --opt-flag-col optimized

# Markdown version (lightweight)
python report_builder_v2.py -i results/summary_log_v3.csv -f md -o batch_report_v2.md \
    --charts scatter_cost_tokens hist_response_length
```

✅ Use report_builder.py for quick insights
🧠 Use report_builder_v2.py for advanced analysis
The repo also includes two CSV analyzers: one tailored to the OpenAI batch logs and one for arbitrary CSV files.
csv_analyzer.py

Purpose:
Analyzes `results/summary_log_v3.csv` generated by `batch_from_file_v3.py`.
Outputs token + cost stats, and saves a detailed report.
What it reports:
- Total & average prompt, completion, and total tokens
- Total & average cost
- Cost per token / 1000 tokens / 100 prompts
- Truncated responses (with Prompt #s)
How to use:

```
python csv_analyzer.py
```

- Input: `results/summary_log_v3.csv` (must exist)
- Output: `results/summary_report.txt`
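If you want to sanity-check the report, the same kinds of numbers can be reproduced in a few lines of pandas. The `truncated` column name below is an assumption; adjust it to match the actual log headers.

```python
import pandas as pd

df = pd.read_csv("results/summary_log_v3.csv")

print("Prompts:               ", len(df))
print("Total tokens:          ", df["total_tokens"].sum())
print("Average total tokens:  ", round(df["total_tokens"].mean(), 1))
print("Total cost ($):        ", round(df["cost"].sum(), 4))
print("Cost per 1K tokens ($):",
      round(df["cost"].sum() / df["total_tokens"].sum() * 1000, 4))

# The truncation column name is an assumption -- change it to match your log.
if "truncated" in df.columns:
    flagged = df.index[df["truncated"] == True].tolist()
    print("Truncated prompt #s:   ", [i + 1 for i in flagged])
```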
generic_csv_analyzer.py

Purpose:
Works with any CSV file. Gives numeric stats (mean, std, percentiles) and categorical summaries (most common values, unique counts).
What it reports:
- Row & column count
- Per-column stats for numbers and strings
- Optional: Save report to a `.txt` file
How to use:

```
# Just print to console:
python generic_csv_analyzer.py path/to/your_file.csv

# Save report to a text file:
python generic_csv_analyzer.py path/to/your_file.csv --output reports/summary.txt
```

Dependencies:

```
pip install pandas
```

✅ Tip: Use csv_analyzer.py for OpenAI prompt logs
📂 Use generic_csv_analyzer.py for any custom CSV
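Under the hood, this style of generic summary is close to what pandas provides out of the box, as in this sketch (not the actual script):

```python
import pandas as pd

df = pd.read_csv("path/to/your_file.csv")
print(f"{len(df)} rows x {len(df.columns)} columns")

# Numeric columns: mean, std, percentiles, min/max.
print(df.describe())

# String/categorical columns: unique counts and most common values.
for col in df.select_dtypes(include="object").columns:
    print(col, "-", df[col].nunique(), "unique values")
    print(df[col].value_counts().head(3))
```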
📜 prompts.txt Format
Create a file named `prompts.txt` with one prompt per line. For example:

```
prompt1
prompt2
prompt3
prompt4
...
promptx
```
Avoid blank lines at the end.
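Trailing blank lines would otherwise turn into empty API calls, so a defensive way to load the file is to strip and skip blanks, for example:

```python
# Read prompts.txt, dropping blank lines and surrounding whitespace.
with open("prompts.txt", encoding="utf-8") as fh:
    prompts = [line.strip() for line in fh if line.strip()]

print(f"Loaded {len(prompts)} prompts")
```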
▶️ How to Run
- Install dependencies:

  ```
  pip install openai python-dotenv
  ```

- Prepare files:
  - `.env` with your key
  - `prompts.txt` with your prompts

- Run a script:

  ```
  python batch_from_file.py      # basic
  python batch_from_file_v2.py   # with CSV and GPT-4
  python batch_from_file_v3.py   # GPT-3.5, 1000-token limit
  ```

- Check outputs:
  - Text files: `response_1.txt`, `response_2.txt`, ...
  - CSV logs: `summary_log.csv` or `summary_log_v3.csv`
🧾 Output Breakdown
Each script (except v1) tracks:
- Prompt and response tokens
- Truncation status
- Cost per prompt and total cost
- First 250 characters of response in CSV
- Full response saved in `.txt` files with usage details
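As an illustration, a log row like that could be appended with `csv.DictWriter` roughly as follows. The column headers and the helper name `log_row` are illustrative, not necessarily the exact ones the scripts use.

```python
import csv


def log_row(path, prompt_num, prompt, response_text, usage, cost, truncated):
    """Append one result row to the CSV log, keeping only a 250-character response preview."""
    fieldnames = ["prompt_num", "prompt", "response_preview",
                  "prompt_tokens", "completion_tokens", "total_tokens",
                  "cost", "truncated"]
    with open(path, "a", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=fieldnames)
        if fh.tell() == 0:  # write the header only for a new/empty file
            writer.writeheader()
        writer.writerow({
            "prompt_num": prompt_num,
            "prompt": prompt,
            "response_preview": response_text[:250],   # first 250 characters only
            "prompt_tokens": usage.prompt_tokens,      # usage = response.usage
            "completion_tokens": usage.completion_tokens,
            "total_tokens": usage.total_tokens,
            "cost": round(cost, 6),
            "truncated": truncated,
        })
```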
🤖 Model Comparison Summary
This table summarizes the tradeoff between cost per prompt and average response quality:
| Model | Avg. Cost per Prompt | Avg. Quality Score (out of 20) |
|---|---|---|
| GPT-4 Turbo | $0.0184 | 18.0 |
| GPT-3.5 Turbo | $0.0006 | 16.0 |
- GPT-3.5 Turbo is cost-efficient and fast.
- GPT-4 Turbo is more powerful and accurate but more expensive.
Clone the repo and start with `batch_from_file.py` to get familiar with the workflow.
Then explore v2 and v3 for more advanced logging and flexibility. Use a cheaper model when testing, and limit the token count of responses. Never share your API key or commit your `.env` file publicly.
This project is intended for educational or research use. API usage costs are your responsibility.
