GPT Usage Limiter

Multi-tenant budget-based proxy server for OpenAI API with cost monitoring, usage tracking, and budget limiting per period. Built with Rust (Axum) backend and Next.js 15 frontend.

Tech Stack

Backend

Rust - High-performance, safe systems programming language
Axum - Ergonomic web framework for Rust
SQLx - Async SQL toolkit with compile-time query checking
SQLite - Lightweight, serverless database
JWT - Token-based authentication

Frontend

Next.js 15 - React framework with App Router
TypeScript - Type-safe JavaScript
shadcn/ui - Re-usable components built with Radix UI
Tailwind CSS - Utility-first CSS framework
Recharts - Composable charting library

Features

💰 Budget-Based Limiting - Set budget limits per minute/hour/day/week/month/year
🏢 Multi-Tenant - Create multiple limiters for different clients/projects
🔒 Transparent Proxy - Works with any OpenAI SDK, just change baseURL
📊 Usage Tracking - Track requests, tokens, and cost per user
📈 Modern Dashboard - Beautiful React dashboard with shadcn/ui components
💸 Cost Monitoring - Automatic cost calculation for all OpenAI models
🎯 Per-Limiter Stats - Detailed statistics and budget usage per limiter
🔧 Easy Management - Create, edit, delete, and toggle limiters via UI
🗄️ SQLite Storage - Persistent storage with automatic migrations
🌐 Azure OpenAI Support - Works with both OpenAI and Azure OpenAI
⚡ High Performance - Built with Rust for maximum speed and safety
🔐 Secure Auth - JWT-based authentication system

Quick Start

Prerequisites

Rust 1.75 or higher
Node.js 20 or higher
Docker (optional, for containerized deployment)

Development Setup

1. Backend Setup

cd backend

# Copy environment file
cp .env.example .env

# Edit .env with your configuration
# Important: Change JWT_SECRET and ADMIN_PASSWORD in production!

# Run the backend
cargo run

# Or with auto-reload (requires cargo-watch)
cargo watch -x run

Backend will be running at http://localhost:8000

2. Frontend Setup

cd frontend

# Install dependencies
npm install

# Copy environment file
cp .env.example .env.local

# Run the development server
npm run dev

Frontend will be running at http://localhost:3000

Docker Deployment

# Copy environment file
cp .env.example .env

# Edit .env with your configuration
# Important: Change JWT_SECRET and ADMIN_PASSWORD in production!

# Build and run with Docker Compose
docker-compose up -d

# View logs
docker-compose logs -f

# Stop containers
docker-compose down

Services:

Frontend: http://localhost:3000
Backend API: http://localhost:8000
Health Check: http://localhost:8000/health

How It Works

1. Login to Dashboard

Access http://localhost:3000/login

Default credentials (change in production!):

Username: admin
Password: changeme

2. Create a Limiter

In the dashboard:

Click "Create Limiter"
Name: e.g., "Customer A - $20/day"
Budget Limit: e.g., $20.00
Period: day (or minute, hour, week, month, year)
Base URL: https://api.openai.com (or Azure OpenAI URL)

You'll get a unique proxy endpoint like:

http://localhost:8000/v1/abc123def456

3. Use the Proxy Endpoint in Your Code

Just change the baseURL to your limiter's proxy endpoint:

Python

from openai import OpenAI

client = OpenAI(
    api_key="your-openai-api-key",
    base_url="http://localhost:8000/v1/abc123def456"  # Your limiter endpoint
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}]
)

JavaScript/TypeScript

import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: 'your-openai-api-key',
  baseURL: 'http://localhost:8000/v1/abc123def456'  // Your limiter endpoint
});

const response = await client.chat.completions.create({
  model: 'gpt-4o',
  messages: [{ role: 'user', content: 'Hello!' }]
});

4. Monitor Usage

View real-time statistics in the dashboard
Check budget usage percentage
See requests, tokens, and cost breakdown
Monitor per-user statistics

Budget Limit Response

When budget limit is exceeded, the proxy returns HTTP 429:

{
  "error": {
    "message": "Budget limit exceeded: $20.50/$20.00 per day. Current usage: 102.5%",
    "type": "429"
  }
}

Dashboard Features

Limiters Page

Create Limiters - Add new limiters with custom budgets and periods
Responsive Grid - View all limiters in a responsive card layout
Limiter Cards - Each card shows:
- Active/Inactive status with toggle
- Budget limit and period type
- Proxy endpoint with copy button
- Real-time statistics
Quick Actions - Toggle, view stats, or delete limiters
Stats Dialog - View detailed usage and budget information

Statistics Page

Overview Cards:
- Total Requests
- Total Tokens
- Total Cost
- Active Users
Usage Table:
- Detailed breakdown by limiter
- Sortable columns
- Cost and token metrics

API Documentation

Authentication

All protected endpoints require a JWT token in the Authorization header:

Authorization: Bearer <your-jwt-token>

Get token by logging in:

POST /api/auth/login
Content-Type: application/json

{
  "username": "admin",
  "password": "changeme"
}

Response:

{
  "token": "eyJhbGc...",
  "username": "admin"
}

Limiter Management

Create Limiter

POST /api/limiters
Authorization: Bearer <token>
Content-Type: application/json

{
  "name": "Customer A - $20/day",
  "budget_limit": 20.00,
  "period_type": "day",
  "base_url": "https://api.openai.com"
}

Get All Limiters

GET /api/limiters
Authorization: Bearer <token>

Get Limiter Stats

GET /api/limiters/{limiter_id}/stats
Authorization: Bearer <token>

Response:

{
  "limiter_id": "uuid-here",
  "limiter_name": "Customer A",
  "current_spend": 5.25,
  "budget_limit": 20.00,
  "remaining_budget": 14.75,
  "usage_percentage": 26.25,
  "total_requests": 150,
  "period_start": "2025-10-10T00:00:00Z",
  "period_end": "2025-10-11T00:00:00Z"
}

Update Limiter

PUT /api/limiters/{limiter_id}
Authorization: Bearer <token>
Content-Type: application/json

{
  "name": "Customer A - $30/day",
  "budget_limit": 30.00,
  "is_active": true
}

Delete Limiter

DELETE /api/limiters/{limiter_id}
Authorization: Bearer <token>

Statistics API

Get All User Stats

GET /api/stats
Authorization: Bearer <token>

Period Types

Budget limits can be set for different periods:

Period	Description	Resets
`minute`	Per minute	Every minute
`hour`	Per hour	Every hour
`day`	Per day	Every day at midnight UTC
`week`	Per week	Every Monday at midnight UTC
`month`	Per month	First day of month
`year`	Per year	January 1st

Cost Calculation

Automatic cost calculation for OpenAI models:

Model	Prompt (per 1K tokens)	Completion (per 1K tokens)
gpt-4o	$0.005	$0.015
gpt-4o-mini	$0.00015	$0.0006
gpt-4-turbo	$0.01	$0.03
gpt-4	$0.03	$0.06
gpt-3.5-turbo	$0.0005	$0.0015

Architecture

┌──────────────┐
│   Client     │
│  (OpenAI SDK)│
└──────┬───────┘
       │
       ▼
┌─────────────────────────────────────┐
│  Rust Backend (Axum)                │
│                                     │
│  ┌───────────────────────────────┐ │
│  │  /v1/{limiter_code}/*         │ │
│  │  - Validate limiter           │ │
│  │  - Check budget limit         │ │
│  │  - Forward to OpenAI API      │ │
│  │  - Track usage & cost         │ │
│  └───────────────────────────────┘ │
│                                     │
│  ┌───────────────────────────────┐ │
│  │  Management API               │ │
│  │  - JWT Authentication         │ │
│  │  - Limiter CRUD               │ │
│  │  - Statistics endpoints       │ │
│  └───────────────────────────────┘ │
└─────────────┬───────────────────────┘
              │
              ▼
      ┌──────────────┐
      │  SQLite DB   │
      │  - Limiters  │
      │  - Usage     │
      └──────────────┘

┌─────────────────────────────────────┐
│  Next.js Frontend                   │
│                                     │
│  ┌───────────────────────────────┐ │
│  │  /login                       │ │
│  │  - JWT authentication         │ │
│  └───────────────────────────────┘ │
│                                     │
│  ┌───────────────────────────────┐ │
│  │  / (Dashboard)                │ │
│  │  - Limiter management         │ │
│  │  - Real-time stats            │ │
│  └───────────────────────────────┘ │
│                                     │
│  ┌───────────────────────────────┐ │
│  │  /statistics                  │ │
│  │  - Usage overview             │ │
│  │  - Cost breakdown             │ │
│  └───────────────────────────────┘ │
└─────────────────────────────────────┘

Project Structure

gpt-usage-limiter/
├── backend/                   # Rust backend
│   ├── src/
│   │   ├── main.rs           # Application entry point
│   │   ├── config.rs         # Configuration management
│   │   ├── db.rs             # Database layer
│   │   ├── error.rs          # Error handling
│   │   ├── models.rs         # Data models
│   │   ├── handlers/         # HTTP handlers
│   │   │   ├── auth.rs       # Authentication
│   │   │   ├── limiters.rs   # Limiter management
│   │   │   ├── proxy.rs      # OpenAI proxy
│   │   │   └── stats.rs      # Statistics
│   │   ├── services/         # Business logic
│   │   │   ├── limiter.rs    # Limiter service
│   │   │   ├── proxy.rs      # Proxy service
│   │   │   ├── pricing.rs    # Pricing calculations
│   │   │   └── stats.rs      # Statistics service
│   │   └── middleware/       # Middleware
│   │       └── auth.rs       # JWT middleware
│   ├── migrations/           # Database migrations
│   ├── Cargo.toml           # Rust dependencies
│   ├── Dockerfile           # Backend Docker image
│   └── .env.example         # Environment variables template
│
├── frontend/                 # Next.js frontend
│   ├── app/
│   │   ├── (dashboard)/     # Dashboard layout group
│   │   │   ├── layout.tsx   # Dashboard layout
│   │   │   ├── page.tsx     # Limiters page
│   │   │   └── statistics/
│   │   │       └── page.tsx # Statistics page
│   │   ├── login/
│   │   │   └── page.tsx     # Login page
│   │   ├── layout.tsx       # Root layout
│   │   └── globals.css      # Global styles
│   ├── components/
│   │   └── ui/              # shadcn/ui components
│   ├── lib/
│   │   ├── api.ts           # API client
│   │   └── utils.ts         # Utility functions
│   ├── package.json         # Node dependencies
│   ├── Dockerfile           # Frontend Docker image
│   └── .env.example         # Environment variables template
│
├── docker-compose.yml       # Docker Compose configuration
├── .env.example             # Root environment template
└── README.md               # This file

Use Cases

1. Multi-Customer SaaS

Create separate limiters for each customer with different budget tiers:

Basic: $10/month
Pro: $50/month
Enterprise: $500/month

2. Department Budgets

Separate limiters for different departments in your organization:

Marketing: $100/week
Engineering: $500/week
Support: $200/week

3. Project-Based Budgets

Create limiters for different projects with independent budgets:

Project Alpha: $1000/month
Project Beta: $500/month
POC Projects: $50/month

4. Rate Limiting for Free Tier

Create limiters with small budgets for free tier users:

Free User: $1/day
Prevent abuse while offering free tier

Production Deployment

Security Checklist

Environment Variables

Backend (.env)

# Server
HOST=0.0.0.0
PORT=8000
DATABASE_URL=sqlite:usage_data.db

# Security
JWT_SECRET=your-super-secret-key-min-32-characters
JWT_EXPIRE_HOURS=24
ADMIN_USERNAME=admin
ADMIN_PASSWORD=strong-password-here

# OpenAI
OPENAI_API_BASE_URL=https://api.openai.com

Frontend (.env.local)

NEXT_PUBLIC_API_URL=http://localhost:8000

Reverse Proxy Example (Nginx)

server {
    listen 80;
    server_name your-domain.com;
    return 301 https://$server_name$request_uri;
}

server {
    listen 443 ssl http2;
    server_name your-domain.com;

    ssl_certificate /path/to/cert.pem;
    ssl_certificate_key /path/to/key.pem;

    # Frontend
    location / {
        proxy_pass http://localhost:3000;
        proxy_http_version 1.1;
        proxy_set_header Upgrade $http_upgrade;
        proxy_set_header Connection 'upgrade';
        proxy_set_header Host $host;
        proxy_cache_bypass $http_upgrade;
    }

    # Backend API
    location /api {
        proxy_pass http://localhost:8000;
        proxy_http_version 1.1;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
        proxy_set_header X-Forwarded-Proto $scheme;
    }

    # Proxy endpoints
    location /v1 {
        proxy_pass http://localhost:8000;
        proxy_http_version 1.1;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
    }
}

Development

Backend Development

cd backend

# Run with auto-reload
cargo watch -x run

# Run tests
cargo test

# Check code
cargo clippy

# Format code
cargo fmt

# Build release
cargo build --release

Frontend Development

cd frontend

# Run development server
npm run dev

# Build for production
npm run build

# Start production server
npm start

# Lint code
npm run lint

Troubleshooting

Budget not being tracked correctly

Check that limiter is active (not deactivated)
Verify period type is correct
Check if period has rolled over (e.g., new day started)
Review SQLite database for usage records

Limiter not found error

Verify limiter code in URL is correct
Check if limiter was deleted
Ensure limiter is active
Check backend logs for errors

Authentication issues

Verify JWT_SECRET is set correctly
Check token expiration (default 24 hours)
Clear browser localStorage and login again
Check browser console for CORS errors

Cost calculation seems wrong

Verify model name matches OpenAI's naming
Check pricing configuration in pricing.rs
Review usage records in database

Docker container issues

Check logs: docker-compose logs backend
Verify environment variables are set
Ensure ports 3000 and 8000 are not in use
Check disk space for SQLite database

Performance

Backend: Built with Rust for maximum performance and memory safety
Database: SQLite with optimized indexes for fast queries
Frontend: Next.js 15 with Server Components for optimal loading
API: Async Rust with Axum for high concurrency

Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch
Make your changes
Write tests if applicable
Submit a pull request

License

MIT License

Support

If you encounter issues or have questions:

Open a GitHub issue
Review server logs for errors
Check the troubleshooting section

High-performance OpenAI API budget management built with Rust and Next.js

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.claude		.claude
.github/workflows		.github/workflows
app		app
backend		backend
frontend		frontend
k8s		k8s
static		static
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
example_client.py		example_client.py
main.py		main.py
pricing.json.example		pricing.json.example
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

GPT Usage Limiter

Tech Stack

Backend

Frontend

Features

Quick Start

Prerequisites

Development Setup

1. Backend Setup

2. Frontend Setup

Docker Deployment

How It Works

1. Login to Dashboard

2. Create a Limiter

3. Use the Proxy Endpoint in Your Code

Python

JavaScript/TypeScript

4. Monitor Usage

Budget Limit Response

Dashboard Features

Limiters Page

Statistics Page

API Documentation

Authentication

Limiter Management

Create Limiter

Get All Limiters

Get Limiter Stats

Update Limiter

Delete Limiter

Statistics API

Get All User Stats

Period Types

Cost Calculation

Architecture

Project Structure

Use Cases

1. Multi-Customer SaaS

2. Department Budgets

3. Project-Based Budgets

4. Rate Limiting for Free Tier

Production Deployment

Security Checklist

Environment Variables

Backend (.env)

Frontend (.env.local)

Reverse Proxy Example (Nginx)

Development

Backend Development

Frontend Development

Troubleshooting

Budget not being tracked correctly

Limiter not found error

Authentication issues

Cost calculation seems wrong

Docker container issues

Performance

Contributing

License

Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages