Limitry Documentation
Usage metering for AI and LLM applications
Track AI consumption, enforce quotas, and integrate with billing platforms.
Choose Your Path
📚 Guides
Learn the fundamentals of usage metering
🐍 Python SDK
Track usage with Python
⚡ TypeScript SDK
Track usage with TypeScript
📖 API Reference
Complete REST API documentation
Why Limitry?
📊
Real-time Tracking
Track every API call, token, and resource in real-time.🚦
Quota Enforcement
Set limits and enforce them before overage occurs.💰
Billing Integration
Connect to Stripe, Orb, or any billing platform.⚡
Sub-millisecond Latency
Check quotas without impacting your API response times.Quick Start
New to Limitry? Get started in 3 steps:
-
Learn the basics - Read the Quickstart to send usage data in 5 minutes.
-
Understand the concepts - Learn about Core Concepts to understand events, quotas, and rate limits.
-
Choose your SDK - Dive into the Python SDK or TypeScript SDK for hands-on guides.
Use Cases
| Use Case | Description |
|---|---|
| Token Metering | Track LLM token consumption per customer |
| API Rate Limiting | Enforce request limits per plan tier |
| Usage-Based Billing | Send accurate usage data to your billing system |
| Cost Allocation | Attribute AI costs to specific customers or projects |