Rate Limiting
Control request frequency with rate limits
Rate limits help you control how frequently requests can be made, preventing abuse and ensuring fair usage.
Creating a Rate Limit
const rateLimit = await client.rateLimits.create({
customerId: 'customer_123',
eventType: 'model_call',
limit: 100,
window: 'minute',
});rate_limit = await client.rate_limits.create(
customer_id="customer_123",
event_type="model_call",
limit=100,
window="minute",
)Checking Rate Limits
const result = await client.rateLimits.check({
customerId: 'customer_123',
eventType: 'model_call',
});
if (!result.allowed) {
// Rate limit exceeded, retry after result.retryAfter
}result = await client.rate_limits.check(
customer_id="customer_123",
event_type="model_call",
)
if not result.allowed:
# Rate limit exceeded, retry after result.retry_after
passNext Steps
- Setting Quotas - Set usage limits
- Best Practices - Learn best practices