Rate Limits
Understand API rate limits, plan-based quotas, and best practices to optimize performance.
Rate Limits Overview
Rate limits are enforced to ensure platform stability and fair usage across all customers. Limits are applied per API key and are the same for all request types.
You can monitor your real-time usage and remaining quota from the dashboard or via the Usage API.
Rate Limits
Unified Limit
10 requests per minute (per API key)
Applies to all endpoints and request types
Consistent enforcement across all plans
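One way to stay under the limit is to throttle on the client before a request is sent. The sketch below is a minimal example that keeps at most 10 requests in any rolling 60-second window. The class name `SlidingWindowThrottle` and its parameters are illustrative and not part of the API. It also assumes a rolling window, which may differ from how the server actually counts requests.

```python
import time
from collections import deque


class SlidingWindowThrottle:
    """Client-side guard: allow at most `limit` requests per `window` seconds.

    Hypothetical helper; the defaults mirror the documented 10 requests/minute.
    """

    def __init__(self, limit=10, window=60.0, clock=time.monotonic, sleep=time.sleep):
        self.limit = limit
        self.window = window
        self.clock = clock      # injectable for testing
        self.sleep = sleep      # injectable for testing
        self.sent = deque()     # timestamps of requests inside the window

    def wait_time(self):
        """Return how many seconds to wait before the next request is allowed."""
        now = self.clock()
        # Forget requests that have aged out of the window.
        while self.sent and now - self.sent[0] >= self.window:
            self.sent.popleft()
        if len(self.sent) < self.limit:
            return 0.0
        # The oldest request must leave the window before a slot frees up.
        return self.window - (now - self.sent[0])

    def acquire(self):
        """Block until a slot is free, then record the request timestamp."""
        delay = self.wait_time()
        if delay > 0:
            self.sleep(delay)
            self.wait_time()    # prune again after sleeping
        self.sent.append(self.clock())
```

Call `acquire()` immediately before each API request. Because the server's counting method is not documented here, a client throttle reduces 429 responses but does not replace handling them.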
Rate Limit Enforcement
When a rate limit is exceeded, the API responds with a 429 Too Many Requests error.
{
  "status": 429,
  "message": "Rate limit exceeded",
  "result": null
}
Best Practices
- Implement exponential backoff
Retry requests gradually when receiving 429 responses.
- Cache API responses
Reduce unnecessary requests by caching frequently accessed data.
- Use batch & bulk operations
Upload or process multiple documents in a single request when possible.
- Monitor usage via dashboard
Track consumption in real time and adjust usage before hitting limits.
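The exponential backoff recommendation above could be sketched as follows. This is an illustration under stated assumptions, not an official client: `send` is a hypothetical zero-argument callable that performs one request and returns a `(status_code, body)` tuple, and the retry count and delays are arbitrary defaults. Random jitter is added so that several clients sharing a key do not retry in lockstep.

```python
import random
import time


def call_with_backoff(send, max_retries=5, base_delay=1.0, max_delay=60.0,
                      sleep=time.sleep):
    """Call send() and retry with exponential backoff while it returns HTTP 429.

    `send` is a hypothetical callable returning (status_code, body).
    Raises RuntimeError if every attempt is rate limited.
    """
    for attempt in range(max_retries + 1):
        status, body = send()
        if status != 429:
            return status, body
        if attempt == max_retries:
            break
        # Upper bound doubles each attempt: 1s, 2s, 4s, ... capped at max_delay.
        delay = min(max_delay, base_delay * (2 ** attempt))
        # "Full jitter": sleep a random amount between 0 and the upper bound.
        sleep(random.uniform(0, delay))
    raise RuntimeError("Rate limit still exceeded after retries")
```

With a limit of 10 requests per minute, a cap of about 60 seconds on the delay is a reasonable starting point, since a full window should have elapsed by then. Tune the values against observed behaviour.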
