Stable v1.0 (last updated recently)

Rate Limits

Understand API rate limits, plan-based quotas, and best practices to optimize performance.

Rate Limits Overview

Rate limits are enforced to ensure platform stability and fair usage across all customers. Limits are applied per API key and are the same for all request types.

You can monitor your real-time usage and remaining quota from the dashboard or via the Usage API.

Rate Limits

Unified Limit

10 requests per minute (per API key)

Applies to all endpoints and request types

Consistent enforcement across all plans
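
One way to stay under the limit is to throttle on the client before a request is ever sent. The following is a minimal sketch, not an official client: the class name `SlidingWindowLimiter` and its parameters are hypothetical, and it assumes the server counts requests over a rolling 60-second window (the exact window semantics are not stated here, so treat the numbers as conservative defaults).

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Client-side guard allowing at most `max_calls` per `period` seconds.

    Hypothetical helper; the defaults mirror the documented limit of
    10 requests per minute per API key.
    """

    def __init__(self, max_calls=10, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self._clock = clock      # injectable so the limiter can be tested
        self._sleep = sleep
        self._sent = deque()     # timestamps of calls still inside the window

    def wait_time(self):
        """Return the seconds to wait before another call is allowed."""
        now = self._clock()
        # Discard timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.period:
            self._sent.popleft()
        if len(self._sent) < self.max_calls:
            return 0.0
        return self.period - (now - self._sent[0])

    def acquire(self):
        """Block until a call is allowed, then record it."""
        while True:
            delay = self.wait_time()
            if delay <= 0:
                break
            self._sleep(delay)
        self._sent.append(self._clock())
```

Calling `limiter.acquire()` immediately before each request keeps a single process within the limit. Because the limit applies per API key, several processes sharing one key would need to share one limiter, or each use a smaller `max_calls`.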

Rate Limit Enforcement

When a rate limit is exceeded, the API responds with a 429 Too Many Requests error.

429 Error: Rate limit exceeded
Response Body
{
    "status": 429,
    "message": "Rate limit exceeded",
    "result": null
}
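
A client can recognize this response with a small check such as the sketch below. The function name `is_rate_limited` is hypothetical. Inspecting the `status` field of the body in addition to the HTTP status code is a defensive assumption, not documented behavior.

```python
import json


def is_rate_limited(http_status, body_text):
    """Return True if a response signals that the rate limit was exceeded.

    Checks the HTTP status code first, then falls back to the "status"
    field of the JSON body shown above.
    """
    if http_status == 429:
        return True
    try:
        body = json.loads(body_text)
    except (TypeError, ValueError):
        # Missing or non-JSON body: nothing more to inspect.
        return False
    return isinstance(body, dict) and body.get("status") == 429
```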

Best Practices

  • Implement exponential backoff

    When you receive a 429 response, wait before retrying and double the wait after each consecutive failure.

  • Cache API responses

    Reduce unnecessary requests by caching frequently accessed data.

  • Use batch & bulk operations

    Upload or process multiple documents in a single request when possible.

  • Monitor usage via dashboard

    Track consumption in real time and adjust usage before hitting limits.
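
The exponential backoff practice above can be sketched as follows. This is an illustration under stated assumptions rather than a reference implementation: `call_with_backoff`, `backoff_delay`, and `RateLimitError` are hypothetical names, `send` stands in for whatever function performs your request and returns a `(status, body)` pair, and the base delay, cap, and retry count are arbitrary starting values. Random jitter is added so that several clients do not retry in lockstep.

```python
import random
import time


class RateLimitError(Exception):
    """Raised when every retry still ends in a 429 response."""


def backoff_delay(attempt, base=1.0, cap=60.0, rng=random.random):
    """Jittered delay: a random fraction of min(cap, base * 2**attempt)."""
    return rng() * min(cap, base * (2 ** attempt))


def call_with_backoff(send, max_retries=5, base=1.0, cap=60.0,
                      sleep=time.sleep, rng=random.random):
    """Call `send()` until it returns a non-429 status or retries run out.

    `send` is a hypothetical zero-argument callable returning (status, body).
    """
    for attempt in range(max_retries + 1):
        status, body = send()
        if status != 429:
            return status, body
        if attempt < max_retries:
            # Upper bound on the wait doubles each attempt: 1s, 2s, 4s, ...
            sleep(backoff_delay(attempt, base, cap, rng))
    raise RateLimitError(f"still rate limited after {max_retries} retries")
```

With a limit of 10 requests per minute, a cap of about 60 seconds is a reasonable ceiling, since waiting longer than one minute is unlikely to help if the limit is counted per minute.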