Credits system
How credits work
- Each API request consumes credits based on the endpoint and features used
- Credits are deducted from your account balance when requests complete successfully
- Failed requests (4xx errors) don’t consume credits
- Credit information is included in response headers
- See Request & Response for endpoint pricing details
Response headers
cloro uses different response headers depending on the endpoint type: All endpoints (/v1/*): Rate limit information
Monitor endpoints (/v1/monitor/*): Credit and concurrency information (in addition to rate limits)
Rate limit information (all endpoints)
| Header | Description |
|---|---|
X-RateLimit-Limit | Maximum requests per second allowed (500) |
X-RateLimit-Remaining | Remaining requests available in this second |
Credit information (monitor endpoints)
| Header | Description |
|---|---|
X-Credits-Remaining | Number of credits remaining in your account |
X-Credits-Charged | Number of credits charged for this request |
Concurrency information (monitor endpoints)
| Header | Description |
|---|---|
X-Concurrent-Limit | Maximum number of concurrent requests allowed |
X-Concurrent-Current | Current number of concurrent requests |
X-Concurrent-Remaining | Number of remaining concurrent slots available |
Concurrency limits
How concurrency works
- Concurrency limits control how many requests you can run simultaneously
- Limits are per API key/account
- Exceeding limits results in 429 Too Many Requests errors
- Limits help ensure service stability for all users
Example scenario
Handling concurrency limits
When you receive a 429 error:Monitoring usage
Track credit consumption
Monitor concurrency
Monitor rate limits (all endpoints)
Getting help
Credit issues
- Low balance: Top up your account via your dashboard
- Unexpected charges: Check request logs and verify which features were enabled
- Billing questions: Contact support with request IDs
Limit issues
- Frequent 429 errors: Consider implementing request queuing or throttling
- Rate limit issues: Monitor
X-RateLimit-Remainingand implement throttling to stay within 500/sec - Concurrency issues: Monitor
X-Concurrent-Remainingand stay within your plan’s concurrent request limit - Need higher limits: Contact support to discuss limit increases
Troubleshooting checklist
- Check headers: Monitor
X-Credits-Remaining,X-Concurrent-Remaining, andX-RateLimit-Remaining - Verify requests: Ensure you’re only requesting needed features
- Implement retries: Use exponential backoff for 429 errors
- Monitor usage: Track credit consumption, concurrency patterns, and rate limit usage
- Optimize queries: Make prompts efficient and specific