Credits, Rate Limits & Quotas

Check Your Credit Balance

curl https://api.bithuman.ai/v2/credit-summaries \
  -H "api-secret: YOUR_API_SECRET"

Response:

{
  "success": true,
  "data": {
    "balance": 5240,
    "plan_credits": 240,
    "topup_credits": 5000,
    "minutes_estimate": {
      "voice_chat": 524,
      "video_chat": 174,
      "standard_avatar_cloud": 2620,
      "standard_avatar_self_hosted": 5240,
      "advanced_avatar_cloud": 1310,
      "advanced_avatar_self_hosted": 2620
    }
  }
}

Field	Description
`balance`	Total usable credits (plan + topup)
`plan_credits`	Monthly subscription credits — reset each billing cycle
`topup_credits`	Purchased credits — carry over until used
`minutes_estimate`	Pre-calculated minutes remaining per session type

Credit Rates

Different features consume credits at different rates per minute:

Feature	Credits/min	Description
Voice Chat	10	Cloud agent with STT + LLM + TTS
Video Chat	30	Cloud agent with user camera enabled
Standard Avatar (Cloud)	2	Cloud-hosted avatar rendering
Standard Avatar (Self-Hosted)	1	Your infrastructure, avatar rendering
Advanced Avatar (Cloud)	4	Cloud-hosted high-quality avatar rendering
Advanced Avatar (Self-Hosted)	2	Your infrastructure, high-quality avatar rendering

One-time costs:

Operation	Credits	Description
Agent Generation	250	Create a new avatar agent
Dynamics Generation	250	Generate gestures/movements for an agent

Request Limits

API endpoints are rate-limited to protect service quality. Limits are applied per API secret.

Tier	Concurrent Sessions	Agent Generations/day
Free	2	5
Creator ($20/mo)	5	20
Pro ($99/mo)	10	50
Enterprise	Custom	Custom

Check your current tier and usage at www.bithuman.ai → Developer → API Keys.

Handling Errors

If you exceed limits or run out of credits, the API returns an error:

{
  "error": {
    "code": "INSUFFICIENT_BALANCE",
    "message": "Insufficient credits",
    "httpStatus": 402
  },
  "status": "error",
  "status_code": 402
}

Common status codes: 402 (no credits), 429 (rate limited), 503 (workers busy).

Recommended Retry Strategy

Use exponential backoff with jitter:

import time
import random
import requests

def api_request_with_retry(url, headers, max_retries=3):
    for attempt in range(max_retries):
        resp = requests.post(url, headers=headers)

        if resp.status_code not in (429, 503):
            return resp

        # Exponential backoff with jitter
        wait = (2 ** attempt) + random.uniform(0, 1)
        time.sleep(wait)

    return resp  # Return last response if all retries exhausted

Concurrency Limits

Avatar sessions have per-account concurrency limits:

Resource	Limit	Notes
Cloud avatar sessions	Based on tier	Active WebRTC sessions
Agent generation	3 concurrent	Queued if exceeded
Dynamics generation	2 concurrent	Queued if exceeded

Endpoint Guidelines

Endpoint	Guidance	Notes
`POST /v1/validate`	Lightweight	Use for health checks
`POST /v1/agent/generate`	Heavy	~2-5 min async operation
`GET /v1/agent/status/*`	Poll at 5s intervals	Avoid sub-second polling
`POST /v1/agent/*/speak`	Per active session	Agent must be in a room
`POST /v1/files/upload`	10 MB image, 100 MB video	Size limits enforced
`POST /v1/dynamics/generate`	Heavy	Triggers video generation

Best Practices

Use webhooks instead of polling

Instead of polling /v1/agent/status/{id} in a loop, configure webhooks to get notified when generation completes.

Cache agent details

Agent data rarely changes. Cache GET /v1/agent/{code} responses locally and refresh only when needed.

Reuse sessions

Keep avatar sessions alive between conversations instead of creating new ones. Session creation is the most expensive operation.

Check credits before heavy operations

Call GET /v2/credit-summaries to check your balance before starting agent generation (250 credits) or dynamics creation (250 credits). This avoids wasted API calls that fail with 402.

curl https://api.bithuman.ai/v2/credit-summaries \
  -H "api-secret: YOUR_API_SECRET"

Overview

Agents

Assets

Credits, Rate Limits & Quotas

Check Your Credit Balance

Credit Rates

Request Limits

Handling Errors

Recommended Retry Strategy

Concurrency Limits

Endpoint Guidelines

Best Practices

Need Higher Limits?

Overview

Agents

Assets

Documentation Index

​Check Your Credit Balance

​Credit Rates

​Request Limits

​Handling Errors

​Recommended Retry Strategy

​Concurrency Limits

​Endpoint Guidelines

​Best Practices

​Need Higher Limits?

Check Your Credit Balance

Credit Rates

Request Limits

Handling Errors

Recommended Retry Strategy

Concurrency Limits

Endpoint Guidelines

Best Practices

Need Higher Limits?