Documentation Index
Fetch the complete documentation index at: https://docs.bithuman.ai/llms.txt
Use this file to discover all available pages before exploring further.

Pick your path
bitHuman ships two models and three runtime surfaces. Your choice depends on what you’re shipping (web, server, or native app) and which avatar style fits.Two models
| Model | What it is | Avatar source | Compute | Best for |
|---|---|---|---|---|
| Essence | CPU-only pre-rendered animations with real-time lip-sync | .imx model file built from a photo or video on bithuman.ai | CPU (any laptop, Raspberry Pi, edge box) | Kiosks, 24/7 displays, full body + animal mode, custom gestures |
| Expression | AI-generated facial animation from a single image | Any face image — no avatar generation step | NVIDIA GPU or Apple Silicon M3+ (on-device) | Dynamic conversations, custom faces, native consumer apps |
Three runtime surfaces
| Surface | What it is | Models supported | Best for |
|---|---|---|---|
| bitHuman Cloud | Managed runtime on api.bithuman.ai. Use via the LiveKit plugin (pip install livekit-plugins-bithuman) or the REST API directly. | Essence, Expression | Web apps, fastest setup, no infra. |
| Self-hosted server | Run on your own hardware. Python SDK (pip install bithuman) for Essence on CPU, or Docker container for Expression on Linux + NVIDIA. | Essence (CPU), Expression (GPU) | On-prem privacy, edge devices, high volume. |
| On-device Apple Silicon | bitHumanKit Swift Package + bithuman-cli Homebrew tool. All inference (STT, LLM, TTS, lip-sync) runs locally; only a 1-req/min billing heartbeat goes out. | Expression (M3+ Mac, M4+ iPad Pro, A18 Pro+ iPhone) | Native consumer / pro-sumer apps where everything must run locally. |
5-minute quickstart
Pick the surface that matches what you’re building:What can you build?
Customer Support
Replace hold music with a talking avatar that answers questions.
AI Tutor
A patient virtual teacher that explains and adapts to students.
Digital Receptionist
Lobby kiosks and tablets that greet and direct visitors.
AI Companion
Persistent characters with personality and memory.
Game NPC
Dynamic characters that react to player actions with gestures.
Video Generation
Batch-generate talking-head videos from scripts.
Developer guides
Quickstart
Python SDK in 5 minutes
Swift SDK
On-device for Mac/iPad/iPhone
REST API
Agent generation, speak, dynamics
Avatar Sessions
LiveKit Cloud, CPU, GPU — every mode
Examples
Working examples for every surface
Chat Widget
Floating chat with text, voice, and video modes
Core SDK API (Python)
| Method | Description |
|---|---|
AsyncBithuman.create(model_path, api_secret) | Initialize the avatar runtime |
runtime.start() | Begin processing |
runtime.push_audio(data, sample_rate) | Send audio for lip-sync |
runtime.flush() | Signal end of audio input |
runtime.run() | Async generator yielding video + audio frames |
runtime.get_frame_size() | Returns (width, height) of output |
REST API
| Endpoint | Method | Description |
|---|---|---|
/v1/validate | POST | Verify API secret |
/v1/agent/generate | POST | Generate new avatar agent |
/v1/agent/{code} | GET/POST | Get or update agent |
/v1/agent/{code}/speak | POST | Make avatar speak text |
/v1/agent/{code}/add-context | POST | Inject silent knowledge |
/v1/files/upload | POST | Upload image/video/audio |
/v1/dynamics/generate | POST | Generate gesture animations |
/v2/credit-summaries | GET | Check credit balance and plan details |
/v1/embed-tokens/request | POST | Generate JWT token for website embedding |
https://api.bithuman.ai — Auth: api-secret header from Developer → API Keys — Full reference →
Platform support
| Platform | Status | Notes |
|---|---|---|
| Linux (x86_64) | Full Support | Production-ready; Essence CPU + Expression GPU container |
| Linux (ARM64) | Full Support | Edge deployments (Essence CPU) |
| macOS (Apple Silicon) | Full Support | Essence on any Apple Silicon; Expression on-device requires M3+ |
| iPadOS / iOS | Full Support | Expression on-device — iPad Pro M4+, iPhone 16 Pro+ via Swift SDK |
| Windows (x86_64) | Full Support | Essence native wheels for Python 3.9–3.14, or WSL |
| NVIDIA GPU container | Full Support | Linux + Docker — Expression with dynamic face from any image |
AI agent integration
bitHuman providesllms.txt and an OpenAPI specification for AI coding agent discoverability:
- llms.txt — Curated documentation index for LLM consumption
- llms-full.txt — Complete documentation in single markdown file
- OpenAPI Spec — Machine-readable API contract
- AGENTS.md — Repository-level agent instructions

