
Documentation Index

Fetch the complete documentation index at: https://docs.bithuman.ai/llms.txt

Use this file to discover all available pages before exploring further.

bitHuman Avatars

bitHuman creates digital avatars that lip-sync to audio in real time. Feed audio in, get an animated face out at 25 FPS. Use it to build AI companions, customer-support avatars, virtual tutors, game NPCs, and anything that needs a visual character that speaks.

Pick your path

bitHuman ships two models and three runtime surfaces. Your choice depends on what you’re shipping (web, server, or native app) and which avatar style fits.

Two models

| Model | What it is | Avatar source | Compute | Best for |
|---|---|---|---|---|
| Essence | CPU-only pre-rendered animations with real-time lip-sync | imx model file built from a photo or video on bithuman.ai | CPU (any laptop, Raspberry Pi, edge box) | Kiosks, 24/7 displays, full body + animal mode, custom gestures |
| Expression | AI-generated facial animation from a single image | Any face image — no avatar generation step | NVIDIA GPU or Apple Silicon M3+ (on-device) | Dynamic conversations, custom faces, native consumer apps |

Three runtime surfaces

| Surface | What it is | Models supported | Best for |
|---|---|---|---|
| bitHuman Cloud | Managed runtime on api.bithuman.ai. Use via the LiveKit plugin (`pip install livekit-plugins-bithuman`) or the REST API directly. | Essence, Expression | Web apps, fastest setup, no infra |
| Self-hosted server | Run on your own hardware. Python SDK (`pip install bithuman`) for Essence on CPU, or Docker container for Expression on Linux + NVIDIA. | Essence (CPU), Expression (GPU) | On-prem privacy, edge devices, high volume |
| On-device Apple Silicon | bitHumanKit Swift Package + bithuman-cli Homebrew tool. All inference (STT, LLM, TTS, lip-sync) runs locally; only a 1-req/min billing heartbeat goes out. | Expression (M3+ Mac, M4+ iPad Pro, A18 Pro+ iPhone) | Native consumer / pro-sumer apps where everything must run locally |
The full matrix and decision tree: How it works → · Essence vs Expression →.

5-minute quickstart

Pick the surface that matches what you're building. For example, to run the Essence + bitHuman Cloud example:

```bash
git clone https://github.com/bithuman-product/bithuman-examples.git
cd bithuman-examples/essence-cloud

cp .env.example .env
# Edit .env: set BITHUMAN_API_SECRET, BITHUMAN_AGENT_ID, OPENAI_API_KEY

docker compose up
# Open http://localhost:4202
```

Each surface maps to a working example in bithuman-examples.

What can you build?

Customer Support

Replace hold music with a talking avatar that answers questions.

AI Tutor

A patient virtual teacher that explains and adapts to students.

Digital Receptionist

Lobby kiosks and tablets that greet and direct visitors.

AI Companion

Persistent characters with personality and memory.

Game NPC

Dynamic characters that react to player actions with gestures.

Video Generation

Batch-generate talking-head videos from scripts.

See all use cases with architecture patterns →

Developer guides

Quickstart

Python SDK in 5 minutes

Swift SDK

On-device for Mac/iPad/iPhone

REST API

Agent generation, speak, dynamics

Avatar Sessions

LiveKit Cloud, CPU, GPU — every mode

Examples

Working examples for every surface

Chat Widget

Floating chat with text, voice, and video modes

Core SDK API (Python)

| Method | Description |
|---|---|
| `AsyncBithuman.create(model_path, api_secret)` | Initialize the avatar runtime |
| `runtime.start()` | Begin processing |
| `runtime.push_audio(data, sample_rate)` | Send audio for lip-sync |
| `runtime.flush()` | Signal end of audio input |
| `runtime.run()` | Async generator yielding video + audio frames |
| `runtime.get_frame_size()` | Returns `(width, height)` of output |
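The table above implies a simple streaming loop: create the runtime, push PCM audio, and consume frames. The sketch below follows those method names, but the import path, the 16 kHz sample rate, whether `push_audio` must be awaited, and the frame attributes are all assumptions — check the full SDK reference before relying on them.

```python
import asyncio
import math
import struct


def make_pcm_tone(freq_hz: float = 220.0, seconds: float = 1.0,
                  sample_rate: int = 16000) -> bytes:
    """Generate 16-bit mono little-endian PCM — stand-in audio for push_audio()."""
    n = int(seconds * sample_rate)
    samples = (int(32767 * 0.3 * math.sin(2 * math.pi * freq_hz * i / sample_rate))
               for i in range(n))
    return struct.pack(f"<{n}h", *samples)


async def drive_avatar(model_path: str, api_secret: str) -> None:
    # Assumes `pip install bithuman`; the import path is a guess from the package name.
    from bithuman import AsyncBithuman

    runtime = await AsyncBithuman.create(model_path=model_path, api_secret=api_secret)
    await runtime.start()
    runtime.push_audio(make_pcm_tone(), 16000)  # send audio for lip-sync
    runtime.flush()                             # signal end of audio input
    async for frame in runtime.run():           # video + audio frames at 25 FPS
        ...  # render / encode each frame here


# asyncio.run(drive_avatar("avatar.imx", "YOUR_API_SECRET"))
```

The PCM helper is plain stdlib and works anywhere; in a real app you would feed TTS output instead of a test tone.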

REST API

| Endpoint | Method | Description |
|---|---|---|
| `/v1/validate` | POST | Verify API secret |
| `/v1/agent/generate` | POST | Generate new avatar agent |
| `/v1/agent/{code}` | GET/POST | Get or update agent |
| `/v1/agent/{code}/speak` | POST | Make avatar speak text |
| `/v1/agent/{code}/add-context` | POST | Inject silent knowledge |
| `/v1/files/upload` | POST | Upload image/video/audio |
| `/v1/dynamics/generate` | POST | Generate gesture animations |
| `/v2/credit-summaries` | GET | Check credit balance and plan details |
| `/v1/embed-tokens/request` | POST | Generate JWT token for website embedding |

Base URL: https://api.bithuman.ai — Auth: `api-secret` header from Developer → API Keys. Full reference →
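As a sketch of how a call is assembled, here is a stdlib-only request builder for the `/v1/agent/{code}/speak` endpoint. The base URL, path, and `api-secret` header come from the table above; the JSON body field name (`text`) is an assumption — consult the full reference for the actual schema.

```python
import json
import urllib.request

BASE_URL = "https://api.bithuman.ai"


def build_speak_request(agent_code: str, text: str,
                        api_secret: str) -> urllib.request.Request:
    """Build (but do not send) a POST to /v1/agent/{code}/speak."""
    payload = json.dumps({"text": text}).encode()  # body field name is a guess
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/agent/{agent_code}/speak",
        data=payload,
        headers={"api-secret": api_secret, "Content-Type": "application/json"},
        method="POST",
    )


req = build_speak_request("abc123", "Hello there!", "YOUR_API_SECRET")
# urllib.request.urlopen(req)  # uncomment to actually send the request
```

Separating request construction from sending makes the auth header and payload easy to unit-test without touching the network.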

Platform support

| Platform | Status | Notes |
|---|---|---|
| Linux (x86_64) | Full Support | Production-ready; Essence CPU + Expression GPU container |
| Linux (ARM64) | Full Support | Edge deployments (Essence CPU) |
| macOS (Apple Silicon) | Full Support | Essence on any Apple Silicon; Expression on-device requires M3+ |
| iPadOS / iOS | Full Support | Expression on-device — iPad Pro M4+, iPhone 16 Pro+ via Swift SDK |
| Windows (x86_64) | Full Support | Essence native wheels for Python 3.9–3.14, or WSL |
| NVIDIA GPU container | Full Support | Linux + Docker — Expression with dynamic face from any image |

AI agent integration

bitHuman provides llms.txt and an OpenAPI specification for AI coding agent discoverability:
  • llms.txt — Curated documentation index for LLM consumption
  • llms-full.txt — Complete documentation in single markdown file
  • OpenAPI Spec — Machine-readable API contract
  • AGENTS.md — Repository-level agent instructions
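An agent consuming these files typically fetches llms.txt and extracts page links before drilling into individual docs. The sketch below parses an index in the standard llms.txt convention (markdown link lists); the sample text is illustrative, not the real file.

```python
import re

# Matches markdown links: [title](https://...)
LINK_RE = re.compile(r"\[([^\]]+)\]\((https?://[^)\s]+)\)")


def index_links(llms_txt: str) -> dict:
    """Extract {title: url} pairs from an llms.txt-style markdown index."""
    return dict(LINK_RE.findall(llms_txt))


# Illustrative sample in the llms.txt convention (not the live index):
sample = """# bitHuman
> Real-time avatar documentation.

## Docs
- [Quickstart](https://docs.bithuman.ai/quickstart): Python SDK in 5 minutes
- [REST API](https://docs.bithuman.ai/rest-api): Agent generation, speak, dynamics
"""
links = index_links(sample)

# To fetch the live index instead:
# import urllib.request
# text = urllib.request.urlopen("https://docs.bithuman.ai/llms.txt").read().decode()
```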