Local Voice Pipeline on macOS (Essence avatar + Apple Speech)
Run an Essence avatar on Mac with the local Apple speech stack. Voice never leaves your device. M2+ supported. For on-device Expression avatars on M3+, see Expression on macOS.
Use this file to discover all available pages before exploring further.
Looking for the on-device Expression avatar (custom face, 25 FPS, no GPU)? See Expression on macOS. This page covers the local voice stack (STT/TTS/VAD) paired with an Essence avatar.
Preview — private distribution. The bithuman-voice wheel referenced below is not currently published on PyPI. Reach out on Discord or through bitHuman support to request access to the voice pipeline.
export BITHUMAN_API_SECRET="your_secret"export BITHUMAN_MODEL_PATH="/path/to/model.imx"export LIVEKIT_API_KEY="your_livekit_key"export LIVEKIT_API_SECRET="your_livekit_secret"export LIVEKIT_URL="wss://your-project.livekit.cloud"export OPENAI_API_KEY="your_openai_key" # Only for AI brain