Quick Start
What It Does
- Captures audio from your default microphone
- Creates real-time avatar animation as you speak
- Shows live video using LocalVideoPlayer
- Automatically detects voice activity and silence
- Real-time audio processing at 24kHz
- Voice activity detection with configurable threshold (-40dB)
- Automatic silence detection (3-second timeout)
- Local audio/video processing (no web interface)
Command Line Options
| Option | Default | Description |
|---|---|---|
--model | env | Path to .imx model file |
--api-secret | env | Your bitHuman API secret |
--volume | 1.0 | Audio volume multiplier |
--silent-threshold-db | -40 | Silence threshold in dB |
--echo | off | Enable audio echo for testing |
Advanced Usage
Common Issues
| Problem | Solution |
|---|---|
| No microphone input | Check microphone permissions in system settings |
| Avatar not responding | Speak louder or adjust --silent-threshold-db to lower value |
| Performance lag | Close other audio applications, use wired microphone |
| Audio echo/feedback | Don’t use --echo flag, use headphones |
Technical Details
| Component | Specification |
|---|---|
| Audio sample rate | 24kHz |
| Input | Mono microphone |
| Buffer | 240 samples per chunk (10ms) |
| Silence detection | -40dB threshold, 3s timeout |
