OpenAI-compatible audio server in Docker. 7 ASR backends (Whisper, Distil-Whisper, Parakeet, Canary, Canary-Qwen) + 2 TTS engines (Kokoro, Qwen3-TTS voice cloning). Single /v1/audio/{transcriptions,speech,voices} surface. CPU + CUDA images. Hot model swap. MCP server built in.
subscribe to this server's releases (RSS) →
| version | published | age | src |
|---|---|---|---|
| v0.9.0 | 2026-06-09 07:37 | 8d ago | github |
| v0.8.0 | 2026-05-31 16:34 | 16d ago | github |
| v0.7.0 | 2026-05-31 10:17 | 16d ago | github |
| v0.6.1 | 2026-05-30 16:10 | 17d ago | github |
| v0.6.0 | 2026-05-30 15:45 | 17d ago | github |
| v0.5.0 | 2026-05-28 18:23 | 19d ago | github |
| v0.4.1 | 2026-05-28 17:30 | 19d ago | github |
| v0.4.0 | 2026-05-28 17:08 | 19d ago | github |
| v0.3.0 | 2026-05-28 14:32 | 19d ago | github |
| v0.2.1 | 2026-05-28 11:58 | 19d ago | github |
| v0.2.0 | 2026-05-28 11:39 | 19d ago | github |
| v0.1.0 | 2026-05-28 09:09 | 19d ago | github |