Self-hosted ASR service with multi-model support, real-time streaming, and drop-in compatibility with the ElevenLabs STT API.
# Change the URL and API key; your ElevenLabs code works as-is
curl -X POST https://asr.doubtech.com/v1/speech-to-text \
-H "xi-api-key: YOUR_KEY" \
-F "file=@audio.mp3" \
-F "model_id=openai/whisper-large-v3-turbo"

Production-ready speech recognition with the flexibility to choose your models, your infrastructure, your rules.
Deploy and compare multiple ASR models side-by-side. Whisper, Qwen3, and more — pick the best model for your use case.
Stream audio via WebSocket for live transcription. See results appear as you speak with sub-second latency.
Transcribe audio in over 50 languages with automatic language detection. From English to Cantonese, we've got you covered.
Run entirely on your own infrastructure. Your audio never leaves your network. Full control over data and models.
Drop-in replacement for the ElevenLabs STT API. Change the URL and API key — your existing integration just works.
Deploy, monitor, and manage ASR model containers with a desktop app. One-click deployment, rebuild, and health checks.
Switch from ElevenLabs to your own infrastructure by changing two lines of config. Same endpoints, same request format, same response schema. Your existing code just works.
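As a rough illustration of the two-line switch, the sketch below keeps the base URL and API key in one small config object and builds the same request from either one. The `SttConfig` class, the `list_models_request` helper, and the `ELEVENLABS_KEY` placeholder are hypothetical names introduced for this example; only the `xi-api-key` header and the `/v1/models` path come from this page.

```python
from dataclasses import dataclass
import urllib.request


@dataclass(frozen=True)
class SttConfig:
    """Hypothetical config holder: the only two values that change."""
    base_url: str
    api_key: str


# Before: hosted ElevenLabs API (placeholder key).
ELEVENLABS = SttConfig("https://api.elevenlabs.io", "ELEVENLABS_KEY")
# After: self-hosted service, same paths and header name.
SELF_HOSTED = SttConfig("https://asr.doubtech.com", "YOUR_KEY")


def list_models_request(cfg: SttConfig) -> urllib.request.Request:
    """Build (but do not send) a GET /v1/models request for the given config."""
    return urllib.request.Request(
        f"{cfg.base_url}/v1/models",
        headers={"xi-api-key": cfg.api_key},
        method="GET",
    )


if __name__ == "__main__":
    req = list_models_request(SELF_HOSTED)
    print(req.get_method(), req.full_url)
```

Passing `ELEVENLABS` instead of `SELF_HOSTED` is the whole migration in this sketch; nothing else in the calling code would need to change if the endpoints behave as described.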
/v1/models
/v1/speech-to-text
/v1/speech-to-text/realtime

Deploy multiple ASR models and compare accuracy, speed, and language support. Add new models with one click.
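One way to compare models from a script is to send the same audio file to `/v1/speech-to-text` once per model ID and record the elapsed time and transcript. This is a minimal sketch using only the Python standard library: `build_multipart`, `transcription_request`, and `compare_models` are hypothetical helpers, and reading the transcript from a `text` field is an assumption based on the ElevenLabs-style response schema; model IDs should be taken from `/v1/models`.

```python
import json
import time
import urllib.request
import uuid

BASE_URL = "https://asr.doubtech.com"
API_KEY = "YOUR_KEY"


def build_multipart(fields, file_name, file_bytes):
    """Encode form fields plus one 'file' part as multipart/form-data."""
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode()
        )
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
        f'Content-Type: application/octet-stream\r\n\r\n'.encode()
        + file_bytes + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode())
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def transcription_request(model_id, file_name, audio):
    """Build a POST /v1/speech-to-text request, mirroring the curl example."""
    body, content_type = build_multipart({"model_id": model_id}, file_name, audio)
    return urllib.request.Request(
        f"{BASE_URL}/v1/speech-to-text",
        data=body,
        headers={"xi-api-key": API_KEY, "Content-Type": content_type},
        method="POST",
    )


def compare_models(model_ids, file_name, audio, opener=urllib.request.urlopen):
    """Return {model_id: (seconds, transcript)} for the same audio clip."""
    results = {}
    for model_id in model_ids:
        req = transcription_request(model_id, file_name, audio)
        start = time.perf_counter()
        with opener(req) as resp:
            payload = json.load(resp)
        # Assumption: the transcript is returned under the "text" key.
        results[model_id] = (time.perf_counter() - start, payload.get("text"))
    return results
```

A call such as `compare_models(["openai/whisper-large-v3-turbo"], "audio.mp3", open("audio.mp3", "rb").read())` would time a single model; adding more IDs to the list runs them one after another on the same clip. The `opener` parameter exists only so the network call can be swapped out in tests.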
Three steps to your first transcription.
# Sign in and create a key at
https://asr.doubtech.com/api-keys

curl https://asr.doubtech.com/v1/models \
-H "xi-api-key: YOUR_KEY"

curl -X POST https://asr.doubtech.com/v1/speech-to-text \
-H "xi-api-key: YOUR_KEY" \
-F "file=@audio.mp3" \
-F "model_id=openai/whisper-large-v3-turbo"

Start using your own ASR service today. No usage limits, no data leaving your network.