Decibri sends audio to integrations in three categories: speech-to-text (STT), voice activity detection (VAD), and keyword spotting (KWS). Pick the category first, then pick the provider that fits your accuracy, cost, and offline requirements.
Speech-to-text (STT): Transcribe microphone audio. Supported cloud providers are Deepgram, AssemblyAI, OpenAI, AWS Transcribe, Azure AI Speech, Google Cloud Speech-to-Text, and Mistral Voxtral. Supported local providers are Sherpa-ONNX and Whisper.cpp.
Voice activity detection (VAD): Detect when someone is speaking. Use the Silero v5 neural model for accuracy-critical use cases, or the built-in RMS detector for simple pipelines. Both are bundled with decibri.
Keyword spotting (KWS): Detect wake words and voice command triggers. This is cheaper than full STT when you only need to catch specific phrases. Sherpa-ONNX runs entirely on-device.
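To make the RMS option concrete, here is a minimal sketch of how an energy-based detector generally works. It is not decibri's implementation or API: the function names, the 0.02 threshold, the hangover length, and the assumption of 16-bit PCM frames are all hypothetical and chosen only for illustration.

```python
import math
from typing import Iterable, List, Sequence


def frame_rms(frame: Sequence[int]) -> float:
    """Root-mean-square level of one frame of 16-bit PCM samples, scaled to 0.0-1.0."""
    if not frame:
        return 0.0
    mean_square = sum(sample * sample for sample in frame) / len(frame)
    return math.sqrt(mean_square) / 32768.0


def detect_speech(
    frames: Iterable[Sequence[int]],
    threshold: float = 0.02,   # hypothetical level; tune for your microphone
    hangover_frames: int = 2,  # quiet frames still counted as speech after a loud one
) -> List[bool]:
    """Return one speech/no-speech flag per frame."""
    flags: List[bool] = []
    quiet_run = hangover_frames + 1  # start in the silent state
    for frame in frames:
        if frame_rms(frame) >= threshold:
            quiet_run = 0
        else:
            quiet_run += 1
        # Stay "in speech" through short pauses so words are not chopped up.
        flags.append(quiet_run <= hangover_frames)
    return flags


if __name__ == "__main__":
    loud = [8000] * 160   # synthetic frame well above the threshold
    quiet = [50] * 160    # synthetic frame near the noise floor
    print(detect_speech([quiet, loud, quiet, quiet, quiet, loud]))
```

A detector like this only measures loudness, so any loud noise counts as speech. That limitation is the usual reason to prefer a neural model when accuracy matters.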