Version: 2.0

Vectara and Speechmatics

Speechmatics provides real-time speech-to-text transcription with low latency in more than 55 languages. By combining Vectara's RAG capabilities and the LiveKit voice agent framework, you can build conversational AI assistants with enterprise-grade accuracy.

Integration benefits

Enables real-time voice interactions with your Vectara corpora.
Provides low latency (less than 1 second) speech-to-text transcription in 55+ languages.
Leverages LiveKit's open-source agent framework for seamless voice agent orchestration.
Supports flexible deployment options: cloud, on-premises, or on-device.

Architecture

The integration combines three key components:

Speechmatics - Real-time speech-to-text transcription and text-to-speech (TTS) for agent response.
Vectara - Knowledge retrieval and RAG-powered responses.
LiveKit - Voice agent orchestration and audio streaming.

Getting started

To build your own voice agent with Vectara and Speechmatics:

Set up LiveKit - Follow the LiveKit Agents quickstart.
Configure Speechmatics - Add your Speechmatics API key for real-time STT.
Connect Vectara - Integrate Vectara's query API for knowledge retrieval.
Add text-to-speech (TTS) - Choose a text-to-speech provider such as Speechmatics or ElevenLabs.

Integration benefits​

Architecture​

Getting started​

Integration benefits

Architecture

Getting started