Overview
LokutorTTSService generates speech from text using Lokutor’s
text-to-speech API over a persistent WebSocket connection. Lokutor runs on
CPU-only infrastructure and supports multiple languages (EN, ES, FR, PT, KO).
The service outputs TTSAudioRawFrames.
Source Repository
Source code, examples, and issues for the Lokutor integration
PyPI Package
The
pipecat-lokutor package on PyPILokutor App
Get an API key and test available voices
Documentation
Lokutor API and service documentation
Installation
This is a community-maintained package distributed separately frompipecat-ai:
Prerequisites
Lokutor Account Setup
Before using the Lokutor TTS service, you need:- Lokutor Account: Sign up and get an API key at app.lokutor.com
- API Key: Used to authenticate the WebSocket connection
Required Environment Variables
LOKUTOR_API_KEY: Your Lokutor API key (used by the example to populateapi_key)
Configuration
Lokutor API key used to authenticate the WebSocket connection.
Identifier of the voice to use. Browse and test voices at
app.lokutor.com.
Output audio sample rate in Hz.
Runtime input parameters for generation. See Input
Parameters below.
Optional Pipecat TTS settings applied on top of the defaults.
Base WebSocket URL for the Lokutor API.
Input Parameters
Runtime-configurable parameters passed via theparams constructor argument
using LokutorTTSService.InputParams(...).
| Parameter | Type | Default | Description |
|---|---|---|---|
language | Language | None | Synthesis language (EN, ES, FR, PT, KO). |
speed | float | 1.0 | Speech speed multiplier. |
steps | int | 5 | Number of synthesis steps. |
visemes | bool | False | Whether to request viseme data. |
Available parameters and defaults are defined by the integration. See the
source repository for the
authoritative, up-to-date list.