AetherChat

AetherChat is a lightweight voice-chat API that converts user speech into text using VOSK, processes the request through an Ollama-hosted LLM, and responds using Kokoro TTS. A FastAPI backend (served via Uvicorn) powers the API, and the repository includes a browser-ready web UI.
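
The three-stage flow described above (speech to text, text to LLM reply, reply to audio) can be pictured as a simple chain. The following is only a sketch: `VoicePipeline` and its three callables are hypothetical stand-ins, not the names used in app.py, and the stubs in the demo do no real speech or model work.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Chains the three stages; every callable here is a hypothetical stand-in."""

    transcribe: Callable[[bytes], str]   # STT stage (VOSK in AetherChat)
    generate: Callable[[str], str]       # LLM stage (Ollama in AetherChat)
    synthesize: Callable[[str], bytes]   # TTS stage (Kokoro in AetherChat)

    def handle(self, audio_in: bytes) -> bytes:
        text = self.transcribe(audio_in)
        if not text.strip():
            # Nothing was recognised, so skip the LLM and TTS stages.
            return b""
        reply = self.generate(text)
        return self.synthesize(reply)


if __name__ == "__main__":
    # Stub stages so the sketch runs without any models installed.
    demo = VoicePipeline(
        transcribe=lambda audio: audio.decode("utf-8"),
        generate=lambda prompt: f"You said: {prompt}",
        synthesize=lambda reply: reply.encode("utf-8"),
    )
    print(demo.handle(b"hello"))
```

Passing the stages in as callables keeps each one replaceable, which mirrors the customisable voice and model selection listed under Features.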

Features

- Speech-to-Text (STT) via VOSK
- Locally hosted LLM through Ollama
- Text-to-Speech (TTS) synthesis using Kokoro
- FastAPI backend with automatic streaming responses
- Caddy reverse proxy with optional HTTPS
- Fully containerised with Docker Compose
- Customisable voices and model selection

Docker Setup

docker-compose.yml defines two core services:

1. `voicechat`

- Builds the Python backend
- Mounts models/vosk and kokoro
- Runs FastAPI on port 8000 inside the container
- Publishes it on host port 8888 (to avoid port conflicts)

2. `caddy`

- Fronts the API with HTTP/HTTPS
- Serves static UI files
- Proxies inbound traffic to the backend
- Generates internal TLS certificates automatically
- Listens on ports:
  - 8080 (HTTP)
  - 8443 (HTTPS)

Shared settings such as OLLAMA_HOST, LLM_MODEL, and VOSK_MODEL_PATH are passed to the `voicechat` service. The DOMAIN environment variable controls Caddy’s routing (default: voicechat.local).
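
As an illustration of how a backend could pick up these settings, here is a minimal sketch that reads the three variables from the environment. The `Settings` class, the `load_settings` helper, and all default values are assumptions made for this example (only the Ollama port 11434 is Ollama's documented default); the actual handling in app.py may differ.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    ollama_host: str
    llm_model: str
    vosk_model_path: str


# Placeholder defaults for illustration only; the real values are set in docker-compose.yml.
_DEFAULTS = {
    "OLLAMA_HOST": "http://localhost:11434",
    "LLM_MODEL": "example-model",
    "VOSK_MODEL_PATH": "./models/vosk",
}


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Read settings from env (os.environ by default); unset or empty values use the defaults."""
    source = os.environ if env is None else env
    values = {key: source.get(key) or default for key, default in _DEFAULTS.items()}
    return Settings(
        ollama_host=values["OLLAMA_HOST"],
        llm_model=values["LLM_MODEL"],
        vosk_model_path=values["VOSK_MODEL_PATH"],
    )


if __name__ == "__main__":
    print(load_settings())
```

Accepting an optional mapping makes the loader easy to exercise without touching the real process environment.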

Running the Project

1. Ensure models are available

Place the required models in the following folders:

./models/vosk/
./kokoro/

(Include Kokoro binaries if you want TTS enabled.)
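
Before starting the stack, it can help to confirm that both folders exist and contain something. The helper below is a hedged sketch: `missing_model_dirs` is a hypothetical name, and it only checks that each directory is non-empty, not that the files inside are valid models.

```python
from pathlib import Path
from typing import Iterable, List


def missing_model_dirs(
    root: Path, required: Iterable[str] = ("models/vosk", "kokoro")
) -> List[str]:
    """Return the required directories that are absent or empty under root."""
    missing = []
    for rel in required:
        folder = root / rel
        # A folder counts as missing if it does not exist or has no entries.
        if not folder.is_dir() or not any(folder.iterdir()):
            missing.append(rel)
    return missing


if __name__ == "__main__":
    problems = missing_model_dirs(Path("."))
    if problems:
        print("Missing or empty:", ", ".join(problems))
    else:
        print("Model folders look populated.")
```

Run it from the repository root so the relative paths match the ones listed above.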

2. (Optional) Add /etc/hosts entry

If you're using the default internal domain:

127.0.0.1   voicechat.local
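
To confirm the entry took effect, the hosts file can be inspected programmatically. This is a small sketch under the assumption that the file follows the usual "address name [name ...]" layout; `hosts_lookup` is a hypothetical helper, and the demo uses an inline sample rather than reading /etc/hosts.

```python
from typing import Optional


def hosts_lookup(hosts_text: str, hostname: str) -> Optional[str]:
    """Return the first address mapped to hostname in hosts-file text, or None."""
    for raw_line in hosts_text.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        address, *names = line.split()
        if hostname in names:
            return address
    return None


if __name__ == "__main__":
    sample = "127.0.0.1 localhost\n127.0.0.1 voicechat.local\n"
    print(hosts_lookup(sample, "voicechat.local"))
```

To check a real machine, pass the contents of /etc/hosts as `hosts_text`.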

3. Start the stack

docker compose up --build

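Caddy automatically generates a local certificate (via tls internal); client devices still need to trust it, as described under Installing Caddy’s Local Certificate. Once the stack is up, open:

- https://voicechat.local:8443 – secure UI/API through Caddy
- http://voicechat.local:8888 – plain HTTP, served directly by the backend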

Installing Caddy’s Local Certificate

When using tls internal, Caddy acts as a local CA. You must install its certificate on clients before Safari/iOS/macOS will allow microphone access.

Extract it with:

docker cp voice-chat-caddy:/data/caddy/pki/authorities/local/root.crt ./caddy-root.crt
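
After extracting the file, you may want to compare its fingerprint with the one your device displays in the certificate details before trusting it. The sketch below assumes the extracted root.crt is PEM-encoded; `sha256_fingerprint` is a hypothetical helper built on the standard `ssl` and `hashlib` modules.

```python
import hashlib
import ssl


def sha256_fingerprint(pem_text: str) -> str:
    """Return the colon-separated SHA-256 fingerprint of a PEM certificate."""
    # Convert the PEM text to raw DER bytes, which is what fingerprints are computed over.
    der = ssl.PEM_cert_to_DER_cert(pem_text.strip())
    digest = hashlib.sha256(der).hexdigest().upper()
    return ":".join(digest[i:i + 2] for i in range(0, len(digest), 2))
```

A typical use would be to read ./caddy-root.crt as text and pass its contents to `sha256_fingerprint`, then compare the result by eye with the value shown on the client.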

macOS

- Double-click the certificate
- Add it to the “System” or “Login” keychain
- Set the trust level to Always Trust

iOS / iPadOS

- AirDrop or email the .crt file to the device
- Tap it to install
- Enable trust under Settings → General → About → Certificate Trust Settings

Once trusted, microphone access will work without warnings.

Available Voices

The default Kokoro pack exposes several selectable voices:

| Voice ID | Name | Notes |
|---|---|---|
| `af_sarah` | Sarah (EN Female) | Neutral, natural |
| `af_bella` | Bella (EN Female) | Bright, friendly |
| `af_sky` | Sky (EN Female) | Soft, airy |
| `bf_emma` | Emma (Anime EN) | Energetic, expressive |
| `bf_isabella` | Isabella (Anime EN) | Bright anime style |
| `bf_lily` | Lily (Shy EN) | Soft, shy |
| `ff_siwis` | French (Siwis) | Native French |

Add more voices by extending the Kokoro .bin file and registering them in app.py.
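
One way such a registration could look is a plain lookup table with a fallback. This is a sketch only: the `VOICES` dict, `DEFAULT_VOICE`, and `resolve_voice` are hypothetical names, the choice of default is an assumption, and the actual structure in app.py may be different. The voice IDs and names are copied from the table above.

```python
from typing import Optional

# Voice IDs and display names from the table above; the dict layout is hypothetical.
VOICES = {
    "af_sarah": "Sarah (EN Female)",
    "af_bella": "Bella (EN Female)",
    "af_sky": "Sky (EN Female)",
    "bf_emma": "Emma (Anime EN)",
    "bf_isabella": "Isabella (Anime EN)",
    "bf_lily": "Lily (Shy EN)",
    "ff_siwis": "French (Siwis)",
}

DEFAULT_VOICE = "af_sarah"  # assumed default for this example


def resolve_voice(requested: Optional[str]) -> str:
    """Return a registered voice ID, falling back to the default for unknown or missing input."""
    if requested and requested in VOICES:
        return requested
    return DEFAULT_VOICE


if __name__ == "__main__":
    print(resolve_voice("bf_lily"), "->", VOICES[resolve_voice("bf_lily")])
```

With a layout like this, registering a new voice is a one-line addition to the table, and unknown IDs from clients degrade to the default instead of raising an error.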
