Kitten TTS Chrome Extension

A text-to-speech system that runs entirely on the local machine: a Chrome extension sends text to a FastAPI backend, which synthesizes speech with the Kitten TTS neural model.

Overview

This project implements a local text-to-speech solution built on Kitten TTS, an open-source neural TTS engine, deployed as a FastAPI service on the user's own machine. Because all inference runs on-device, the extension does not depend on any external API, incurs no cloud round-trip latency, and never sends the text being read off the machine.
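
To make the idea of a local inference service concrete, here is a minimal sketch of what such a backend could look like. It is not this project's actual server: the `/speak` route, the `SpeakRequest` model, the 24 kHz sample rate, and the `placeholder_synthesize` function (which returns a sine tone instead of calling Kitten TTS) are all assumptions made for illustration.

```python
import io
import math
import struct
import wave

SAMPLE_RATE = 24_000  # assumed output rate; check the TTS model's documentation


def placeholder_synthesize(text: str) -> list[float]:
    """Stand-in for the real model call: a tone whose length scales with the text."""
    duration_s = max(0.2, 0.05 * len(text))
    n_samples = int(SAMPLE_RATE * duration_s)
    return [0.3 * math.sin(2 * math.pi * 440 * i / SAMPLE_RATE) for i in range(n_samples)]


def to_wav_bytes(samples: list[float], sample_rate: int = SAMPLE_RATE) -> bytes:
    """Encode float samples in [-1, 1] as 16-bit mono PCM WAV."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(sample_rate)
        clipped = (max(-1.0, min(1.0, s)) for s in samples)
        wav.writeframes(b"".join(struct.pack("<h", int(s * 32767)) for s in clipped))
    return buf.getvalue()


def create_app():
    """Build the FastAPI app; imported lazily so the helpers above work without FastAPI."""
    from fastapi import FastAPI, Response
    from pydantic import BaseModel

    class SpeakRequest(BaseModel):  # hypothetical request schema
        text: str

    app = FastAPI()

    @app.post("/speak")  # hypothetical route name
    def speak(req: SpeakRequest):
        # A real server would call the Kitten TTS model here instead of the placeholder.
        audio = to_wav_bytes(placeholder_synthesize(req.text))
        return Response(content=audio, media_type="audio/wav")

    return app
```

Assuming the file is saved as `server.py`, it could be started with `uvicorn server:create_app --factory --host 127.0.0.1`. Binding to `127.0.0.1` keeps the service reachable only from the local machine, which matches the on-device design described above.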

Architecture

The system follows a simple client-server pattern in which the Chrome extension sends requests to a local inference server: