Piper TTS API

A self-hosted Text-to-Speech (TTS) API built using Flask and Piper TTS. This API allows you to generate speech from text, supporting multiple voices and output formats (WAV or MP3). The project is designed for ease of use, with features like caching, input validation, and error handling.

Overview

Repository: https://github.com/azim-charaniya/piper-tts-api.git
Technologies: Python, Flask, Piper TTS, Conda for dependency management, and Docker for containerization.
Features:
- Supports three high-quality voices: en_us, en_gb, and en_us_female.
- Handles text up to 500 words, splitting longer texts into paragraphs.
- API endpoint: /tts (POST request).
- Caching for generated audio to improve performance.
- Runs on port 17100.
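
The feature list does not say how longer texts are split, so the following is only a minimal sketch of one possible approach. It assumes paragraphs are separated by blank lines and that an oversized paragraph is cut on word boundaries; the name split_into_chunks is hypothetical and is not taken from app.py.

```python
import re

MAX_WORDS = 500  # limit quoted in the feature list


def split_into_chunks(text, max_words=MAX_WORDS):
    """Split text into paragraph-based chunks of at most max_words words.

    Assumption: paragraphs are separated by blank lines. Whitespace inside a
    paragraph is normalized to single spaces.
    """
    chunks = []
    for paragraph in re.split(r"\n\s*\n", text.strip()):
        words = paragraph.split()
        # Emit the paragraph in slices so that no chunk exceeds the limit.
        for start in range(0, len(words), max_words):
            chunks.append(" ".join(words[start:start + max_words]))
    return chunks
```

Each returned chunk could then be sent to /tts as its own request.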

Prerequisites

Python 3.9+: For local development.
Conda: For managing dependencies (install via anaconda.com).
Docker and Docker Compose: For containerized deployment.
Dependencies: Listed in requirements.txt and environment.yml.

Installation

Clone the Repository:

git clone https://github.com/azim-charaniya/piper-tts-api.git
cd piper-tts-api

Set Up Dependencies (Local Development):

Create and activate a Conda environment:

conda env create -f environment.yml  # Creates the environment from environment.yml
conda activate tts-api-env

Install any additional pip dependencies:
```
pip install -r requirements.txt
```

Download Voice Models:

Place voice models (e.g., en_US-ryan-high.onnx) in the voices/ directory. You can download them from github.com/rhasspy/piper.
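
Piper voices are normally distributed as a model file plus a matching .onnx.json config file, so a quick check of the voices/ directory can catch incomplete downloads before the API is started. The sketch below assumes the config sits next to the model under the same name. The function name is hypothetical, and whether this API needs the config file is an assumption to verify against app.py.

```python
from pathlib import Path


def find_models_missing_config(voices_dir="voices"):
    """Return names of .onnx models that lack a sibling .onnx.json file."""
    missing = []
    for model in sorted(Path(voices_dir).glob("*.onnx")):
        # e.g. en_US-ryan-high.onnx -> en_US-ryan-high.onnx.json
        config = model.with_name(model.name + ".json")
        if not config.is_file():
            missing.append(model.name)
    return missing
```

An empty list means every model found has its config file next to it.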

Running the Application

Locally

Activate the Conda environment: conda activate tts-api-env
Start the Flask app: python app.py
Access the API at http://localhost:17100.

Using Docker Compose

Build and start the container:

docker-compose up --build

Access the API at http://localhost:17100.
Stop the container:

docker-compose down

API Documentation

The API uses a POST endpoint at /tts. Send a JSON body with the following parameters:

Request Body (JSON)

{
"text": "Required. Text to synthesize (up to 500 words).",
"voice": "Required. One of: 'en_us', 'en_gb', 'en_us_female'.",
"format": "Optional. 'wav' or 'mp3' (default: 'wav').",
"speaker_id": "Optional. Speaker ID (default: 0).",
"length_scale": "Optional. Phoneme length scale (e.g., 1.0).",
"noise_scale": "Optional. Generator noise scale (e.g., 0.5).",
"noise_w": "Optional. Phoneme width noise (e.g., 0.3).",
"sentence_silence": "Optional. Seconds of silence after sentences (default: 0.0)."
}
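
As a rough illustration of the rules above, a client could check a request body before sending it. This is a sketch based only on the fields listed here: validate_payload is a hypothetical helper, and the server's own validation may differ (for example, it does not cover the engine field used in some of the examples).

```python
ALLOWED_VOICES = {"en_us", "en_gb", "en_us_female"}
ALLOWED_FORMATS = {"wav", "mp3"}
NUMERIC_FIELDS = ("length_scale", "noise_scale", "noise_w", "sentence_silence")


def validate_payload(payload):
    """Return a list of problems found in a /tts request body (empty if none)."""
    problems = []
    text = payload.get("text")
    if not isinstance(text, str) or not text.strip():
        problems.append("text is required")
    if payload.get("voice") not in ALLOWED_VOICES:
        problems.append("voice must be one of: " + ", ".join(sorted(ALLOWED_VOICES)))
    if payload.get("format", "wav") not in ALLOWED_FORMATS:
        problems.append("format must be 'wav' or 'mp3'")
    for name in NUMERIC_FIELDS:
        value = payload.get(name)
        # bool is a subclass of int in Python, so reject it explicitly.
        if value is not None and (isinstance(value, bool) or not isinstance(value, (int, float))):
            problems.append(f"{name} must be a number")
    return problems
```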

Example cURL Requests

Use these to test the API:

Basic Request:

curl -X POST "http://localhost:17100/tts" \
     -H "Content-Type: application/json" \
     -d '{"text": "Hello, world!", "voice": "en_us"}' \
     -o output.wav

With Additional Parameters:

curl -X POST "http://localhost:17100/tts" \
     -H "Content-Type: application/json" \
     -d '{"text": "This is a test.", "voice": "en_gb", "format": "mp3", "speaker_id": 1}' \
     -o output.mp3

Persian TTS Engine:

curl -X POST http://localhost:17100/tts \
  -H "Content-Type: application/json" \
  -d '{"engine": "persian", "text": "سلام دنیا", "format": "wav"}' \
  --output output.wav

Facebook TTS Engine:

curl -X POST http://localhost:17100/tts \
  -H "Content-Type: application/json" \
  -d '{"engine": "facebook", "text": "Hello world", "format": "mp3"}' \
  --output output.mp3
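
The same requests can be sent from Python. The sketch below uses only the standard library and mirrors the curl examples above. The helper names build_request and synthesize are illustrative, and the code assumes the server returns the raw audio bytes in the response body, as the -o and --output flags in the curl examples suggest.

```python
import json
import urllib.request

API_URL = "http://localhost:17100/tts"


def build_request(payload, url=API_URL):
    """Build the POST request that the curl examples send."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(payload, out_path, url=API_URL, timeout=60):
    """Send the request and write the returned audio bytes to out_path."""
    # urlopen raises urllib.error.HTTPError for 4xx and 5xx responses.
    with urllib.request.urlopen(build_request(payload, url), timeout=timeout) as response:
        audio = response.read()
    with open(out_path, "wb") as handle:
        handle.write(audio)
    return out_path
```

With the server running, synthesize({"text": "Hello, world!", "voice": "en_us"}, "output.wav") would correspond to the basic request.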

Troubleshooting

- Errors with Voice Models: Ensure the model files are in the voices/ directory, and check the logs for file-not-found errors.
- Port Conflicts: If port 17100 is in use, change APP_PORT in app.py.
- Conda Issues: If dependencies fail to install, update the environment with conda env update -f environment.yml.
- Docker Problems: Run docker-compose logs for details, or rebuild with docker-compose build --no-cache.
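
To confirm a port conflict before editing APP_PORT, a short standard-library check can report whether anything is already listening on the port. This is a generic sketch, not part of the project. The function name port_in_use is hypothetical.

```python
import socket


def port_in_use(port, host="127.0.0.1"):
    """Return True if something accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as probe:
        probe.settimeout(1.0)
        # connect_ex returns 0 on success instead of raising an exception.
        return probe.connect_ex((host, port)) == 0
```

Calling port_in_use(17100) while the API is stopped shows whether another process holds the port.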

Contributing

Feel free to fork this repository and submit pull requests. For issues, create a new ticket on GitHub.

Docker Compose

version: '3.9'

services:
  tts-api:
    image: piper-tts-api:latest
    build: # Build the image from the Dockerfile
      context: https://github.com/azim-charaniya/piper-tts-api.git
      dockerfile: Dockerfile
    ports:
      - "127.0.0.1:17100:17100"
    volumes:
      - ./cache:/app/cache  
    environment:
      - APP_PORT=17100
    restart: unless-stopped
    hostname: piper-tts-server
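
The ./cache:/app/cache volume keeps generated audio on the host across container restarts. How the application names its cache files is not documented here. The sketch below shows one common scheme, hashing the request parameters, purely as an assumption: cache_path is a hypothetical helper and is not the project's actual implementation.

```python
import hashlib
import json
from pathlib import Path


def cache_path(payload, cache_dir="cache"):
    """Map a request body to a deterministic file path inside cache_dir."""
    # Sorting keys makes the result independent of field order in the request.
    canonical = json.dumps(payload, sort_keys=True, ensure_ascii=False)
    digest = hashlib.sha256(canonical.encode("utf-8")).hexdigest()
    extension = payload.get("format", "wav")
    return Path(cache_dir) / f"{digest}.{extension}"
```

A handler using this scheme would return the file at that path if it exists and synthesize the audio only on a miss.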

Building the Docker Image

docker build -t piper-tts-api .

MIT License

Copyright (c) 2025 Azim Charaniya (azim.one)

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
