GitHub - KaMeLoTmArMoT/Qwen_TTS_Api: FastAPI wrapper for Qwen3-TTS CustomVoice: generate chapter WAV from narr/dialog/pause spans for InfiniteBook.

Qwen TTS API

A small FastAPI service that wraps Qwen3-TTS (CustomVoice) behind a simple HTTP API for WAV generation.
Built as a companion module for InfiniteBook.

Report Bug · Request Feature

About The Project

Qwen_TTS_Api is a lightweight HTTP service to synthesize narration/dialog audio as a single WAV file from a list of spans (narr/dialog/pause).
It exists so InfiniteBook can use Qwen3-TTS like any other TTS provider without embedding heavy GPU model code into the web app process.

Key Features

Single request “chapter render”: send spans → receive one WAV (server handles batching + stitching).
Model lifecycle endpoints: load/unload + state check.
Docker-first deployment so the main app stays simple.

Getting Started

Prerequisites

NVIDIA GPU + drivers (recommended).
Docker (and NVIDIA Container Toolkit if you want --gpus all).

Installation (clone)

git clone https://github.com/KaMeLoTmArMoT/Qwen_TTS_Api.git
cd Qwen_TTS_Api

Docker

Build

docker build -t qwen-tts-api .

Run

Expose the API on port 8001 (pick any host port you want):

docker run --rm --gpus all -p 8001:8001 qwen-tts-api

Optional (if you want to preload a specific model at startup, depending on how you wired the container):

docker run --rm --gpus all -p 8001:8001 \
  -e QWEN_MODEL_ID="Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice" \
  qwen-tts-api

InfiniteBook integration

In InfiniteBook, select the provider and point it at the container URL. github

Minimal `.env` (InfiniteBook)

IB_TTS_PROVIDER=qwen
IB_QWEN_TTS_URL=http://127.0.0.1:8001
IB_QWEN_MODEL_ID=Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

License

This project is licensed under the MIT License — see LICENSE.

Acknowledgments

README structure inspired by Best-README-Template.
Built with assistance from generative AI tools for ideation and code suggestions; all changes were reviewed and tested by the author.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
assets		assets
server		server
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Qwen TTS API

About The Project

Key Features

Getting Started

Prerequisites

Installation (clone)

Docker

Build

Run

InfiniteBook integration

Minimal `.env` (InfiniteBook)

License

Acknowledgments

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Qwen TTS API

About The Project

Key Features

Getting Started

Prerequisites

Installation (clone)

Docker

Build

Run

InfiniteBook integration

Minimal .env (InfiniteBook)

License

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages

Minimal `.env` (InfiniteBook)