OpenReader WebUI is a document reader with Text-to-Speech capabilities, offering a TTS read along experience with narration for EPUB, PDF, TXT, MD, and DOCX documents. It supports multiple TTS providers including OpenAI, Deepinfra, and custom OpenAI-compatible endpoints like Kokoro-FastAPI and Orpheus-FastAPI
- π― Multi-Provider TTS Support:
- OpenAI: tts-1, tts-1-hd, gpt-4o-mini-tts models with voices (alloy, echo, fable, onyx, nova, shimmer)
- Deepinfra: Kokoro-82M, Orpheus-3B, Sesame-1B models with extensive voice libraries
- Custom OpenAI-Compatible: Any OpenAI-compatible endpoint with custom voice sets
- πΎ Local-First Architecture: Uses IndexedDB browser storage for documents
- π Optional Server-side documents: Manually upload documents to the next backend for all users to download
- π Read Along Experience: Follow along with highlighted text as the TTS narrates
- π Document formats: EPUB, PDF, TXT, MD, DOCX (with libreoffice installed)
- π§ Audiobook Creation: Create and export audiobooks from PDF and ePub files (in m4b format with ffmpeg and aac TTS output)
- π¨ Customizable Experience:
- π Select TTS provider (OpenAI, Deepinfra, or Custom OpenAI-compatible)
- π Set TTS API base URL and optional API key
- π¨ Multiple app theme options
- And more...
- Native .docx support (currently requires libreoffice)
- Accessibility Improvements
- Recent version of Docker installed on your machine
- A TTS API server (Kokoro-FastAPI, Orpheus-FastAPI, Deepinfra, OpenAI, etc.) running and accessible
docker run --name openreader-webui \
-p 3003:3003 \
-v openreader_docstore:/app/docstore \
ghcr.io/richardr1126/openreader-webui:latest(Optionally): Set the TTS API_BASE URL and/or API_KEY to be default for all devices
docker run --name openreader-webui \
-e API_KEY=none \
-e API_BASE=http://host.docker.internal:8880/v1 \
-p 3003:3003 \
-v openreader_docstore:/app/docstore \
ghcr.io/richardr1126/openreader-webui:latestNote: Requesting audio from the TTS API happens on the Next.js server not the client. So the base URL for the TTS API should be accessible and relative to the Next.js server. If it is in a Docker you may need to use
host.docker.internalto access the host machine, instead oflocalhost.
Visit http://localhost:3003 to run the app and set your settings.
Note: The
openreader_docstorevolume is used to store server-side documents. You can mount a local directory instead. Or remove it if you don't need server-side documents.
- Set the TTS Provider and Model in the Settings modal
- Set the TTS API Base URL and API Key if needed (more secure to set in env vars)
- Select your model's voice from the dropdown (voices try to be fetched from TTS Provider API)
docker stop openreader-webui && \
docker rm openreader-webui && \
docker pull ghcr.io/richardr1126/openreader-webui:latestA complete example docker-compose file with Kokoro-FastAPI and OpenReader WebUI is available in examples/docker-compose.yml. You can download and use it:
mkdir -p openreader-compose
cd openreader-compose
curl -O https://raw.githubusercontent.com/richardr1126/OpenReader-WebUI/main/examples/docker-compose.yml
docker compose up -dOr add OpenReader WebUI to your existing docker-compose.yml:
services:
openreader-webui:
container_name: openreader-webui
image: ghcr.io/richardr1126/openreader-webui:latest
environment:
- API_BASE=http://host.docker.internal:8880/v1
ports:
- "3003:3003"
volumes:
- docstore:/app/docstore
restart: unless-stopped
volumes:
docstore:- Node.js & npm or pnpm (recommended: use nvm for Node.js) Optionally required for different features:
- FFmpeg (required for audiobook m4b creation only)
- On Linux:
sudo apt install ffmpeg - On MacOS:
brew install ffmpeg
- On Linux:
- libreoffice (required for DOCX files)
- On Linux:
sudo apt install libreoffice - On MacOS:
brew install libreoffice
- On Linux:
-
Clone the repository:
git clone https://github.com/richardr1126/OpenReader-WebUI.git cd OpenReader-WebUI -
Install dependencies:
With pnpm (recommended):
pnpm install
Or with npm:
npm install
-
Configure the environment:
cp template.env .env # Edit .env with your configuration settingsNote: The base URL for the TTS API should be accessible and relative to the Next.js server
-
Start the development server:
With pnpm (recommended):
pnpm dev
Or with npm:
npm run dev
or build and run the production server:
With pnpm:
pnpm build pnpm start
Or with npm:
npm run build npm start
Visit http://localhost:3003 to run the app.
For feature requests or ideas you have for the project, please use the Discussions tab.
If you encounter issues, please open an issue on GitHub following the template (which is very light).
Contributions are welcome! Fork the repository and submit a pull request with your changes.
This project would not be possible without standing on the shoulders of these giants:
- Kokoro-82M model
- Orpheus-TTS model
- Kokoro-FastAPI
- Orpheus-FastAPI
- react-pdf npm package
- react-reader npm package
- linux/amd64 (x86_64)
- linux/arm64 (Apple Silicon, Raspberry Pi, SBCs, etc.)
- Framework: Next.js (React)
- Containerization: Docker
- Storage: IndexedDB (in browser db store)
- PDF:
- EPUB:
- Markdown/Text:
- UI:
- TTS: (tested on)
- Deepinfra API (Kokoro-82M, Orpheus-3B, Sesame-1B)
- Kokoro FastAPI TTS
- Orpheus FastAPI TTS
- NLP: compromise NLP library for sentence splitting
This project is licensed under the MIT License.