OpenReader WebUI 📄🔊

OpenReader WebUI is a document reader with Text-to-Speech capabilities, offering a TTS read along experience with narration for EPUB, PDF, TXT, MD, and DOCX documents. It supports multiple TTS providers including OpenAI, Deepinfra, and custom OpenAI-compatible endpoints like Kokoro-FastAPI and Orpheus-FastAPI

🎯 Multi-Provider TTS Support:
- OpenAI: tts-1, tts-1-hd, gpt-4o-mini-tts models with voices (alloy, echo, fable, onyx, nova, shimmer)
- Deepinfra: Kokoro-82M, Orpheus-3B, Sesame-1B models with extensive voice libraries
- Custom OpenAI-Compatible: Any OpenAI-compatible endpoint with custom voice sets
💾 Local-First Architecture: Uses IndexedDB browser storage for documents
🛜 Optional Server-side documents: Manually upload documents to the next backend for all users to download
📖 Read Along Experience: Follow along with highlighted text as the TTS narrates
📄 Document formats: EPUB, PDF, TXT, MD, DOCX (with libreoffice installed)
🎧 Audiobook Creation: Create and export audiobooks from PDF and ePub files (in m4b format with ffmpeg and aac TTS output)
🎨 Customizable Experience:
- 🔑 Select TTS provider (OpenAI, Deepinfra, or Custom OpenAI-compatible)
- 🔐 Set TTS API base URL and optional API key
- 🎨 Multiple app theme options
- And more...

🛠️ Work in progress

Native .docx support (currently requires libreoffice)
Accessibility Improvements

🐳 Docker Quick Start

Prerequisites

Recent version of Docker installed on your machine
A TTS API server (Kokoro-FastAPI, Orpheus-FastAPI, Deepinfra, OpenAI, etc.) running and accessible

1. 🐳 Start the Docker container:

docker run --name openreader-webui \
  -p 3003:3003 \
  -v openreader_docstore:/app/docstore \
  ghcr.io/richardr1126/openreader-webui:latest

(Optionally): Set the TTS API_BASE URL and/or API_KEY to be default for all devices

docker run --name openreader-webui \
  -e API_KEY=none \
  -e API_BASE=http://host.docker.internal:8880/v1 \
  -p 3003:3003 \
  -v openreader_docstore:/app/docstore \
  ghcr.io/richardr1126/openreader-webui:latest

Note: Requesting audio from the TTS API happens on the Next.js server not the client. So the base URL for the TTS API should be accessible and relative to the Next.js server. If it is in a Docker you may need to use host.docker.internal to access the host machine, instead of localhost.

Visit http://localhost:3003 to run the app and set your settings.

Note: The openreader_docstore volume is used to store server-side documents. You can mount a local directory instead. Or remove it if you don't need server-side documents.

2. ⚙️ Configure the app settings in the UI:

Set the TTS Provider and Model in the Settings modal
Set the TTS API Base URL and API Key if needed (more secure to set in env vars)
Select your model's voice from the dropdown (voices try to be fetched from TTS Provider API)

3. ⬆️ Updating Docker Image

docker stop openreader-webui && \
docker rm openreader-webui && \
docker pull ghcr.io/richardr1126/openreader-webui:latest

(Alternate) 🐳 Configuration with Docker Compose and Kokoro-FastAPI

A complete example docker-compose file with Kokoro-FastAPI and OpenReader WebUI is available in examples/docker-compose.yml. You can download and use it:

mkdir -p openreader-compose
cd openreader-compose
curl -O https://raw.githubusercontent.com/richardr1126/OpenReader-WebUI/main/examples/docker-compose.yml
docker compose up -d

Or add OpenReader WebUI to your existing docker-compose.yml:

services:
  openreader-webui:
    container_name: openreader-webui
    image: ghcr.io/richardr1126/openreader-webui:latest
    environment:
      - API_BASE=http://host.docker.internal:8880/v1
    ports:
      - "3003:3003"
    volumes:
      - docstore:/app/docstore
    restart: unless-stopped

volumes:
  docstore:

Dev Installation

Prerequisites

Node.js & npm or pnpm (recommended: use nvm for Node.js) Optionally required for different features:
FFmpeg (required for audiobook m4b creation only)
- On Linux: sudo apt install ffmpeg
- On MacOS: brew install ffmpeg
libreoffice (required for DOCX files)
- On Linux: sudo apt install libreoffice
- On MacOS: brew install libreoffice

Steps

Clone the repository:

git clone https://github.com/richardr1126/OpenReader-WebUI.git
cd OpenReader-WebUI

Install dependencies:

With pnpm (recommended):
```
pnpm install
```
Or with npm:
```
npm install
```
Configure the environment:
```
cp template.env .env
# Edit .env with your configuration settings
```
Note: The base URL for the TTS API should be accessible and relative to the Next.js server
Start the development server:

With pnpm (recommended):
```
pnpm dev
```
Or with npm:
```
npm run dev
```
or build and run the production server:

With pnpm:
```
pnpm build
pnpm start
```
Or with npm:
```
npm run build
npm start
```
Visit http://localhost:3003 to run the app.

💡 Feature requests

For feature requests or ideas you have for the project, please use the Discussions tab.

🙋‍♂️ Support and issues

If you encounter issues, please open an issue on GitHub following the template (which is very light).

👥 Contributing

Contributions are welcome! Fork the repository and submit a pull request with your changes.

❤️ Acknowledgements

This project would not be possible without standing on the shoulders of these giants:

Docker Supported Architectures

linux/amd64 (x86_64)
linux/arm64 (Apple Silicon, Raspberry Pi, SBCs, etc.)

Stack

Framework: Next.js (React)
Containerization: Docker
Storage: IndexedDB (in browser db store)
PDF:
- react-pdf
- pdf.js
EPUB:
- react-reader
- epubjs
Markdown/Text:
- react-markdown
- remark-gfm
UI:
TTS: (tested on)
- Deepinfra API (Kokoro-82M, Orpheus-3B, Sesame-1B)
- Kokoro FastAPI TTS
- Orpheus FastAPI TTS
NLP: compromise NLP library for sentence splitting

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 235 Commits
.github		.github
docs		docs
examples		examples
public		public
src		src
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
empty-module.ts		empty-module.ts
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package.json		package.json
playwright.config.ts		playwright.config.ts
pnpm-lock.yaml		pnpm-lock.yaml
postcss.config.mjs		postcss.config.mjs
tailwind.config.ts		tailwind.config.ts
template.env		template.env
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenReader WebUI 📄🔊

🛠️ Work in progress

🐳 Docker Quick Start

Prerequisites

1. 🐳 Start the Docker container:

2. ⚙️ Configure the app settings in the UI:

3. ⬆️ Updating Docker Image

(Alternate) 🐳 Configuration with Docker Compose and Kokoro-FastAPI

Dev Installation

Prerequisites

Steps

💡 Feature requests

🙋‍♂️ Support and issues

👥 Contributing

❤️ Acknowledgements

Docker Supported Architectures

Stack

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OpenReader WebUI 📄🔊

🛠️ Work in progress

🐳 Docker Quick Start

Prerequisites

1. 🐳 Start the Docker container:

2. ⚙️ Configure the app settings in the UI:

3. ⬆️ Updating Docker Image

(Alternate) 🐳 Configuration with Docker Compose and Kokoro-FastAPI

Dev Installation

Prerequisites

Steps

💡 Feature requests

🙋‍♂️ Support and issues

👥 Contributing

❤️ Acknowledgements

Docker Supported Architectures

Stack

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages