A small Windows tray application for speech-to-text dictation.
The application records microphone audio on demand, transcribes it locally using an AI model, and copies the resulting text to the clipboard.
No API keys or cloud services are required.
Just a note: AI wrote this readme. It seems fine]
I just wanted to build a quick speech-to-text utility for myself and ChatGPT helped make this python script. It lives in your system tray and simply lets you dictate and outputs text.
I've been using it all day and it's working very well.
- Windows 10 or Windows 11
- Python 3.10 or newer
- A working microphone
git clone https://github.com/YOUR_USERNAME/dictation-tray.git
cd dictation-traypip install pyside6 sounddevice numpy pyperclip faster-whisperFFmpeg is required for audio processing.
Using winget:
winget install Gyan.FFmpegRestart your terminal after installing FFmpeg.
python dictate_tray.py- A window will open
- A system tray icon will appear
- Closing the window hides it to the tray instead of exiting
- Click Start
- Speak into the microphone
- Click Stop
- The transcribed text appears in the window and is copied to the clipboard
Paste the text anywhere using Ctrl + V.
Configuration is defined in the Config class at the top of dictate_tray.py.
class Config:
model_size = "small"
device = "cpu"
compute_type = "int8"
language = "en"model_size:tiny,base,small,medium,large-v3device:cpuorcudalanguage: set toNonefor auto-detection
- Transcription is performed locally
- Audio is not uploaded or stored
- The application only records when explicitly started
MIT License