A lightweight Windows system tray application that converts speech to text using OpenAI's Whisper API. Perfect for code-switching between Hindi and English!
- π΅ System Tray Application - Runs silently in background
- π Easy API Key Setup - Set your API key directly from the tray menu
- π― Two Output Modes:
- Hinglish (Roman): Preserves Hindi words in Roman script
- English: Translates everything to English
- β¨οΈ Global Hotkey -
Ctrl+Space
to start/stop recording - π Auto-Paste - Transcribed text automatically pastes at cursor
- π΄ Visual Feedback - Icon changes color when recording
- β±οΈ Long Recordings - Up to 5 minutes per session
- πΎ Settings Persistence - Remembers your API key and preferences
- Windows 10/11
- Python 3.8+
- OpenAI API Key (Get one here)
-
Clone the repository
git clone https://github.com/mvijay24/whisper-paste.git cd whisper-paste
-
Install dependencies
setup.bat
-
Run the application
start_silent.vbs
Or simply double-click
start_silent.vbs
for completely silent startup! -
Set your API key
- Right-click the tray icon
- Select "π Set API Key..."
- Enter your OpenAI API key
- Click Save
- Look for the mic icon in your system tray (near clock)
- Right-click the icon to access settings:
- Set/Update API Key
- Choose output mode (Hinglish or English)
- Press
Ctrl+Space
to start recording - Speak in Hindi, English, or Hinglish
- Press
Ctrl+Space
again to stop - Text automatically pastes at your cursor position!
- π Set API Key... - Add or update your OpenAI API key
- API Status - Shows connection status (β Connected or
β οΈ No API Key) - π Output Mode - Choose between Hinglish (Roman) or English
- π Enable and Display Debug Panel - Shows real-time execution logs
- π Session Log Summary - Shows detailed report after each recording
- Exit - Properly closes the application
Hinglish Mode:
- You say: "Bhai ye file jaldi bhej de"
- Output:
bhai ye file jaldi bhej de
English Mode:
- You say: "Bhai ye file jaldi bhej de"
- Output:
brother send this file quickly
To create a standalone .exe
file:
build.bat
The executable will be created in the dist
folder.
whisper-paste/
βββ whisper_tray.pyw # Main application (no console window)
βββ start_silent.vbs # Silent launcher
βββ start.bat # Standard launcher
βββ restart.bat # Kill old instances & restart
βββ setup.bat # Install dependencies
βββ build.bat # Build executable
βββ icon.ico # Application icon
βββ settings.json # Saved settings (auto-created)
βββ README.md # This file
Settings are automatically saved in settings.json
:
{
"output_mode": "hinglish",
"api_key": "sk-..."
}
Can't see the tray icon?
- Click "Show hidden icons" arrow in system tray
- Use
restart.bat
to kill old instances
API Key issues?
- Ensure you have a valid OpenAI API key
- Check your API usage limits at OpenAI Dashboard
No audio recorded?
- Check microphone permissions in Windows Settings
- Ensure default microphone is set correctly
Text not pasting or cursor errors?
- Close clipboard managers like BeefText, Ditto, or ClipboardFusion - they interfere with paste functionality
- Disable any text expander software temporarily
- If you see "[WinError 1402] Invalid cursor handle", it's likely due to clipboard manager interference
- Uses OpenAI's Whisper API
- Approximately $0.006 per minute of audio
- See OpenAI Pricing
The small popup notification that shows transcribed text is called a "toast" because:
- It "pops up" like bread from a toaster
- It appears briefly and then disappears
- Common UI term from Android/Windows for temporary notifications
- Shows at the corner of the screen without interrupting workflow
Feel free to open issues or submit pull requests!
MIT License - feel free to use this in your projects!
- OpenAI for the amazing Whisper API
- The Python community for excellent libraries
- Special thanks to the Hinglish-speaking community!
Made with β€οΈ for the Hinglish-speaking developers!