Dictation app for the terminal and Neovim, using Whisper for transcription and ChatGPT for post-processing.
You can use the installation script:
curl https://raw.githubusercontent.com/olekli/DrDictaphone/main/script/install.sh | sh
Or create a virtual environment and do:
pip install drdictaphone
python -m drdictaphone.cli install ~/DrDictaphone
Place OpenAI API key in ~/DrDictaphone/config/openai_api_key.
To start the standalone app, do ./drdictaphone.
To start only the server, do ./drdictaphone server.
Shutdown a running server by doing ./drdictaphone shutdown.
If you are not already using Python plugins in Neovim,
you need to create a virtual environment for Neovim to use.
Tell Neovim about it by adding to your init.vim:
let g:python3_host_prog = '~/.neovim-venv/bin/python'
(Or wherever your venv is located.)
Inside this virtual environment, install the Neovim plugin:
pip install drdictaphone-neovim-plugin
Now you need to add the plugin to your Neovim config directory:
ln -s ~/.neovim-venv/lib/python3.11/site-packages/drdictaphone_neovim/DrDictaphone.py ~/.config/nvim/rplugin/python3/.
(Your paths may vary.)
Then start the server. Do :UpdateRemotePlugins once in Neovim, restart. Use DrDictaphoneSetProfile, DrDictaphoneToggle vim commands.
s: select profilep: start / stop and transcribe recordingd: stop and discard recordingq: exit
Profiles consist of:
topicfor transcribing and post-processing, a list of stringslanguageto use for the transcriber, a stringoutputdirectory, a stringoutput_commandto pipe output toenable_vadwhether or not to enable VAD, a bool, defaults tofalse
Output will be written to a timestamped file in the output directory.
VAD will filter recordings for parts with voice before processing them.
The Post-Processor specs consist of:
instructionsfor the post-processor, either a filename to load from or a list of stringsgpt_modelto use for post-processing, either a filename to load from or an objectoptionsto use for post-processing, either a filename to load from or an objecttoolsto use for the post-processor, either a filename to load from or an object
The context for the post-processor is built from the profile and the post-processor specs. Settings in the profile take precedence over settings in the specs.