Comments
If this article inspires you, feel free to send me an email in response.
I’ve recently discovered Handy, a small speech-to-text (dictation) application, that is open source and can use only local AI models on Linux, preserving the privacy of what I’m saying out loud by not sending it to some cloud servers. I found it handy (lol), so I figured that I could share some hardships that I’ve run into while setting it up on ArchLinux.
Handy is a speech-to-text application, which means you can talk out loud and it will render the text on your screen after you’re done. This is especially nice if you are a quick talker but a slow typer; and while I consider that I would be one, I can still type quickly enough, but speech is so much faster and nicer, as a lazy interface. You can learn more about Handy on its official website, notably how you can install it and so on.
Here’s a small list of all the steps that I’ve had to do for setting it up for ArchLinux:
handy-bin. For instance, with yay, that would be yay -S handy-bin.handy‘s settings, I’ve chosen the Parakeet V3 model, which seems to be the best combination of speed and accuracy.gtk-layer-shell that I’ve installed with yay, which fixed it: yay gtk-layer-shell.handy --toggle-transcription. The trick is that it needs to be run once at the beginning of the speech to text transcription, and then once again when you want to stop, as it’s a toggle, but that’s fine.So I’m happy to report that after all these steps, this is working just fine 🥳 In fact, I’ve used mostly Handy to write this blog post, which is kind of a delight for me! I am not quite sure that this has been faster than typing by hand, but this has been more pleasant, for sure. It is still a bit of a pain when compared to the experience that I’ve had on macOS where I could install a single application called OpenWhispr 2, and it just worked from the start, showing me the output in real-time (whereas on Linux it’s displayed after stop-toggling the transcription). Well, Linux on the desktop can be quite an adventure, am I right? At least I do have a proper working solution now, which makes me happy, and probably will help me write ~~~more emails~~~ blog posts in the future.
Thanks to Jolivier who told me about its existence on Mastodon!
If this article inspires you, feel free to send me an email in response.