the avatar image of Benjamin Bouvier

Handy: privacy-preserving open-source speech-to-text

I’ve recently discovered Handy, a small speech-to-text (dictation) application, that is open source and can use only local AI models on Linux, preserving the privacy of what I’m saying out loud by not sending it to some cloud servers. I found it handy (lol), so I figured that I could share some hardships that I’ve run into while setting it up on ArchLinux.

Handy is a speech-to-text application, which means you can talk out loud and it will render the text on your screen after you’re done. This is especially nice if you are a quick talker but a slow typer; and while I consider that I would be one, I can still type quickly enough, but speech is so much faster and nicer, as a lazy interface. You can learn more about Handy on its official website, notably how you can install it and so on.

Here’s a small list of all the steps that I’ve had to do for setting it up for ArchLinux:

So I’m happy to report that after all these steps, this is working just fine 🥳 In fact, I’ve used mostly Handy to write this blog post, which is kind of a delight for me! I am not quite sure that this has been faster than typing by hand, but this has been more pleasant, for sure. It is still a bit of a pain when compared to the experience that I’ve had on macOS where I could install a single application called OpenWhispr 2, and it just worked from the start, showing me the output in real-time (whereas on Linux it’s displayed after stop-toggling the transcription). Well, Linux on the desktop can be quite an adventure, am I right? At least I do have a proper working solution now, which makes me happy, and probably will help me write ~~~more emails~~~ blog posts in the future.

Thanks to Jolivier who told me about its existence on Mastodon!

  1. by opening the GNOME Settings application, then keyboard, then “view and customize shortcuts”, then custom shortcuts.

  2. It’s unclear the application is free forever, even though it’s using local AI models as well, I think that I might be limited to two thousand words per week.