I’ve been using Speech Note (github link) for months, but it often gets things wildly wrong.
I thought it was my mic, so I got one that’s crystal clear. I also tried a ton of different models, and other than being slow (or fast), their accuracy is usually pretty similar.
But I’m still needing to take a lot of time to edit the results, and I wonder if there’s something I should be doing to get better results.
On other speech-to-text platforms (like Futo keyboard on Android), the results are fast and very accurate. I have a hard time believing that Speech Note can’t be as good.
Can any other users share their experience?
UPDATE: Ok, the best model that I’ve found for Speech Note is the WhisterCpp FUTO English-244, which, funny enough, is the model I use on Futo Keyboard for Android. It’s not the fastest, but fast enough. It is quite accurate, and that means less time editing text.
Had enough issues with it to not find it helpful. But I’m not a native English speaker and support for my local language is so-so, so might as well be me that’s the problem.