I’ve been using Speech Note (github link) for months, but it often gets things wildly wrong.

I thought it was my mic, so I got one that’s crystal clear. I also tried a ton of different models, and other than being slow (or fast), their accuracy is usually pretty similar.

But I’m still needing to take a lot of time to edit the results, and I wonder if there’s something I should be doing to get better results.

On other speech-to-text platforms (like Futo keyboard on Android), the results are fast and very accurate. I have a hard time believing that Speech Note can’t be as good.

Can any other users share their experience?

UPDATE: Ok, the best model that I’ve found for Speech Note is the WhisterCpp FUTO English-244, which, funny enough, is the model I use on Futo Keyboard for Android. It’s not the fastest, but fast enough. It is quite accurate, and that means less time editing text.

  • DrDystopia@lemy.lol
    link
    fedilink
    arrow-up
    4
    ·
    12 days ago

    Had enough issues with it to not find it helpful. But I’m not a native English speaker and support for my local language is so-so, so might as well be me that’s the problem.