
This tool requires ffmpeg, but don't forget that the latest version of ffmpeg has speech-to-text built in!

I'm sure there are use cases where using Whisper directly is better, but it's a great addition to an already versatile tool.



I was going to go the opposite way and suggest that if you want Python audio transcription, you can skip ffmpeg and just use Whisper directly. The whisper module gives you a variety of output formats, including plain text and SRT.
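
For anyone who wants to try that, here is a minimal sketch, assuming the `openai-whisper` package and its `load_model` / `transcribe` calls. The helper names (`format_timestamp`, `segments_to_srt`, `transcribe`) and the file name in the usage note are made up for illustration. The SRT formatting is done by hand from the returned segments rather than with the package's own writers. One caveat: as far as I know, the package still calls the ffmpeg binary under the hood to decode audio, so ffmpeg likely needs to be installed either way.

```python
from typing import Iterable, Mapping


def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments: Iterable[Mapping]) -> str:
    """Turn segments with 'start', 'end' and 'text' keys into SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # Joining with a newline leaves one blank line between cues.
    return "\n".join(blocks)


def transcribe(path: str, model_name: str = "base") -> dict:
    """Hypothetical wrapper: run Whisper on a file and return its result dict."""
    import whisper  # imported lazily so the helpers above work without it

    model = whisper.load_model(model_name)
    return model.transcribe(path)
```

Usage would look something like `result = transcribe("episode.mp3")`, then `result["text"]` for the plain transcript and `segments_to_srt(result["segments"])` for subtitles.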


Yep. Whisper is great. I use it on podcasts as part of removing ads. The last time I used one of the official versions, it would only accept .wav files, so I had to convert with ffmpeg first.
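
The conversion step can be sketched roughly like this, calling the ffmpeg CLI from Python. The target format (16 kHz, mono, 16-bit PCM) is an assumption on my part: it is what whisper.cpp asks for, and the comment above does not say which version was used. The function names are hypothetical.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(src: str, dst: str) -> list[str]:
    """Build an ffmpeg command that writes 16 kHz mono 16-bit PCM WAV."""
    return [
        "ffmpeg",
        "-y",                # overwrite the output file if it already exists
        "-i", src,           # input file (mp3, m4a, ...)
        "-vn",               # ignore any video or cover-art stream
        "-ac", "1",          # downmix to mono
        "-ar", "16000",      # resample to 16 kHz (assumed target rate)
        "-c:a", "pcm_s16le", # 16-bit little-endian PCM
        dst,
    ]


def convert_to_wav(src: str) -> str:
    """Convert src to a .wav file next to it and return the new path."""
    if Path(src).suffix.lower() == ".wav":
        return src  # already a WAV file; avoid overwriting the input
    dst = str(Path(src).with_suffix(".wav"))
    subprocess.run(build_ffmpeg_command(src, dst), check=True)
    return dst
```

`check=True` makes a failed conversion raise `subprocess.CalledProcessError` instead of silently handing a missing file to the transcription step.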



