Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
oidar
4 months ago
|
parent
|
context
|
favorite
| on:
Show HN: Python Audio Transcription: Convert Speec...
What's the best solution right now for TTS that supports speaker diarisation?
makaimc
4 months ago
|
next
[–]
AssemblyAI (YC S17) is currently the one that stands out in the WER and accuracy benchmarks (
https://www.assemblyai.com/benchmarks
). Though its models are accessed through a web API rather than locally hosted, and speaker diarization is enabled through a parameter in the API call (
https://www.assemblyai.com/docs/speech-to-text/pre-recorded-...
).
xnx
4 months ago
|
prev
[–]
I like this version of Whisper which has diarization built in:
https://github.com/Purfview/whisper-standalone-win
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: