
I tried running it in realtime with live audio input (kind of).

If you want to give it a shot, you can find the python script in this repo: https://github.com/tobiashuttinger/openai-whisper-realtime

A bit more context on how it works: the system's default audio input is captured with Python, split into small chunks, and fed to OpenAI's original transcription function. It tries (currently rather poorly) to detect word breaks and avoids splitting the audio buffer mid-word. Given how the model is designed, this isn't the most natural way to use it, but I figured it was worth trying. It works acceptably well.
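For illustration, the word-break idea could be approximated with a simple energy heuristic: split the buffer at its quietest frame, so a chunk handed to the transcriber is less likely to end mid-word. This is a hypothetical sketch (the actual script's logic may differ; the frame length is illustrative):

```python
import numpy as np

def best_split_point(buffer: np.ndarray, frame_len: int = 1600) -> int:
    """Return the start index of the quietest frame in the buffer.

    Cutting the audio at a low-energy frame approximates cutting at
    a pause between words, so the chunk sent for transcription is
    less likely to end in the middle of a word.
    """
    n_frames = len(buffer) // frame_len
    energies = [
        float(np.mean(buffer[i * frame_len:(i + 1) * frame_len] ** 2))
        for i in range(n_frames)
    ]
    quietest = int(np.argmin(energies))
    return quietest * frame_len
```

For example, a buffer of noise with a silent gap in the middle would be split at the start of the gap.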



Haven’t tried it yet but love the concept!

Have you thought of using VAD (voice activity detection) for breaks? Back in my day (a long time ago) the webrtc VAD stuff was considered decent:

https://github.com/wiseman/py-webrtcvad

Model isn’t optimized for this use but I like where you’re headed!


Interesting. I'll take a look at this, thanks!





impressive



