Any ideas on how to add Ampere support? I have a use case in mind that I would l...

deckar01 · 2026-01-08T19:10:18 1767899418

Magpie-TTS needs a kernel compiled targeting Ampere, but it appears to be closed source. It was compiled for the 2018 T4, but not 2020-2024 consumer cards, just 2025 consumer cards.

nsbk · 2026-01-11T16:20:55 1768148455

I actually forked the repo, modified the Dockerfile and build/run scripts targeting Ampere and the whole setup is running seamlessly on my 3090, Magpie is running fine and using under 3Gb of memory, ~2Gb for nemotron STT, and ~18Gb for Nemotron Nano 30b. Latencies are great and the turn detection works really well!

I'm going to use this setup as the base for a language learning App for my gf :)

deckar01 · 2026-01-25T01:45:01 1769305501

I got your fork working (also on a 3090). I was not impressed with the latency or the recommended LLM’s quality.

nsbk · 2026-01-25T09:21:35 1769332895

Make sure you’re using the nemotron-speech asr model. I added support for Spanish via Canary models but these have like 10x the latency: 160ms on nemotron-speech vs 1.5s canary.

For the LLM I’m currently using Mistral-Small-3.2-24B-Instruct instead of Nemotron 3 and it works well for my use case