
You can get it to run fairly fast on modern hardware. For example, running a text extraction, tokenization and POS-tagging workflow on a quarter billion documents on PC hardware takes about 24-36 hours. That's doable and affordable. But ML-adjacent methods are not: they require far too much GPU compute, and I have no A100s :-/
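
For a rough sense of scale, here is a back-of-the-envelope sketch of the average throughput those numbers imply. The function name is made up for illustration, and it assumes the 24-36 hours is wall-clock time for the whole run on one machine:

```python
def docs_per_second(total_docs: int, hours: float) -> float:
    """Average throughput needed to finish total_docs in the given wall-clock hours."""
    return total_docs / (hours * 3600)


TOTAL_DOCS = 250_000_000  # "a quarter billion documents"

fast = docs_per_second(TOTAL_DOCS, 24)  # best case from the comment
slow = docs_per_second(TOTAL_DOCS, 36)  # worst case from the comment

print(f"24 h run: ~{fast:.0f} docs/s")
print(f"36 h run: ~{slow:.0f} docs/s")
```

So roughly two to three thousand documents per second sustained, which is the budget any heavier per-document method would have to fit into on the same hardware.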

