
In the context of state machines and automata, Intel Hyperscan might be a better reference point, but the idea is the same. With a trivial PoC using Python wrappers over SIMD libraries, one can get a 3x speedup over the native `wc` CLI on a modern CPU, memory-mapping the input from a very average SSD: https://github.com/ashvardanian/StringZilla/tree/main/cli


Sorry, but your `wc` implementation does nothing to detect words; it just counts the spaces. Of course you don't need a state machine for that!
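To make the difference concrete, here is a minimal sketch in plain Python. It is not the code from the linked repository, and the function names `count_spaces` and `count_words` are made up for illustration. Counting spaces is a stateless scan, while counting words needs one bit of state (are we inside a word or not), which is the smallest possible state machine. The whitespace set below is an assumption covering ASCII only.

```python
# Hypothetical illustration, not taken from StringZilla or from `wc`.

# ASCII whitespace bytes (assumption: no locale or Unicode handling).
WHITESPACE = frozenset(b" \t\n\r\v\f")


def count_spaces(data: bytes) -> int:
    """Stateless: counts every space byte, regardless of context."""
    return data.count(b" ")


def count_words(data: bytes) -> int:
    """Two-state machine: counts transitions from whitespace into a word."""
    words = 0
    in_word = False  # the single bit of state
    for byte in data:
        if byte in WHITESPACE:
            in_word = False
        elif not in_word:
            # First non-whitespace byte after whitespace starts a new word.
            in_word = True
            words += 1
    return words


if __name__ == "__main__":
    sample = b"  hello   world\n"
    print(count_spaces(sample), count_words(sample))
```

The two numbers diverge as soon as the input has leading whitespace, runs of consecutive spaces, or other separators such as tabs and newlines.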



