Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Tesseract can manage 99% accuracy on anything other than handwriting. Without being an LLM.

Is there an advantage of using an LLM here?



I'm really curious about this too! I don't know!

There's some comments I've run across saying Qwen2.5-VL's really good at handwriting recognition.

It'd also be interesting to see how Tesseract compares when trying to OCR more mixed text+graphic media. Some possible examples: high-design magazines with color backgrounds, TikTok posts, maps, cardboard hold-up signs at political gatherings.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: