Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It does increase the “error” (meaning it is less likely to predict the next word when compared against a dataset) but the losses are lower than your intuition would guide you to believe.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: