Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

A bit of historical trivia: OpenAI disabled prefill in 2023 as a safety precaution (e.g., potential jailbreaks like " genocide is good because"), but Anthropic kept prefill around partly because they had greater confidence in their safety classifiers. (https://www.lesswrong.com/posts/HE3Styo9vpk7m8zi4/evhub-s-sh...).


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: