Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Hardcode by whom? Who do we trust with this task to do it correctly? Another LLM that suffers from the same fundamental flaw or by a low paid digital worker in a developing country? Because that's the current solution. And who's gonna pay for all that once the dumb investment money runs out, who's gonna stick around after the hype?


By the LLM team (Grok team, in this case). I don't mean for the LLM to be sentient enough to know it doesn't know the answer, I mean for the LLM to identify what is being asked of it, and checking to see if that's something on the 'blacklist of actions I cannot do yet', said list maintained by humans, before replying.

No different than when asking ChatGPT to generate images or videos or whatever before it could, it would just tell you it was unable to.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: