
I disagree that there isn't an innovation.

The technology for reasoning models is the ability to do RL on verifiable tasks, with some (as-of-yet unpublished, but well-known) search over reasoning chains, using a (presumably neural) reasoning fragment proposal machine and a (presumably neural) scoring machine for those reasoning fragments.
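To make the shape of that concrete, here is a rough toy sketch of "propose fragments, score them, search over chains". Everything in it is an assumption for illustration: the actual search is unpublished, so the beam search, the `propose` and `score` stubs (hand-written here, where the real ones would presumably be neural), and the little arithmetic task are all made up and not any lab's method.

```python
from typing import Callable, List, Tuple

Chain = Tuple[str, ...]  # a reasoning chain is a sequence of fragments

def beam_search(
    propose: Callable[[Chain], List[str]],
    score: Callable[[Chain], float],
    beam_width: int = 2,
    depth: int = 3,
) -> Chain:
    """Keep the `beam_width` best partial chains at each step."""
    beams: List[Chain] = [()]
    for _ in range(depth):
        # Extend every surviving chain with every proposed fragment.
        candidates = [c + (frag,) for c in beams for frag in propose(c)]
        if not candidates:
            break
        candidates.sort(key=score, reverse=True)
        beams = candidates[:beam_width]
    return max(beams, key=score)

# Hypothetical toy task: start at 1 and reach TARGET using "+1" and "*2".
TARGET = 8

def evaluate(chain: Chain) -> int:
    """Apply the chain's operations to the starting value 1."""
    value = 1
    for frag in chain:
        value = value + 1 if frag == "+1" else value * 2
    return value

def propose(chain: Chain) -> List[str]:
    # Stand-in for the fragment proposal machine.
    return ["+1", "*2"]

def score(chain: Chain) -> float:
    # Stand-in for the scoring machine: closer to TARGET is better.
    return -abs(TARGET - evaluate(chain))

best = beam_search(propose, score)
print(best, evaluate(best))
```

The "verifiable" part is that `evaluate` can check the final answer exactly, which is what would let you turn the outcome into an RL reward in the first place.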

The technology for agents is effectively the same, with some currently-in-R&D way to scale the training architecture for longer-horizon tasks. ChatGPT agent or o3/o4-mini are likely the first published models that take advantage of this research.

It's fairly obvious that this is the direction that all the AI labs are going if you go to SF house parties or listen to AI insiders like Dwarkesh Patel.



Fair enough I guess, even though the concept of agents/agentic tasks popped up before reasoning models were really a thing.


The idea of chatbots existed before ChatGPT, does that mean it's purely marketing hype?



