You know, some people grow up in untrustworthy environments and autodidact their way to something like first-principles thinking, and depending on how things shake out, you might end up only believing what you've personally seen with your own eyes. And well, Earth looks pretty flat in daily life.
Forks are easy for GitHub to shut down all at once. What you really want is to upload the code as a new repo (ideally under a different name from the original). Though in practice it shouldn't be too hard for GitHub to detect that an uploaded codebase matches one that was taken down, if they wanted to.
It's just not binary. Today's world is dominated by capitalist competition, and a lot of people earn a living by competing with their labor. If AI + robots can do that labor better, cheaper, and faster, most (90%+) of today's jobs are gone with no obvious replacement.
"In this scaffold, several other models were able to solve the problem as well: Opus 4.6 (max), Gemini 3.1 Pro, and GPT-5.4 (xhigh)."
I find that very surprising. This problem seemed out of reach 3 months ago, but now all three frontier models are able to solve it.
Is everybody distilling each other's models? Are companies selling the same data and RL environments to all the big labs? Can anybody more involved share some rumors? :P
I do believe that AI can solve hard problems, but the fact that progress is so evenly distributed across labs in such a narrow domain makes me a bit suspicious that there's a hidden factor. Like, did some "data worker" solve a problem like this, and is it now in the training data?
Yes there's a whole ecosystem of companies that create and sell RL gyms to AI labs and of course they develop their own internally too. You don't hear much about this ecosystem because RL at scale is all private. Nearly no academic research on it.
A lot of this is probably just throwing roughly equal amounts of compute at continuous RLVR training. I'm not convinced there's any big research breakthrough separating GPT 5.4 from 5.2. The difference is probably more than just new checkpoints but less than architectural changes, and closer to the former than the latter.
I think it's just easy to underestimate how much impact continuous training+scaling can have on the underlying capabilities.
Is it possible the AI labs are seeding their models with these solved problems? Like, if I were Sam Altman with a bazillion dollars of investment, I would pay some mathematicians to solve some of these problems so the models could "solve" them later on. Not that I think that's what's happening here, of course...
But it is pretty funny how 5.4 miscounted the number of 1's in 18475838184729 on the same day it solved this.
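For reference, counting digits is the kind of thing that's trivial to verify programmatically (a one-liner sketch, just to make the contrast concrete):

```python
# Count occurrences of the digit '1' in the number from the comment above.
n = "18475838184729"
count = n.count("1")
print(count)  # → 2
```

Any model with tool access could run exactly this instead of "eyeballing" the digits, which is what makes the miscount funny.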
As a budget user who only uses the $20/month subscriptions, I started with Claude Code as my main tool and Codex as backup for when the 5-hour quota was exhausted.
Then I saw that Codex worked better for me and cancelled my Claude Code subscription. Now, for my moderate use (4-5 hours a day, no parallel agents), the $20 Codex plan plus free AMP is enough if I want to save some weekly quota.
But honestly I usually have enough usage to last the full week without using AMP.
I liked how it read. Not as a perfectly thought-out post, but more as an ongoing conversation.
These are confusing times for engineers as the automators can now automate themselves away at even greater speed. Reminding ourselves to play positive sum games seems relevant.
The cake is too small to divide with humans and AI. We all feel that. Time to make more cakes :)
tl;dr: the author argues it's closer to $500 USD per month IF a user hits their weekly rate limits every week.
Which is probably a lot more accurate than other claims. However, it's also true that anybody who has to use the API might pay that much, creating a real cost-per-token moat for Anthropic's Claude Code vs. other models, as long as it stays so far ahead in productivity.