Hacker Newsnew | past | comments | ask | show | jobs | submit | extr's commentslogin

I think you are kidding if you think you are going to be remotely approximately the quantity/quality of output you get from a $100/max sub with Zed/Openrouter. I easily get $1K+ of usage out of my $100 max sub. And that's with Opus 4.6 on high thinking.

For personal use I've noticed Claude (via the web-based chat UI) making really bizarre mistakes lately like ignoring input or making completely random assumptions. At work Claude Code has turned into an absolute dog. It fails to follow instructions and builds stuff like a lazy junior developer without any architecture, tests, or verification. This is even with max effort, Opus 4.6, multiple agents, early compaction, etc. I don't know what they did but Anthropic's quality lead has basically evaporated for me. I hope they fix it because I've since adapted my project's Claude artifacts for use with Codex and started using it instead - it feels like Claude Code did earlier this year.

I'd like to give the new GLM models a try for personal stuff.


Same, I'm looking hard for an alternative to what I had.

And I'm seeing the same thing in my sphere- everyone is bailing Anthropic the past few weeks. I figure that's why we're seeing more posts like this.

I hope they're paying attention.


I've noticed the same thing, and even done side by side tests where I compare Claude Code with Cursor both running Opus 4.6.

It seems Cursor somehow builds a better contextual description of the workspace, so the model knows what I'm actually trying to achieve.

The problem is that with Cursor I'm paying per-token, so as GP suggested you can easily spend $100+ per month vs $20 on Claude Code.


I saw this immediately with 4.6 and dumped back to 4.5 because I actually asked it wtf it was doing and it's response was "being lazy"

> At work Claude Code has turned into an absolute dog.

Could it be related to this?: https://news.ycombinator.com/item?id=47660925


Some of the newer models available on OpenRouter are good, but I agree that none of them are a replacement for Opus 4.6 for coding.

If you're trying to minimize cost then having one of the inexpensive models do exploratory work and simple tasks while going back to Opus for the serious thinking and review is a good hybrid model. Having the $20/month Claude plan available is a good idea even if you're primarily using OpenRouter available models.

I think trying to use anything other than the best available SOTA model for important work is not a good tradeoff, though.


I've been thinking of doing this — using one of the "pretty good but not Opus 4.6-good, YET very cheap" models for the implementation part of more basic code features, AFTER first using Opus 4.6 high for the planning stage.

Do you think this would be a decent approach?

Also, which client would I use for this? OpenCode? I don't think Claude Code supports using other models. Thoughts?


I have been doing this and the results have been fairly good.

I use claude to build requirements.md -> implementation.md -> todo.md. Then I tell opencode + openrouter to read those files and follow the todo using a cheap (many times free) model.

It works 90% of the time. The other 10% it will get stuck, in which case I revert to claude.

That has allowed me to stay on the $20/month claude subscription as opposed to the $100.


> I easily get $1K+ of usage out of my $100 max sub. And that's with Opus 4.6 on high thinking.

And people keep claiming the token providers are running inference at a profit.


>And people keep claiming the token providers are running inference at a profit.

Not everyone gets $1K of usage, and you don't know how fat the per-token margins are. It's like saying the local buffet place is losing money because you eat $100 worth of takeout for $30.


> Not everyone gets $1K of usage, and you don't know how fat the per-token margins are.

Well, we're going to find out sooner rather than later. Right now you don't know how thin (or negative) the margins are, either, after all.

All we know for certain is how much VC cash they got. Revenue, spend, profit, etc calculated according to GAAP are still a secret.


In addition to usage distribution aspects others called out .

$1K is not actual cost, just API pricing being compared to subscription pricing. It is quite possible that API has a large operating margins, and say costs only $100 to deliver $1K worth of API credits.


Yes and when we say things like that we are not talking about plans. Running inference at a profit means api token use is run profitably. It’s a huge unknown what’s happening at the plan level, we know there is subsidy happening but in aggregate impossible to know if it’s profitable or not.

The model developers across the board stand by that most/all models are profitable by EOL, and losses come from R&D/Training.

Out of curiosity, how many tokens are people using? I checked my openrouter activity - I used about 550 million tokens in the last month, 320M with Gemini and 240M with Opus. This cost me $600 in the past 30 days. $200 on Gemini, $400 on Opus.

  My Claude Code usage stats after ~3 months of heavy use:

    Favorite model: Opus 4.6          Total tokens: 42.6m
    Sessions: 420                     Longest session: 10d 2h 13m
    Active days: 53/95                Longest streak: 16 days
    Most active day: Feb 9            Current streak: 4 days

    ~158x more tokens than Moby-Dick

  Monthly breakdown via claude-code-monitor (not sure how accurate this is):

    Month     Total Tokens     Cost (USD)
    2026-01     96,166,569       $112.66
    2026-02    340,158,917       $393.44
    2026-03  2,183,154,148     $3,794.51
    2026-04  1,832,917,712     $3,412.72
    ─────────────────────────────────────
    Total    4,452,397,346     $7,713.34

Yeah — I just created an anthropic API key to experiment with pi, and managed to spend $1 in about 30 minutes doing some basic work with Sonnet.

Extrapolating that out, the subscription pricing is HEAVILY subsidized. For similar work in Claude Code, I use a Pro plan for $20/month, and rarely bang up against the limits.


And it scales up - the $200 plan gets you something like 20x what the Pro plan gets you. I've never come close to hitting that limit.

It's obviously capital-subsidized and so I have zero expectation of that lasting, but it's pretty anti-competitive to Cursor and others that rely on API keys.


Ignoring the training costs, the marginal cost for inference is pretty low for providers. They are estimated to break even or better with their $20/month subscriptions.

That being said, they can't stop launching new models, so training is not a one time task. Therefore one might argue that it is part of the marginal cost.


I ran ccusage on my work Max account and I spend what would cost $300 a week if it was billed at API rates.

According to the meter, I used $15k in tokens with my Max plan (along with $5k of Codex tokens) in the last 30 days. That built an entire working and (lightly) optimized language, parser, compiler, runtime toolchain among other things.

Not everyone is just vibecoding everything and relying on agents running sota models to do anything tho.

I actually find Zed pretty reasonable in terms of memory usage. But yeah, like you say, there are lots of small UX/DX papercuts that are just unfortunate. In some cases I'm not sure it's even Zed's fault, it's just years and years of expecting things to work a certain way because of VS Code and they work differently in Zed.

Eg: Ctrl+P "Open Fol.." in Zed does not surface "Opening a Folder". Zed doesn't call them folders. You have to know that's called "Workspace". And even then, if you type "Open Work..." it doesn't surface! You have to purposefully start with "work..."


The issues you described show a critical lack of awareness from the Zed developers that people migrate to their IDE mainly from VS Code.

They are blowing their "weirdness budget" on nonsense.


I don't think it's conscious or even a result of not caring about UX/DX. But I do think you're right - I've noticed the loudest voices in their Issue queue are people wanting things like better vim support, helix keybind support (super niche terminal modal editor), etc. Fine if they want to make that their niche but if you are migrating from VS Code like 99% of people you can't have these kinds of papercuts, people will just uninstall.

I think explicit post-training is going to be needed to make this kind of approach effective.

As this repo notes is "The secret to good memory isn't remembering more. It's knowing what to forget." But knowing what is likely to be important in the future implies a working model of the future and your place in it. It's a fully AGI complete problem: "Given my current state and goals, what am I going to find important conditioned on the likelihood of any particular future...". Anyone working with these agents knows they are hopelessly bad at modeling their own capabilities much less projecting that forward.


What is Cursor doing? They need to relax a little bit. Recently I saw they released "Glass" which WAS here: https://cursor.com/glass, now just redirects to /download.

Is "Cursor 3" == Glass? I get they feel like their identity means they need to constantly be pushing the envelope in terms of agent UX. But they could stand to have like an "experimental" track and a "This is VS Code but with better AI integration" track.


Glass was a codename while the UI was in early alpha with testers. It redirects to download now because there is no special link anymore. It's just part of Cursor 3 itself.

Just emailed him. Ridiculous issue.

Disagree completely. Works great for me.

If you actually care about this stuff you are going to run something like https://github.com/waydabber/BetterDisplay which easily allows for HiDPI @ 4K resolution, it does not "look bizarre" or "require fractional scaling". This is what the OP is about. I do the same thing, I run native res w/ HiDPI on a 27" 4K screen as my only monitor, works great.

Unfortunately BetterDisplay cannot set HiDPI @ 4K on the M5 machines - that was the first thing I tried.

Sure, and that is the real tragedy here. The person I'm replying to is just pointing out that native support for high res sucks, which is true, but the real problem is what limits there are on 3rd party support.

It's widely reported and acknowledged as true.


Well, the only people with any ability to acknowledge it have a massive incentive to do so, and I've been around the block enough times to know that startups will use every trick in the book to paint a rosy financial picture, even when it's extremely misleading or occasionally just straight up lies. In the current climate of AI hype my skepticism is even greater.

I'll believe it when I see it.


Where and by who? Critical context missing here.



The CEO hyping his product and the viability of his business during an interview with Stripe does not, at least to me, qualify as “widely reported and acknowledged”


K2.5 is dog shit compared to leading OAI/Ant models.


The OpenCode guys have really surprised me in the way they've reacted to Anthropic shutting down the side-loaded auth scheme. Very petty and bitter. It's clearly just a business decision from Anthropic and a rational one at that, usage subsidization to keep people on the first party product surface is practically the oldest business move in the book and is completely valid.


This is not my impression, could you explain what you're talking about?


Ever since the shutdown of the side-load they've been pretty vocally anti-anthropic on twitter. Paranoid that anthropic is going to torpedo them via some backdoor now that they own bun, insinuating that anthropic shut down the auth from a position of weakness since OpenCode is a superior product, etc.

The thing is OpenCode IS a great product, I'm not sure it's "superior", but unfortunately the way things are evolving where the model + harness pairing is so important, it does seem like they are in a similar position to Cursor (and do not have the resources to try to pivot into developing their own foundational model).


I wouldn't call OpenCode a "great" product tbh. It's nice that it's FLOSS of course, but the overall quality is a bit underwhelming and it's clearly possible to build much better open agentic harnesses. It would be nice if more people tried to do this.


The OpenCode bun dependency is an unsettling issue I would imagine.


if you look at the last few weeks of commits, you can see they've been systematically ripping out everything bun-specific and moving to node


I think frankly OpenCode is delusional to think that Anthropic is actually "concerned" with them in any way. Anthropic's concerns at this point are on the geopolitical level. I doubt stamping out ToS-violating usage of their subscription services is even on executive radar. OpenAI only allows it because it's a cheap PR win and they take those where they can get them.


Opencode is not delusional, it would be delusional to think Anthropic won't after they have already threatened them.


Yeah, I recognized the PR author from Twitter (same avatar) and man he really does come across as incredibly juvenile. Shamelessly talking up OpenAI while shitting on Claude models and the motivation is just so transparent.


I have a huge issue 10416 on OpenCode

https://github.com/anomalyco/opencode/issues/10416

- their stance on privacy


not sure i follow - do they leak my information to their own servers by default?


This is probably the most exhaustive answer to your question as of Jan 7: https://github.com/anomalyco/opencode/issues/459#issuecommen...

The also leaked all prompts to OpenAI until very recently.


Why does Anthropic care how the tokens are consumed?


Valid question. It's because they have a separate product intended for use with general tools: Their API.

Their subscription plans aren't actually "Claude Code Plans". They're subscription plans for their tool suite, which includes claude code. It's offered at a discount because they know the usage of this customer base.

OpenCode used a private API to imitate Claude Code and connect as if it was an Anthropic product, bypassing the need to pay for the API that was for this purpose.

Anthropic has been consistent on this from the start. The subscription plans were never for general use with other tools. They looked the other way for a while but OpenCode was openly flaunting it, so they started doing detection and blocking.

OpenCode and maintainers have gone on the offense on Twitter with some rather juvenile behavior and now they're trying to cheekily allow a plugin system so they can claim they're not supporting it while very obviously putting work into supporting it.

Most of the anger in this thread comes from people who want their monthly subscription to be usable as a cheaper version of the public API, even though it was never sold as that.


Same reason movie theaters care about you not bringing your own snacks


You pay for snacks in the cinema and they lose money if you buy elsewhere. Where does Anthropic lose money when I use OpenCode?


This has been explained many times in this thread. Your subscription to Claude models for use in Claude Code is subsidized. That is, it is only meant to be used with that harness.

When you use that API key with OpenCode, you're circumventing that.


That doesn't make sense.

The PS5 is subsidized because the make money with the games.

Printers are subsidized because the make money with the ink.

The API use is subsidized because they make money with Claude Code? I would understand if Claude Code could only be used with Anthropics API but not the other way around. 1 million tokens is 1 million tokens unless Claude Code is burning tokens and others are more efficient in token use.


They want you to become dependent on Claude Code, so that later they can milk you.


I'd say that they want Claude Code to become the standard, so that they can milk corporations on enterprise plans. We individual subscribers are nothing, but we'll go to work and be vocal about specifically having Claude.


The AI companies can spare their whining about contempt of business model. They're selling a service.


Because models are quickly moving toward commoditization, whether the big three like it or not. The differentiator now is tooling around those models. By eliminating OpenCode's auth stuff, they prevent leaking customers onto another platform that allows model choice (they will likely lose paying customers to one of the major inference catalogs like OpenRouter once they move from Claude Code to OpenCode).


Why does Netflix care how the movies they stream to you are consumed? Shouldn't your $8/mo allow you to stream any movie to OpenFlix and consume however you like?


You are also not allowed to show these Netflix movies on a big screen in front of your house and charge people. The 8 dollar are for a specific use case, just like the tokens in the subscription.


Unironically, you should. In a more just world, laws would mandate service providers not obstruct third party clients.


The pricing would also be different.


Yes, content providers would have to compete with each other on price and library, and client providers could compete on UX and privacy.


Because they're selling discounted tokens to use with their tooling.


If you use Claude through an interface that’s not Claude Code, you’ll only stick with it for as long as it proves itself the best. With other interfaces, you can experiment with multiple models and switch from one to another for different tasks or different periods of time.

Those tokens going to other providers are tokens not going to Anthropic, so they want to lock you in with Claude Code. And it clearly works, since a lot of people swear by it.


because he is giving them at 90% discount in their subscription. they are more than happy if you use the tokens at api pricing, but when subsidized they want you to use their claude code surface.


> Paranoid that anthropic is going to torpedo them via some backdoor

Like with lawyers or something?


Rather the hypothetical situation where anthropic makes a code change to bun to have a backdoor.

Anthropic leadership is delusional, not suicidal, so they would rather use their lawyers.


[flagged]


Sad day when the hacker forum starts lamenting the poor copyright holders.


Hacker news is about hackers in the same way that the peoples democratic republic of Korea is about democracy.


I feel HN did have a more information-wants-to-be-free-ey, disrupt-the-incumbents-ey era, though. Or was it all a dream?


On what basis are you assuming that Anthropic committed greater copyright theft than Meta, OpenAI, and Google (not to mention many lesser-known options)?


Legally speaking they were found to have by a court and the others weren’t


When did that happen? Did they admit guilt in the big settlement, or was there a different case?


opencode is a very meh agent.

Source: i run pretty much all of these agents (codex, cc, droid, opencode, amp, etc) side-by-side in agentastic.dev and opencode had basically 0 win-rate over other agents.


I've been using opencode and would be curious to try something else. What would recommend for self hosted llms?


Very new to self-hosted LLM, but I was able to run Codex with my local ollama server. (codex --oss)


Anthropic provides subsidized access to Claude models through Claude Code. It is well understood to be 'a loss leader' so that they can incentivize people to use Claude Code.

OpenCode lets people take the Claude-Code-only-API-Key, and lets them use it in a different harness. Anthropic's preferred way for such interaction is getting a different, Claude API key (and not Claude Code SDK API key).

---

A rough analogy might be something like getting subsidized drinks from a cafe, provided you sit there a eat food. What if someone says, go to the cafe and get free drink and come sit over at our cafe and order food. It is a loose analogy, but you get the idea.


> It is well understood to be 'a loss leader'

You have zero proof for this claim. It's like people read somewhere that stuff and keep spitting it out again and again without understanding..


If it wasn't the case, the Claude API pricing would be the same, $200 for unlimited use. But it's metered.

We don't know if Claude Code bleeds money for every user that touches it. Probably not. But the different pricing is a strong enough clue that it's an appeal product with subsidized tokens consumption.


API is intended for a different audience - companies with a big pocket who aren't as price sensitive as private users. So the pricing will be different than for a private subscription.


That is not true at all. I, as an individual, can go and get access to Claude models via API today, for, I dont know, for a custom workflow I have.

What Anthropic is saying is - please dont use the API key from Claude Code for that.


There is huge value in getting people to subscribe to recurring payments. Giving people a discount to do so makes sense and does not mean that the subscription service loses money.


> If it wasn't the case, the Claude API pricing would be the same, $200 for unlimited use.

How do you figure? That doesn't make any sense to me.


It's not a loss leader - as in they're not making a loss on the subscription.

Because they control the harness(es) and the backend, they can optimise caching and thus the costs to them.


I'm giving up. Caching is optimized server-side on a product for which they can't control the client.


Loss leader doesnt mean $0. Loss leader means it is subsidized to attain another, larger goal.


Thank you, I understand all of this. My question was about the reference to "petty and bitter."


It revolves around how Open AI has much better models and how Claude Code engineers are a bunch of kids (which is kind of ironic).


What exactly are you referring to?


>usage subsidization

Is this actually the case though? Because I can't imagine what kind of hardware they're running to have costs per 1M tokens be above like $3.


This seems like pure misinformation. The code lines that are actually changed:

              hint: {
                opencode: "recommended",
                -anthropic: "API key",
                openai: "ChatGPT Plus/Pro or API key",
              }[x.id],
They're removing the ability to use OpenCode via Anthropic API key


This is what most people in the comments are missing. They are removing the ability to even use Anthropic APIs not just your Max subscription.


this not true. api keys are supported. only "claude code" is being dropped.

that code is just a cli hint to which LLM they recommend using. so they stop recommending anthropic. rightfully so.


Is this what the legal request demanded or is this just something that OpenCode is doing out of spite? Seems unclear. To me the meat of this change is that they're removing support for `opencode-anthropic-auth` and the prompt text that allows OpenCode to mimic Claude Code behavior. They have been skirting the intent of the original C&D for awhile now with these auth plugins and prompt text.


Using your API key in third-party harnesses has always been allowed. They just don't like using the subsidized subscription plan outside of first-party harnesses. So this seems to be out of spite


It is what the legal demands are. They requested removal of all Anthropic (trademark?) mentions.


Anthropic's issue was always them spoofing OpenCode as Claude Code, piggybacking on the subscription plan.

Banning them from using the pay-per-token API key would be bad business.


I believe parent is talking about a separate topic, not about this change.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: