> What's Anthropic's optimization target??? Getting you the right answer as fast as possible!
What makes you believe this? The current trend among all major providers seems to be: get you to spin up as many agents as possible so that you get billed more and their request numbers go up.
> Slot machines have variable reward schedules by design
LLMs from all major providers are trained with RLHF, which optimizes them in ways we don't entirely understand to keep you engaged.
These are incredibly naive assumptions. Anthropic/OpenAI/etc don't care if you get your "answer solved quickly", they care that you keep paying and that all their numbers go up. They aren't doing this as a favor to you and there's no reason to believe that these systems are optimized in your interest.
> I built things obsessively before LLMs. I'll build things obsessively after.
The core argument of the "gambling hypothesis" is that many of these people aren't really building things. To be clear, I certainly don't know if this is true of you in particular, it probably isn't. But just because this doesn't apply to you specifically doesn't mean it's not a solid argument.
> The current trend in all major providers seem to be: get you to spin up as many agents as possible so that you can get billed more and their number of requests goes up.
I was surprised when I saw that Cursor added a feature to set the number of agents for a given prompt. I figured it might be a performance thing - fan out complex tasks across multiple agents that can work on the problem in parallel and get a combined solution. I was extremely disappointed when I realized it's just "repeat the same prompt to N separate agents, let each one take a shot and then pick a winner". Especially when some tasks can run for several minutes, rapidly burning through millions of tokens per agent.
At that point it's just rolling dice. If an agent goes so far off-script that its result is trash, I would expect that to mean I need to rework the instructions and context I gave it, not that I should try the same thing again and hope that entropy fixes it. But editing your prompt offline doesn't burn tokens, so it's not what makes them money.
Cursor and others have a subagent feature, which sounds like what you wanted. However, there has to be some decision making around how to divide up a prompt into tasks. This is decided by the (parent) model currently.
The best-of-N feature is a bit like rolling N dice instead of one. But it can be quite useful if you use different models with different strengths and weaknesses (e.g. Claude/GPT-5/Gemini), rather than assigning all to N instances of Claude, for example. I like to use this feature in ask mode when diving into a codebase, to get an explanation a few different ways.
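To make the distinction concrete, here is a minimal sketch of the best-of-N pattern being described: the same prompt is fanned out to several models and one answer is picked. This is not Cursor's actual implementation; `ask_model` and `best_of_n` are hypothetical stand-ins (real tools would call provider SDKs and use a judge model or the user to pick the winner).

```python
import random

def ask_model(name: str, prompt: str) -> dict:
    # Hypothetical stand-in for a real provider call. The random score
    # simulates run-to-run variance between models/attempts.
    return {
        "model": name,
        "answer": f"{name}'s take on: {prompt}",
        "score": random.random(),
    }

def best_of_n(prompt: str, models: list[str]) -> dict:
    """Fan the same prompt out to N models and pick a 'winner'.

    In real tools the winner is chosen by a judge model or by the user;
    here the simulated score stands in for that judgment."""
    candidates = [ask_model(m, prompt) for m in models]
    return max(candidates, key=lambda c: c["score"])

random.seed(0)
winner = best_of_n(
    "Explain this codebase's auth flow",
    ["claude", "gpt-5", "gemini"],
)
print(winner["model"], "-", winner["answer"])
```

The point of assigning different models (rather than N copies of one) is that the candidates are drawn from genuinely different distributions, so the "dice rolls" are less correlated.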
Simply, cut-throat competition. Given that multiple nations are funding different AI labs, quality of output and speed are among the most important things.
sigh We're doing this lie again? Quality of Outcome is not, has never been, and if the last 40 years are anything to go on will never be a core or even tangential goal. Dudes are trying to make the stock numbers go up and get paid. That's it. That's all it ever is.
The goal of any business is, in principle, profit; by your terms all of them are misaligned.
The fact of the matter is that customers are receiving value, and that value has been a good proxy for which companies will grow to be successful and which will fail.
I'm being neither pedantic nor cynical. Do you need a refresher on value proposition vs actual outcomes on the last few decades of breathlessly hyped tech bubbles? Executive summary: the portions of tech industry that attract the most investment consistently produce the worst outcomes, the more cash the shittier the result. It's also worth noting that "value" is defined as anything you can manipulate someone to pay for.
Ok, to be very honest, I wrote that in the middle of having a couple of drinks. I guess what I mean is, countries are funding AI labs because it can turn into a "winner-takes-all" competition, unless a country starts blocking the leading providers.
Private companies will turn toward the best, fastest, cheapest option (or some average of those). Country borders don't really matter. All the labs are fighting to get the best thing out to the public for that reason, because winning comes with money, status, prestige, and actually changing the world. Incentives like these are rare.
Like I understand this commentary, but it’s so detached from reality. My dad in his 70s is writing Excel macros, even though he never touched that in his life. There are a ton of cases like this, but people can’t see reality out of their domains.
Come on man, you know exactly what I mean. You can keep coming up with these arguments, but the world has moved on already. I genuinely don’t know a single person in 3 different countries from age 12+ who does not use LLMs at least once a week. We have to adapt, or choose to not play the “game”.
Cut-throat competition between nations is usually called war. In war, gathering as much information as possible on everyone is certainly a strategic goal. Selling psyops about how many benefits will come to everyone willing to join a one-sided industrial dependency is also a thing. Giving a significant boost to potentially adversarial actors is not.
That said, the universe doesn't oblige us to think the cosmos is all about competition. Cooperation is always a viable path, often with far greater long-term benefits at scale.
Competition is superfluous, self-inflicted masochism.
There’s a line to be trod between returning the best result immediately and forcing multiple attempts. Google got caught red-handed reducing search quality to increase ad impressions; there's no reason to think the AI companies (of which Google is one) won't slowly gravitate to the same.
My (possibly dated) understanding is that OpenAI/Anthropic are charging less than it costs right now to run inference. They are losing money while they build the market.
Assuming that is still true, then they absolutely have an incentive to keep your tokens/requests to the absolute minimum required to solve your problem and wow you.
That is simply not true; token price is largely determined by the token prices of rival services (even before their own operational costs). If everybody else charges about $1 per million tokens, then they will also charge about $1 per million tokens (or slightly above/below), regardless of how many answers per token they can provide.
That only holds if the rivals have the same performance. Opus pricing is 50x DeepSeek's, and more than 100x that of small models. Pricing should only match rivals' if performance is the same, and if a lab can produce a model with 10x lower token usage, it can charge 10x more.
Google increased the price of the same Flash model by something like 5x, IIRC, when it got better.
I bet that the actual "performance" of all the top-tier providers is so similar that branding has a bigger impact on whether you think Claude or ChatGPT performs better.
This applies when there is a large number of competitors.
Now companies are fighting for the attention of a finite number of customers, so they keep their prices in line with those around them.
I remember when Google started with PPC: because few companies were using it, it cost a fraction of today's prices.
And the other issue to solve is a future lack of electricity for data centers. If everyone wants to use LLMs, but data-center capacity is finite due to available power, token prices can go up. But IMHO devs will find innovative, less energy-demanding approaches to tokens, so token prices will probably stay low.
I'm also seeing a lot of new rambling in Sonnet 4.6 compared to 4.5: more markdown slop, pointing out details in the context that aren't very useful, etc.
This then increases token usage, because you need to prompt multiple times.