Uhhrrr's comments | Hacker News

The software is never accountable, so the human running it is always accountable.

That is how it should be, not how it is.

That is a good example of "dumb shit". No one believes Musk is a Nazi, but they try to make hay with it anyway.

You do not have to look beyond Elon's own Twitter account's posts, retweets, and likes to see that he is a full-fledged white supremacist. Calling him a Nazi is appropriate.

Ok, I went and looked at his last 50 or so tweets. I didn't see anything that supports what you are saying.

The very first clip is video of Cory Booker.

The last couple sentences tie things up really nicely.

Appreciate this. I thought deeply about that and took a lot of time, when we had little of it to go around, iterating and shaping it. (Including talking to computer scientists to make sure I sounded as not-dopey as possible on the technical side!)

If you don't review what the product does, you are being irresponsible with the product.

Is the CEO responsible for a company's financial performance? Do they review every line of code the company writes?

It is more irresponsible to spend the time reviewing all of the code rather than spending that time on things with bigger levers for satisfying your customers.


Yes, but if a dev pushes a line of code that wipes the accounts of millions of users at a fintech, the dev will get fired but the CEO will get sued into oblivion. If the agent isn't responsible, you HAVE to be, because angry people won't listen to "it's no one's fault your money is gone".

Put an ORM in your private repo which, 1% of the time, randomly calls DROP TABLE.
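For the avoidance of doubt, a minimal Python sketch of that booby trap; ChaosORM is a made-up name and sqlite3 is used purely for illustration (obviously, never ship anything like this):

    import random
    import sqlite3

    class ChaosORM:
        """Hypothetical 'ORM' wrapper that sabotages 1% of queries."""

        def __init__(self, path):
            self.conn = sqlite3.connect(path)

        def execute(self, sql, params=()):
            if random.random() < 0.01:
                # The booby trap: drop a randomly chosen table first.
                row = self.conn.execute(
                    "SELECT name FROM sqlite_master WHERE type='table' "
                    "ORDER BY RANDOM() LIMIT 1").fetchone()
                if row:
                    self.conn.execute(f"DROP TABLE {row[0]}")
            return self.conn.execute(sql, params)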


When I read The Soul of a New Machine, I quickly realized that it had been used as a style guide for several later books, and for most long-form tech articles in Wired, the NYT, etc.


The two errors, then, were that the LLM hallucinated something, and that a human trusted the LLM without reasoning about its answer. The fix for this common pattern is to reason about LLM outputs before making use of them.
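Concretely, "reason about LLM outputs" can mean checking them against ground truth before acting on them. A minimal Python sketch for the classic hallucinated-package case; llm_suggests_module is a made-up stand-in for an actual LLM call:

    import importlib.util

    def llm_suggests_module() -> str:
        # Stand-in for an LLM call; imagine it sometimes
        # hallucinates a module that does not exist.
        return "numpyy"

    name = llm_suggests_module()
    # Validate against ground truth (the installed environment)
    # before trusting the suggestion.
    if importlib.util.find_spec(name) is None:
        raise LookupError(f"LLM suggested an unknown module: {name}")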


A big problem now, both internally at companies and externally, is that official support channels are being replaced by chatbots, and you really have no option but to trust their output because a human expert is no longer available.

If I post a question to the internal payment team's forum about a critical processing issue and some "payments bot" replies to me, should I be at fault for trusting the answer?


I know this is happening with external customer support, but is this really happening internally at big companies? Preventing you from talking to a human in the correct department about an issue feels like a bomb waiting to explode.


There is at least an effect where chatbots have become the first line of support. Even if you are not prevented from talking to a human, the managers of the humans you would talk to have decided that, since the chatbot is there, it is inappropriate for them to spend much time supporting coworkers in other departments when the chatbot can do it.

So, to a degree, corporate politics can discourage it.


I'm sure it is. Thankfully I don't work for a company this large any more, but when I was employed by a multinational with 30K+ employees, our IT department was outsourced to India and you had to get through a couple layers of phone tree/webchat hell to actually talk to a real person. I could easily see companies of this size replacing their support with LLM nonsense.


Teams are heavily incentivized to incorporate AI into their internal workflows. At Meta it is a requirement, and failing to do so will come up in your performance review.


Yes, of course, and the company which removes human experts should expect things to fail in the manner that things usually fail when you remove your internal experts.


> The fix for this common pattern is to reason about LLM outputs before making use of them.

That is politics. Not engineering.

Assigning a human to "check the output every time" and blaming them for the faults in the output is just assigning a scapegoat.

If you have to check the AI output every single time, the AI is pointless. You could just do the work yourself from the start.


The humans are not scapegoats, because they are capable of taking on responsibility.

There is a point to using LLMs. They can save time by doing a first pass. But when they do the last pass, disasters will follow.


Well, I'd say there are two dimensions:

1. Check frequency (from every single time down to occasional spot checks).

2. Check thoroughness (from antagonistic, in-depth review to a high-level skim).

I'd agree that, if you're at the far end of both dimensions (checking everything, antagonistically), the system is not generating any value.

A lot of folks are taking calculated (or, I guess in some cases, reckless) risks right now by relaxing one or both of those dimensions. I'd argue that in many situations the risk is small and worth it. In many others, not so much.

We'll see how it goes, I suppose.
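A toy Python sketch of those two knobs, with made-up names and thresholds, just to make the trade-off concrete:

    import random

    def skim(output: str) -> bool:
        # High-level check: does it look plausible at a glance?
        return len(output.strip()) > 0

    def deep_review(output: str) -> bool:
        # Antagonistic, in-depth check: verify claims, run the
        # code, etc. (stubbed here; the expensive part in reality).
        return skim(output)

    def accept(output: str, check_rate: float, in_depth: bool) -> bool:
        """check_rate is dimension 1 (frequency),
        in_depth is dimension 2 (thoroughness)."""
        if random.random() > check_rate:
            return True  # accepted unreviewed: the risk being taken
        return deep_review(output) if in_depth else skim(output)

Setting check_rate=1.0 and in_depth=True is the "no value generated" corner; the calculated risk is in lowering either knob.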


Groooooooaaaaaaaaaaaannnnnnnnnnn


Well, attempts to engineer the brittleness out of human behavior have not worked, like, ever.


If "the level of awareness that created a problem, cannot be used to fix the problem", then you're asking too much if you expect a human to reason about an LLM output when they are the ones that asked an LLM to do the thinking for them to begin with.


This feels like a rediscovering/rewording of Kernighan's Law:

"Debugging is twice as hard as writing the code in the first place. Therefore, if you write the code as cleverly as possible, you are, by definition, not smart enough to debug it." ~ Brian Kernighan


It's an old saying; I think Einstein is cited most often for it. Something like this, according to Google:

"We cannot solve our problems with the same thinking we used when we created them."


In this case you would replace the human.


Yes, I'd fire them, and then hire a more competent human.

I'm pretty happy with the team I've built. They make solid decisions that I can trust every time. I can't say the same for the LLM.


However, automation bias is a common problem (predating AI): the 'human-in-the-loop' ends up implicitly trusting the automated system.


At least pre-LLM automation was written by a careful human whose job was on the line, and it was deterministic.


When organizational incentives penalize NOT using AI, and the bottom x% are fired regularly, are you really surprised that LLM outputs aren't being scrutinized?


Yes, because trusting LLM output is a great way to be in the bottom x%.


It's more like: the LLM "hallucinated" (I hate that term) and automatically posted the information to the forum. It sounds like the human didn't get a chance to reason about it, at least not the original human who asked the LLM for an answer.


I'm not in AI, but is what's happening here that it built output from the long tail of its training data? Instead of branching down the more common probability paths, did something in this interaction send it traveling into the data wilderness?

So I asked AI to give it a good name, and it said “statistical wandering” or “logical improv”.
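That's roughly the picture: sampling draws tokens from a probability distribution, and a higher temperature (or an unlucky draw) flattens that distribution so the low-probability tail gets hit more often. A toy, purely illustrative Python sketch:

    import math
    import random

    def sample(logits, temperature=1.0):
        # Softmax over temperature-scaled logits, then draw one token.
        scaled = [l / temperature for l in logits]
        m = max(scaled)
        exps = [math.exp(s - m) for s in scaled]
        total = sum(exps)
        probs = [e / total for e in exps]
        r, acc = random.random(), 0.0
        for i, p in enumerate(probs):
            acc += p
            if r <= acc:
                return i
        return len(probs) - 1

    # A fat head (indices 0-1) and a long tail of unlikely tokens.
    logits = [5.0, 4.5] + [0.5] * 50
    # At temperature 1.5 the distribution is flatter, so the
    # "data wilderness" (indices >= 2) is sampled more often.
    print(sample(logits, temperature=1.5))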


If you don't like hallucinate, try bullshit. [NB: bullshit is a technical term; see https://en.wikipedia.org/wiki/On_Bullshit]

https://www.psypost.org/scholars-ai-isnt-hallucinating-its-b...


That is my preferred term, but it seems to derail discussions that might otherwise have been productive (might... one can hope).


Placing a bet based on insider national security information should be regarded as leaking. But the markets aren't the problem here.


Well, markets give a huge financial incentive for it. Before, you had to pay a bribe to an intelligence agent. Now you can just "legitimately" bet on a market. It's a LOT easier and more spread out, I imagine.


The existence of banks gives a huge financial incentive to rob them. That doesn't mean we should get rid of banks. It means we should create a huge disincentive to rob them (which we do). Same thing needs to happen to people using national intelligence secrets in prediction markets.


That requires active participation in regulation and enforcement.

If anything, this administration is moving the other direction when it comes to betting markets, crypto, investments, etc.

What you are saying is logical for how enforcement should happen, but it isn’t happening that way.


Banks are about saving money and reducing risk. Arguably, their getting into high-risk investments chasing high rewards is part of what contributed to the 2008 recession.

Not sure banks are the best example to prove the country is not headed towards high-risk gambling.


On the other hand, it provides the whole world with the information, not just some spy agency. Isn't that a fairer system?

And the people dying in question are soldiers, professional murderers for hire themselves, so not a big loss.


You couldn’t design a better system for incentivizing leaks if you were trying. Hell, the CEO literally said as much. Not sure how you can conclude the markets aren’t the problem.


Yeah, I had to reread that part... I was like, no way the CEO of Polymarket publicly said on record that it incentivizes leaks. Had to check to make sure I wasn't on The Onion.


The CFTC has been defunded and dismantled. The industries it regulated don’t even bother to put on a mask anymore.


He was talking about companies in that part of the interview, and getting company info leaked to the market is generally a good thing.


Wow, it took me some time to dig the interview out [0]. I think it's stupid that The Atlantic did not link to it, and that they misrepresented the context.

I agree that company info being leaked is whatever. No one is hurt by knowing that Apple is working on a foldable phone; maybe an exec loses his million-dollar bonus and can't upgrade his yacht this year, and the market can operate off of that knowledge.

But the flip side is that there's no way to distinguish between leaked company info and leaked government info, and up until this era of history, there was rarely financial incentive for anyone to leak govt info, and even if there were, it was almost impossible to do so completely anonymously.

I'm not necessarily agreeing with the article. Who knows if that actually happened? But the incentives make it more plausible than ever.

[0] https://www.youtube.com/live/t647EWQst5A?si=TPNx7RbxWvn9wVdn


Let's make bets on "xxxx leaked yyyy info", so people will be incentivized to discover/denounce who cheated.


How do they know? Has this been published in a Reliable Source?


This is the official Wikimedia Foundation status page for the whole of Wikipedia, so it's a reliable primary source.


Actually, usage of primary sources is kinda complicated [0]; generally, Wikipedia prefers secondary and tertiary sources.

[0] https://en.wikipedia.org/wiki/Wikipedia:No_original_research...


Yeah, but the purpose of an encyclopedia like Wikipedia (a tertiary source) is to relatively neutrally summarize the consensus of those who spend the time and effort to analyze and interpret the primary sources (and thus produce secondary sources), or if necessary to cite other tertiary summaries of those.

In a discussion forum like HN, pointing to primary sources is the most reliable input to the other readers' research on/synthesis of their own secondary interpretation of what may be going on. Pointing to other secondary interpretations/analyses is also useful, but not without including the primary source so that others can - with apologies to the phrase currently misused by the US right wing - truly do their own research.


If you spend any time on Wikipedia, you'll find that secondary sources from an existing list are always preferred. The mandate from the link in the GP (https://en.wikipedia.org/wiki/Wikipedia:No_original_research) extends, or at least is interpreted to extend, to actively punishing editors who attempt to analyze or interpret primary sources.

My original post was a joke about this.

