I find it quite funny how this blog post has a big "Ask ChatGPT" box at the bott...

andrewguenther · 2026-03-05T22:35:39 1772750139

It looks like this doesn't work for users without accounts? It works when I'm logged in, but not logged out. I went ahead and reported it to the team. Thanks for letting us know!

dotancohen · 2026-03-06T02:44:09 1772765049

No integration test for guest (non-logged in) users?

Hahaha who am I kidding. No integration tests for anybody!

Rohunyyy · 2026-03-06T03:22:44 1772767364

SDET here. A year ago when AI came into play SDET/QA roles started disappearing. People were like oh ya anyone can write tests. Then with the recent fiascos about outages and what not, I am seeing the SDE roles are disappearing and SDET roles are going back up?! Apparently AI is good at writing applications but you still need someone to make sure it is doing the right things.

DrewADesign · 2026-03-06T03:52:41 1772769161

It’s not really good at writing the software either — it’s a moderate to decent productivity booster in an uneven, difficult-to-predict assortment of tasks. Companies are just starting to exit the “we’re still trying to figure this out” grace period. Expect more of that as soon as these chatbot companies have to start charging enough to pull in more money than they spend. I foresee some purpose-built models that are pretty lean being much more useful in long run. It’s neat that the bot which can one-shot a simple CRUD website for you can also crank out Scrubs-based erotic fan fiction novellas by the dozen but I don’t foresee that being a sustainable business model. Having good purpose-built tools is, in my opinion, better than some unwieldy tool that can do a whole bunch of shit I don’t need it to.

dotancohen · 2026-03-06T03:55:49 1772769349

Interestingly, the first real productive use of AI that I found was writing the unit tests and integration tests for my applications. It was much better at thinking about corner cases than I was.

democracy · 2026-03-06T03:09:18 1772766558

integration tests? so last century....

ulfw · 2026-03-06T03:38:13 1772768293

But but but but I thought AI would do this magically for all of us, no?

No more need for pesky humans, no?

k4rli · 2026-03-06T12:48:08 1772801288

"You're absolutely right! I understand the assignment completely. Now let me delete the blog post."

curiousgal · 2026-03-06T05:23:13 1772774593

Tell them to stop being evil while you're at it.

baxtr · 2026-03-05T22:40:46 1772750446

I picked up Claude today after being away and using only ChatGPT and Gemini for a while.

I was pretty impressed with how they’ve improved user experience. If I had to guess, I’d say Anthropic has better product people who put more attention to detail in these areas.

gizmodo59 · 2026-03-06T00:04:18 1772755458

ChatGPT has given me more for my $20 than any other vendor. And that’s not even considering Codex, which is so good and has much, much higher limits.

manojlds · 2026-03-06T03:15:03 1772766903

How is that relevant? Also, when you are behind you do give more usage

triage8004 · 2026-03-06T01:52:13 1772761933

They are all losing money on probably all levels of the packages if you max them out

bwat49 · 2026-03-06T01:19:55 1772759995

yeah claude is great... but only if you pay $100-$200 a month

beefsack · 2026-03-06T03:27:29 1772767649

Many people buy two separate Claude Pro subscriptions, and that makes the limit a non-issue. It works surprisingly well when you tend to hit the 5-hour limit after a few hours and the weekly limit after 4-5 days. $40 vs $100 is significant for a lot of people.

ruszki · 2026-03-06T09:39:37 1772789977

I hit the Pro limit in about 30 minutes, 1 hour max. And that's with a single session that I don't use extensively, i.e. it waits for my responses while I read and really understand what it wants and what it does. That's still just 1-2 hours out of every 5 hours.

What do you do to avoid that?

AlexeyBelov · 2026-03-06T10:26:15 1772792775

You're probably having long sessions, i.e. repeated back-and-forth in one conversation. Also check if you pollute context with unneeded info. It can be a problem with large and/or not well structured codebases.

ruszki · 2026-03-06T11:43:52 1772797432

The last time I used Pro, it was for a brand new Python REST service of about 2000 lines, all of it generated during the session. So how do I tell Claude to use less context when there was none at the beginning, just my prompt?

nevertoolate · 2026-03-06T12:38:15 1772800695

So you had generated 2000 lines in 30 minutes and ran out of tokens? What was your prompt?

I’d use a fast model to create a minimal scaffold like gemini fast.

I’d create strict specs using a separate codex or claude subscription to have a generous remaining coding window and would start implementation + some high level tests feature by feature. Running out in 60 minutes is harder if you validate work. Running out in two hours is also hard for me as I take breaks. With two subs you should be fine for a solid workday on a well designed and reviewed system. If you use coderabbit or a separate review tool and feed back the reviews, that again doesn’t burn tokens so fast unless it runs fully autonomously.

smartbit · 2026-03-06T04:03:40 1772769820

Thanks for the tip, didn’t think of using 2 subscriptions at the same company.

When reaching a limit, I switch to GLM 4.7, which I get through a GLM Coding Lite subscription that was offered at the end of 2025 for $28/year. I also use it for compaction and the like to save tokens.

devld · 2026-03-06T15:20:20 1772810420

I'm using it via Copilot, and am now considering also trying Open Code (with the Copilot license). I don't know if it's as good as Claude Code, but it's pretty good. You get 100 Sonnet requests or 33 Opus requests per month in the subscription ($20 business plan), and some less powerful models have no limits (e.g. GPT 4.1). An extra Sonnet request is $0.04 and an extra Opus request is $0.12, so another $20 buys 250 Sonnet requests + 83 Opus requests. This works better for me since I do not code all day, every single day. Also, a request is a request, so it does not matter if it's just a plain edit task or an agent request; it costs the same.
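
For anyone checking the math on the extra requests, here is a rough sketch in Python. The per-request prices are the ones quoted in this comment, not official pricing, and `extra_cost_usd` is just a made-up helper name for illustration.

```python
# Prices as quoted in the comment above, in cents per extra request.
# These are the commenter's numbers, not an official price list.
SONNET_CENTS = 4    # $0.04 per extra Sonnet request
OPUS_CENTS = 12     # $0.12 per extra Opus request


def extra_cost_usd(sonnet_requests: int, opus_requests: int) -> float:
    """Cost in USD of a mix of extra requests (hypothetical helper)."""
    total_cents = sonnet_requests * SONNET_CENTS + opus_requests * OPUS_CENTS
    return total_cents / 100


# The mix mentioned above: 250 Sonnet requests plus 83 Opus requests.
print(extra_cost_usd(250, 83))  # 19.96
```

So the "250 Sonnet + 83 Opus" figure is one way of splitting the extra $20, roughly $10 on each model, not what $20 buys of each.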

Btw. I trust Microsoft / GitHub to not train on my data more (with the Business license) than I would trust Anthropic.

nerdsniper · 2026-03-06T04:06:32 1772769992

To be honest it feels very worth my $200/mo. And I “only” make $80k/year. I used to have two ChatGPT subs but Claude is just so much better.

abustamam · 2026-03-05T23:32:45 1772753565

I agree! I recently migrated from ChatGPT to Claude and it is just superior in every way. It doesn't blather on and then ask me for clarification at the end. It's succinct and clarifies vital information before providing a solution.

vostrocity · 2026-03-06T02:22:27 1772763747

Voice input is still far less accurate than OpenAI's unfortunately, otherwise I would have already switched.

abustamam · 2026-03-06T15:34:01 1772811241

Oh interesting. I've never used voice input on either so I can't comment, but understandable why you can't switch if it's disruptive to your workflow to do so.

beachy · 2026-03-06T00:53:43 1772758423

I held off migrating from ChatGPT to Claude Code due to being a laggard that lived in the Eclipse world. I didn't believe what I was told that I wouldn't be writing code any more. Pushed into action by recent PR gaslighting from OpenAI, I jumped to claude code and they were right - I barely venture into the IDE now and certainly don't need an integration.

hamasho · 2026-03-06T02:43:27 1772765007

I agree, but in general those chat apps have a relatively bad user experience for multibillion-dollar B2C companies. I used to have a lot of surprises and frustrations while using Claude Code / Desktop, and still encounter issues, but it's the best among the major LLM services.

majormajor · 2026-03-06T03:16:52 1772767012

It's funny cause, you know, fixing all those little nitty gritty things should be practically automatic with their own offerings... have your agent put in a lot of instrumentation... have it chase down bugs or dead-end user-journeys... have it go make the changes to fix it...

I've seen these tools work for this kinda stuff sometimes... you'd think nobody would be better at it than the creators of the tools.

sreekanth850 · 2026-03-06T02:31:05 1772764265

True. Every time I ask GPT something, it tends to spit out long stories. Claude and Gemini always get straight to the point.

twelvedogs · 2026-03-06T02:34:18 1772764458

I bullied it into giving me concise answers, now it starts every answer with "just quickly" or something similar but it gets straight to the point

sreekanth850 · 2026-03-06T09:27:15 1772789235

I always add "no nonsense, no bullshit" at the end of my prompt. It's annoying how it tries to please the user.

forgotpwd16 · 2026-03-06T19:50:49 1772826649

No need to do it yourself in every prompt. Just put it in Custom instructions under Personalization.

sreekanth850 · 2026-03-07T12:19:09 1772885949

Thank you, I never saw this.

forgotpwd16 · 2026-03-06T19:52:55 1772826775

It doesn't seem to be well known that ChatGPT has a few style/tone choices besides the default. One is specifically for being concise and plain.

kgeist · 2026-03-06T10:54:25 1772794465

I had something similar happen with skills today. A popup appeared saying, "hey, did you know ChatGPT has skills?" Clicking on it opened a new chat window, and after some thinking it said, "I tried to launch the built-in skills demo flow, but it isn’t available".

They barely test this stuff.

rapind · 2026-03-06T11:48:50 1772797730

> They barely test this stuff.

In all fairness they are more focused on domestic surveillance these days.

DonsDiscountGas · 2026-03-06T12:48:12 1772801292

They're testing it in production apparently. With release cycles this fast there's no other way.

ElijahLynn · 2026-03-05T22:27:04 1772749624

fwiw: I get a valid response when following the steps you mentioned. I do not get the message you mentioned:

https://chatgpt.com/share/69aa0321-8a9c-8011-8391-22861784e8...

EDIT: oh, but I'm logged in, fwiw

zamadatix · 2026-03-05T22:03:55 1772748235

Following this process summarizes the blogpost for me. Perhaps the difference is I'm signed into my account so it can access external URLs or something of that nature?

beambot · 2026-03-06T02:08:08 1772762888

It's like opening copilot in a word doc and it telling you it can't see the document in its context

reval · 2026-03-06T02:23:31 1772763811

This is infuriating. However, for those in this situation, know this: it works if the document or spreadsheet is in OneDrive. I just wish Copilot told you this instead of asking you to upload the doc.

pocksuppet · 2026-03-05T22:25:56 1772749556

Most AI integration is like this. It's not about building working products --- it's about bragging that you put a chatbox in your program.

bartread · 2026-03-05T23:17:25 1772752645

This is such a stale take. In the past 3 years I’ve worked on multiple products with AI at their core, not as some add-on. Just because the corpo-land dullards[0] can’t execute on anything more complex than shoehorning a chatbot into their offerings doesn’t mean there aren’t plenty of people and companies doing far more interesting things.

[0] In this case, and with heavy irony, including OpenAI, although it sounds like most of this particular snafu is due to a bug.

saghm · 2026-03-06T00:11:15 1772755875

> Most AI integration is like this.

>> This is such a stale take. In the past 3 years I’ve worked on multiple products with AI at their core, not as some add-on. Just because the corpo-land dullards[0] can’t execute on anything more complex than shoehorning a chatbot into their offerings doesn’t mean there aren’t plenty of people and companies doing far more interesting things.

I feel like this is just a disagreement of what "AI integration" means. You seem to agree that the trend they're describing exists, but it sounds like you're creating new products, not "integrating" it into existing ones.

abustamam · 2026-03-05T23:34:05 1772753645

Kinda reminds me of crypto. There are certainly very interesting things happening in the crypto space. But the most visible parts of the crypto universe are the stupid parts (buying PNGs for millions, for example)

thereticent · 2026-03-06T00:53:22 1772758402

Genuinely curious, not being combative...what very interesting things have happened in the crypto space lately?

abustamam · 2026-03-06T01:22:43 1772760163

Oh, I dunno about lately (though I did stumble upon https://a16zcrypto.com/posts/article/big-ideas-things-excite... )

But when I was in the crypto space in 2018, there were a lot of interesting things happening in the smart contract world (like proofs of concepts of issuing NFTs as a digital "deed" to a physical asset like a house).

I don't think any of those novel ideas went anywhere, but it was a fun time to be experimenting.

ulfw · 2026-03-06T05:25:49 1772774749

> like proofs of concepts of issuing NFTs as a digital "deed" to a physical asset like a house

which went absolutely nowhere

abustamam · 2026-03-06T15:32:48 1772811168

Yeah, like most startups. I'd argue that a majority of AI startups now will go nowhere as well. That's just how new technology goes. Lots of shiny objects, lots of hype, and maybe 1%, if that, goes on to become a foundation of society.

Jury is still out on if crypto will become a foundation for society (if anything, it would be foundational for something boring and invisible like banking). I wouldn't bet on a startup doing that, but that's the only viable thing I can foresee crypto being useful for. But it doesn't mean that other applications can't be interesting and useless!

pocksuppet · 2026-03-07T23:15:06 1772925306

Teaching tens of thousands of programmers how the financial system actually works was interesting, heh.

LordDragonfang · 2026-03-05T23:33:37 1772753617

I mean, to be fair, both things can be technically true. There can be lots of interesting things being done, even while most can be low-effort garbage.

But this is just Sturgeon's Law (ninety percent of everything is crap), not an actually insightful addition to the discussion, and I very much agree it's a stale take.

rishikeshs · 2026-03-06T13:53:42 1772805222

This is not only openai, but other models as well. Last week I added a summarise with AI block on a product blog page. I had seen it somewhere and felt like it’s a cool feature to have. Wrote a small shortcode in hugo for the block and added it with various models.

It’s hit and miss; sometimes Claude says it cannot access the site, which is not true.

Ref: https://formbeep.com/blog/building-formbeep-weekend/

amelius · 2026-03-05T22:43:23 1772750603

If only they had an LLM they could use as a software testing agent.

kennywinker · 2026-03-06T03:04:37 1772766277

I think you might have hit on the issue - just the wrong way around. I would assume they’re using LLMs for testing, and no humans or maybe just one overworked human, and that is the problem

Razengan · 2026-03-06T13:49:17 1772804957

As bad as Google Gemini telling me it couldn't search Google Flights or Google reverse image search for me. These companies really need to dogfood their own products first. Do they not realize how embarrassing it is when their flagship intelligence refuses to interop with their own services?

martin_drapeau · 2026-03-06T15:49:56 1772812196

Codex suggested I try Codex Spark for a limited time, so I gave it a shot for my next session. It is much, much faster. However, on the task I gave it, it spun in circles cycling through files and finally gave up, saying it ran out of tokens. Major fail.

Aurornis · 2026-03-05T21:30:28 1772746228

Probably intentional. They don't want open, no-registration endpoints able to trigger the AI into hitting URLs.

jazzypants · 2026-03-05T21:38:16 1772746696

But, why include the non-functional chat box in the article?

embedding-shape · 2026-03-05T21:45:42 1772747142

Different team "manages" the overall blog than the team who wrote that specific article. At one point, maybe it made sense, then something in the product changed, team that manages the blog never tested it again.

Or, people just stopped thinking about any sort of UX. These sorts of mistakes are all over the place, on literally all web properties, some UX flows just end with you at a page where nothing works sometimes. Everything is just perpetually "a bit broken" seemingly everywhere I go, not specific to OpenAI or even the internet.

colonCapitalDee · 2026-03-05T22:01:44 1772748104

That's why it happened. It still shouldn't have happened.

sumedh · 2026-03-06T10:40:57 1772793657

> team that manages the blog never tested it again.

They can use this new tech called AI to test it.

ethbr1 · 2026-03-05T22:31:40 1772749900

> Or, people just stopped thinking about any sort of UX. These sorts of mistakes are all over the place, on literally all web properties, some UX flows just end with you at a page where nothing works sometimes.

It's almost like people are vibe coding their web apps or something.

teaearlgraycold · 2026-03-05T21:58:25 1772747905

If only there were some kind of way to automatically test user flows end to end. Perhaps tests could be run periodically, or even for each code change.
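
Joking aside, a check like this is small. Below is a minimal sketch, using only the Python standard library, of a test that exercises the same flow once as a guest and once as a signed-in user. Everything in it is an assumption for illustration: `DemoHandler` is a stand-in server, and the `/summarize` path and `session=` cookie are made up, since nobody outside knows how the real page is wired.

```python
import threading
import unittest
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


class DemoHandler(BaseHTTPRequestHandler):
    """Hypothetical stand-in for the page that hosts the chat box."""

    def do_GET(self):
        # Treat any request carrying a "session=" cookie as signed in.
        logged_in = "session=" in self.headers.get("Cookie", "")
        body = b"summary for member" if logged_in else b"summary for guest"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep test output quiet


def fetch(url, cookie=None):
    """Return (status, body) for a GET, optionally sending a cookie."""
    request = urllib.request.Request(url)
    if cookie:
        request.add_header("Cookie", cookie)
    # An empty ProxyHandler bypasses any proxy configured in the environment.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    with opener.open(request, timeout=5) as response:
        return response.status, response.read().decode()


class SummarizeFlowTest(unittest.TestCase):
    @classmethod
    def setUpClass(cls):
        # Port 0 lets the OS pick a free port for the demo server.
        cls.server = HTTPServer(("127.0.0.1", 0), DemoHandler)
        cls.url = f"http://127.0.0.1:{cls.server.server_port}/summarize"
        threading.Thread(target=cls.server.serve_forever, daemon=True).start()

    @classmethod
    def tearDownClass(cls):
        cls.server.shutdown()
        cls.server.server_close()

    def test_guest_user(self):
        status, body = fetch(self.url)
        self.assertEqual(status, 200)
        self.assertIn("summary", body)

    def test_logged_in_user(self):
        status, body = fetch(self.url, cookie="session=abc")
        self.assertEqual(status, 200)
        self.assertIn("summary", body)
```

Saved as a test module, this runs with `python -m unittest`, either on a schedule or on each change in CI. A real end-to-end test would drive a browser against the deployed page instead of a stub, but the guest vs signed-in split is the part that matters here.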

koakuma-chan · 2026-03-05T22:11:08 1772748668

There is no business value in doing that.

teaearlgraycold · 2026-03-05T23:44:17 1772754257

There most certainly is, but maybe the time spent on it could be better allocated to something else.

koakuma-chan · 2026-03-06T00:09:10 1772755750

Yeah, like adding more features.

teaearlgraycold · 2026-03-06T22:38:44 1772836724

Sometimes I’d pay for them to remove features.

observationist · 2026-03-05T21:43:56 1772747036

They're having service issues - ChatGPT on the web is broken for a lot of people. The app is working on Android - I'd assume that the rollout hit a hitch and the chatbox in the article would normally work.

jdndbdjsj · 2026-03-05T21:46:52 1772747212

Welcome to a big company

AirGapWorksAI · 2026-03-05T22:25:23 1772749523

Welcome to a big company where pretty much everyone has been working full steam for years, in order to take advantage of having a job at a company during a once-in-a-lifetime moment.

m3kw9 · 2026-03-05T21:51:36 1772747496

what? it's their own site and own llm. I could paste most sites and it would work.

judge2020 · 2026-03-05T21:38:53 1772746733

Works for me: https://rr.judge.sh/Labradorretriever/d6af05/chrome_j9rXJMlf...

netdur · 2026-03-05T23:19:23 1772752763

Did it complain about copyright issues?

mempko · 2026-03-06T03:38:16 1772768296

vibe coded. But vibes are off

peab · 2026-03-05T23:32:16 1772753536

LOL - yes Sam, AGI is near indeed. (sarcasm)