durron's comments

durron · 2025-10-20T19:57:21 1760990241

Do you find this to still be true with the Sonnet 4.5 model?

extr · 2025-10-20T20:21:00 1760991660

IMO Sonnet 4.5 is great but it just isn’t as comprehensive a thinker. I love Anthropic and primarily use CC day to day, but for any tricky problems or “high stakes, this must not have bugs” issues, I turn to Codex. I do find, however, that if you let Codex run on its own too long it will produce the same kind of sloppy or lacking-in-vision issues that people criticize Sonnet for.

PantaloonFlames · 2025-10-20T20:53:32 1760993612

That’s a curious approach. Why would you use both? Why not just use the more reliable, dependable option for all purposes?

extr · 2025-10-20T21:18:11 1760995091

Sonnet 4.5/CC is faster, more direct, and is generally better at following my intent rather than the letter of my prompt. A large chunk of my tasks are not "solve this concurrency bug" or "write this entire feature" but rather "CLI ops", merging commits, running a linter, deploying a service, etc. I almost use it like it was my shell.

Also while not quite as smart, it's a better pair programmer. If I'm feeling out a new feature and am not sure how exactly it should work yet, I prefer to work with Sonnet 4.5 on it. It typically gives me more practical and realistic suggestions for my codebase. I've noticed that GPT-5 can jump right into very sophisticated solutions that, while correct, are probably not appropriate.

Sonnet 4.5: "Why don't we just poll at an interval with exponential backoff?"

GPT-5: "The correct solution is to include the data in the event stream...let us begin by refactoring the event system to support this..."

That said, if I do want to refactor the event system, I definitely want to use Codex for that.

deaux · 2025-10-21T14:43:46 1761057826

Strangely enough, this is one of the first times I've seen someone here with the exact same experience. GPT-5 is very prone to a style that would, for most codebases, be overengineering. I think since a large part of HN works on huge enterprise FAANG-like code, this is where it shines, so here it gets rave reviews as just being the best overall. But globally, for most developers, it's overengineering and adds a lot of unnecessary code to maintain. Sonnet in that sense remains "every man's coder". I've gone back from 4.5 to 4 now; having spent a good chunk of time with 4.5, it just seems like a slight overall regression with no real upsides besides being a little faster than 4.

extr · 2025-10-21T18:28:34 1761071314

Glad I'm not crazy, the tide right now of codex > sonnet is overwhelming. Frankly I think what most people go by is "does the code work" - codex is admittedly relentless. It's very good at producing code that works. But "does it work" is not the end-all-be-all in most cases...

macNchz · 2025-10-20T23:13:26 1761002006

I frequently have multiple coding assistants going at once—Gemini 2.5 Pro via Aider as the workhorse for most standard changes, Sonnet 4.5 via Claude Code for question answering, documentation, test case development, or broad based changes to many files in a project, then GPT-5 for more complex diagnostic or architectural type things—I don’t generally like the code it writes, but it will often be able to fix situations where the other models get stuck in some kind of local maxima.

NiloCK · 2025-10-21T11:43:24 1761047004

Even inside the claude-code ecosystem, more than ever there are tradeoffs on raw speed vs intelligence vs cost.

Moving a bunch of verbose templated HTML around while watching results on a devserver? Haiku all day. It's a bonus that it's cheaper, but the real treat is its speed.

Adding a feature whose planning will involve intake of several files? Sonnet.

Working specifically on 'copy' or taste issues? Still I tend to prefer Opus here.

Individual experiences may vary!

wrs · 2025-10-20T21:11:16 1760994676

In my experience, there isn’t a model that is more dependable for all purposes. They each have some unique strengths.

theshrike79 · 2025-10-20T20:24:30 1760991870

I'm like 80% sure Sonnet 4.5 is just rebranded Opus.

Sonnet 4 was a coding companion, I could see what it was doing and it did what I asked.

Sonnet 4.5 is like Opus, it generates massive amounts of "helper scripts" and "bootstrap scripts" and all kinds of useless markdown documentation files even for the tiniest PoC scripts.

deaux · 2025-10-21T14:45:04 1761057904

It's very much not, so I'm more than happy to take that bet - how much are we wagering? Have you ever used each for non-coding tasks?

The generation of helper, markdown and bootstrap scripts is very dependent on your harness.

theshrike79 · 2025-10-21T19:18:05 1761074285

I paid for "Claude Code", I'm not asking it for stuff about the Mesopotamian empire :)

esafak · 2025-10-20T20:08:25 1760990905

I don't. Sonnet is faster too.

mmaunder · 2025-10-20T20:36:40 1760992600

Yes. Sadly. And it really does make me sad. I was rooting for Anthropic. Still kinda am.

bgirard · 2025-10-20T20:41:41 1760992901

I have a very similar experience. I was heavily invested in Anthropic/Claude Code, and even after Sonnet 4.5, I'm finding that Codex is performing much better for my game development project.

mmaunder · 2025-10-20T21:54:18 1760997258

It seems particularly good at high performance programming in low level languages.

durron · on July 30, 2023

“Parent-led homeschooling is the approach that puts parents in charge of the education decisions for their children.”

Teachers in the US are required to have a Master’s level education and multiple months of student teaching in order to be certified to teach. Yes, there are many kids in the classroom. Yes, your kid isn’t going to get the same type of 1 on 1 attention you can give them at home. However, work in conjunction with the experts; don’t presume you know better by default.

gnicholas · on July 30, 2023

> Teachers in the US are required to have a Master’s level education and multiple months of student teaching in order to be certified to teach.

There is no federal teacher licensing standard, and state standards vary (and only apply to public school teachers, IIRC). In several states, the only educational requirement is a bachelor's degree. [1] This educational requirement might be coupled with required teaching experience, but this experience can be gained by teaching in a private school, where such requirements do not apply. And none of these experiential requirements establish a baseline for excellence — they just measure based on "time served".

It is simply false to assume that all teachers are "experts" to whom parents (who may have many more years of education, not that formal education is the touchstone), should defer.

1: https://www.axios.com/2017/12/15/8-states-that-made-it-easie...

durron · on May 26, 2023

The article (appropriately) glosses over a fun butterfly effect. The large export of lemons to Britain is a core reason for the existence of the Italian mafia.

https://www.newscientist.com/article/mg23831830-600-why-the-...

vanderZwan · on May 26, 2023

Geez, we're never going to run out of things we can blame on the British Empire, are we?

(this is a joke. I'm joking. I think)

Seriously though, I wish that wasn't paywalled. Sounds like a really interesting bit of story.

durron · on April 15, 2023

980 down/880 up for $99/month. I'm based in Massachusetts, and it's a good deal for this area.

durron · on Sept 16, 2020

At least in the US, this isn't true: tax rates have fallen over the last 40 years.

https://taxfoundation.org/us-federal-individual-income-tax-r...

qwerty1234599 · on Sept 16, 2020

Talking about Europe here

bleppoblepis · on Sept 16, 2020

What makes Europe unique such that the proposition that lowered taxes would lead to greater Millennial success holds true, when the same low-tax policies applied in America seem to have yielded similar results to the European approach? If this seems like a leading question, it's not intended to be--I'm American and curious about the potential variables at play here that I'm not aware of.

avereveard · on Sept 16, 2020

the European market is a disaster for investment and entrepreneurship, that's what's holding millennials back. your average SMB's reach is often regional, and the bureaucratic structure makes it almost impossible for garage operations not only to grow beyond the national level, but to even exist.

like in sports, the size of the talent pool matters, and with an unbearable cost of entry, entrepreneurship is almost a losing proposition unless you're under the heel of some financing partner's whims, as that's the only realistic option to sustain the bottom-line costs of handling VAT MOSS, letters of taxation and the other billion historical bureaucratic commitments

tubularhells · on Sept 17, 2020

The bureaucracy part is not true in the Netherlands. Making a business and administering is easy here. In Germany, it's a nightmare.

durron · on Oct 13, 2019

Yep, close to half my company was laid off at the beginning of the summer. It was hard. It had been clear the company didn’t have the revenue to keep running, but we were constantly assured the current round of funding would close. Then it didn’t, and a lot of us were gone.

durron · on Sept 11, 2019

Salaries at FAANGs are hard. I received an offer (New England based) for an L6 position from one this week and was shocked at the package. After getting off the phone and doing more research, I realized there's a chance I'm being lowballed, and I should negotiate for more if I choose to go to said company. I've done pretty well in my career so far, but even taking the initial offer would almost double my salary.

sys_64738 · on Sept 11, 2019

But Boston is ultra expensive. Like a mini NYC.

durron · on Aug 22, 2019

There's a nice writeup about how the WoW team went about re-creating a 10-year-old version of their game with modern security and toolchains

https://worldofwarcraft.com/en-us/news/21881587/dev-watercoo...

durron · on Aug 15, 2019

https://outline.com/AW7mde

Non paywall