Hacker Newsnew | past | comments | ask | show | jobs | submit | jdmoreira's commentslogin

claude code is really good at it from my experience

that's because they are distilling the frontier models

Frontier models aren't released, they are closed source.

And the Chinese have been a huge source of innovation in the field.


You can do a rough distill through the APIs. You don't need the weights.

It was much easier when companies had models on the /completion style APIs, because you could actually get the logits for each generation step, and use that as a dataset to fit your model to.

That isn't to diminish the efforts of the Chinese developers though, they are great.


distilling does not require the models to be released. They simply use the apis.

They have been a source of innovation but probably not in training them.


They just lack of performant hardware. They have enough knowledge. And so they choose a more effective strategy without wasting resources on training from scratch.

is there any evidence that this is significant contribution?

My intuition that one need ALOT api credits to distill such large models.


Peak civilization

You would expect them to have started baking the pizza inside the pentagon already by now :)

Or they order Chinese food to throw it off

That's not how pressure works if it's opened. The forces balance out


Hard to tell... It's not double or more net gain, that I can tell.

I'm not even entirely sure it's a net positive at this point but it feels like it.

I would say probably between 20% to 50% more productive.


I am giving my 6 year old girl an old acer netbook that boots directly to pico-8. This will be her first computing experience. She never had access to phones or tablets.


You’re awesome.


I must be holding wrong then because I do use Claude Code all the time and I do think its quite impressive… still I cant see where the productivity gains go nor am I even sure they exist (they might, I just cant tell for sure!)


if you back and forth with the model, and discuss/approve every change it does, that's the problem.

you need to give it a bigish thing so it can work 15 min on it. and in those 15 min you prepare the next one(s)


Sure. But am I supposed to still understand that code at some point? Am I supposed to ask other team members to review and approve that code as if I had written it?

I'm still trying to ship quality work by the same standards I had 3 or 5 years ago.


when compiler appeared assembly programmers would complain all day how ugly and inneficient the generated code was

if you want to get the productivity gain you need to figure out how to solve the code review problem


Your solution is, "just ship worse code, it's probably fine"?

I think it's your standards that have fallen 90%…


No not, worse code. Wrong code. Code filled with bugs. Code filled with lawsuits too. Code that make you look productive this month while you prepare to leave the company, and turn out to be absolute pooopoo the day after you leave.


I think there might be something here! a core of truth about what the future might hold. I cant take this approach right now though. Its not a good approach today.


But did the generated code do what you told it to do and was it deterministic? LLMs aren’t the same at all. That metaphor doesn’t work.


Very good (albeit extremely cynic) take!

reality is probably somewhere in the middle but it does make a good case


I’ve always had this weird intuition that Zeno’s Arrow Paradox is some indication that there must be some discreteness. Somehow, somewhere, there must be a ‘tick.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: