Hacker Newsnew | past | comments | ask | show | jobs | submit | shridharxp's commentslogin

Few months ago, the founder was talking about "AGI" and ridiculous universal basic compute. At this point, I don't even know whom to believe. My first hand experience tells ChatGPT and even ClaudeCode are no where near the expertise they are touted to be. Yet, the marketing by these companies is so immense that you get washed away, you don't know who are agents and who are putting their true opinions.


> My first hand experience tells ChatGPT and even ClaudeCode are no where near the expertise they are touted to be

Not doubting you, but where specifically have the latest models fallen short for you?


ClaudeCode:

- Making functions async without need; it doesn't know the difference between the two or in which scenarios to use them.

- Consistently fails to make changes to the frontend if a project grows above 5000 LOC or a file goes near 1000 LOC.

- The worst part is it lies after making changes.

ChatGPT:

- Fails to implement mid-complex functionality such as scrolling to the bottom when new logs are coming in and not scrolling when the user is checking historical logs.

These models are good at mainstream tasks, the snippets of which you find a lot in repositories. Try to do something off-beat such as algorithmic trading; they fail spectacularly.


I'm unsure how someone could use LLMs regularly and not encounter significant mistakes. I use them a lot less than some devs and still run into basic errors pretty often, to the point that I rarely bother using them for niche or complicated problems even though they are pretty helpful in other cases. Just in the past few days I've had Claude trip all over itself on multiple basic tasks.

One case was asking how to do a straightforward thing with a popular open source JavaScript library, right in the sweet spot of what models should excel at. Claude's whole approach was completely broken because it relied on a hallucinated library parameter that didn't exist and didn't have an equivalent. It invented a keyword that doesn't appear in the entire open source library repo, to control functionality the library doesn't have.


Care to share a chat link?


There is no moat in selling/renting AI models. They are a commoditized product now. I can't imagine with what thought process did investors poured in such money on OpenAI.


Tulip mania is a mania because it short circuits thought.


It's okay to bring some "natural" language in technical communication. It feels more humane. All the whitewashed corporate language, riddled marketing bullshit feels so soul dead.


You can express dissatisfaction and anger „naturally“ without calling people losers and monkeys.


> You can express dissatisfaction and anger „naturally“ without calling people losers and monkeys.

I can't speak for others. But if I am screwing up as badly as GitHub is, I'd rather someone calls me a loser and monkey for it. It's like someone splashing ice cold water on my face and showing me the reality. It's going to be very uncomfortable, yes. But I'll learn from it and try not to screw up so badly again. I find this kind of natural outburst refreshing really.


Imo there is a big difference between insulting a person's work and insulting a person themself. People can and do mess up colossally without being losers or monkeys.


That’s a theoretically admirable attitude if true (I don’t doubt you believe it, and maybe even do it successfully, but often how we react differs from how we think or say we’d react) but definitely not universal. A more common and probable outcome is people clamming up and becoming defensive, actively rejecting the criticism because of how it was delivered.

Though best case scenario, the people working on these features agree and can point their managers to the post as an example of growing discontent. I doubt it’ll have an effect, though. GitHub is now under the AI division at Microsoft.


I think it's a breath of fresh air. Don't want to be called out like this then stop fucking up.


I could try to explain that most jobs are way more nuanced than just 'failing and deserving to be called a monkey' or 'not failing.' Or, I could just call you names for not seeing that, you could call me names back, and we can keep doing this forever.


Your argument is lacking nuance, declaring that the criticism being levied here must be a simple binary.

The specific error they are criticizing is extremely egregious, akin to builder declaring a house without a roof complete. “failing and deserving to be called a monkey” is a criticism being levied against a 0/100 level mistake, not a mere minor mistake as you are claiming.

While it might be desirable to use less colorful language, it is frankly challenging to express the sheer level of grossly incompetent organizational ineptitude on display here in a reviewed and delivered product actively causing negative customer impact for literal years which is trivially fixed and yet has been ignored.

Customers of Github should be infuriated that Github gleefully foists such utterly defective software upon them. It is hard to get that across in dispassionate writing.


Thanks for the thoughtful response.

> Your argument is lacking nuance, declaring that the criticism being levied here must be a simple binary.

That isn't my argument. I am arguing against the idea that there is an "objective" threshold of failure where, once crossed, it becomes acceptable to call people names.

> Customers of Github should be infuriated that Github gleefully foists such utterly defective software upon them. It is hard to get that across in dispassionate writing.

See, while it has its bugs, I don't see a major problem with GitHub as a software product (setting aside the monopoly concerns). I encourage passionate discussion, but calling people names doesn't communicate passion; it communicates impatience. It suggests you don't have the patience to actually make a case for something you're supposedly passionate about, so you're choosing a shorter, more aggressive form instead.


I'm sure getting called a monkey will stop them from ever making a mistake again.


If this was true, teachers and trainers would have the easiest job in the world: just insulting their pupils would stop them from failing an exam, race or whatever again.


The comment is clearly sarcasm.


Not everyone is that robust. People get hurt over things like that. Not everyone is a wizard who does not give a fuck and does not need to care.

These are people for God's sake. Empathy!


Treating grown up people like little kids is a major problem. If that was a stressor which requires defensive actions such as this one, what are you good for in life?


    > These are people for God's sake. Empathy!

One man's empathy is another man's hatred.

From my perspective your take and actions in this thread is itself completely devoid of empathy.

The reason for colorful language breaking through professionalism is because there is real human emotion behind those words. Real pain and suffering, lost time in the life that will never be regained, an ever widening bald spot from the stress. That type of thing yearns to be expressed in a way that generic corpo speak is by design unable to communicate.

Your response to these emotions is to simply stick your head in the sand(aka refuse to read the blog post)? Worse yet, even without that context, you are here trying to convince those around you to also stick their heads in the sand?

To dream up scenarios where theoretical someones in a giant faceless corp might maybe possibly be offended? Instead of trying to listen and understand the person already in front of you who has actually been offended?

Again everything is a matter of perspective, but from mine your comments severely lack the empathy you supposedly call for.


Feeling empathy for their pathetic fragile existence doesn't mean you sympathise with said fragility.


Not being able to control your anger issues and name calling ppl as a public face of your org sounds pretty fragile to me.


I think this is called projection. Not everyone is angry when name calling someone.


Yeah the tone matters.


If people get hurt over that they need to take some courses on building confidence...


Right, those black people who get offended by being called that just need confidence, right? Those LGBT are so sensitive and can’t handle the colorful names we call them! Imagine that. This kind of comment shows how HN commenters can be so incredibly hateful while thinking they are righteous, which is the worst kind of people, it’s the atitude that leads to the most terrible policies and behavior ever seen on this planet.


Or you grow up and see these kinda things within the right context and brush off whats easy to brush off. Obvious racial slurs or discriminating insults towards a whole community is obviously something different. But you sound like the type of person to cancel a comedian over a joke.


People indeed need a thicker skin


Yeah, its rude to actual monkeys - they did nothing wrong!


If he had gone on a rant purely about the product - eg “GitHub actions is a garbage product that never works”, I think that wouldn’t have left such a bad taste in my mouth. Calling the developers all “losers” crosses a line.


Sure. If you feel the need to write "this is shitty code", fair enough, I'm fine with making allowances for that kind of language. But please leave it at that, instead of also insulting the people who wrote it. There are, unfortunately, plenty of ways for bad incentives to result in competent people creating bad products.


that's all well and good but not when it is peddled by you simultaneously in other facets: https://news.ycombinator.com/item?id=46065778


But not when it's against coc


As a corporate drone it's refreshing. Already planning to dedicate some of the holiday season to learning Zig and this latest move only makes it more enticing.


And if he cleaned it up, an even larger number of people would be calling it written with AI.

Shrug.

If he were berating a specific person, I might flag it. Berating Github and Microsoft as an organization? Nah.

Given that CEOs seem to now live in a Post Shame Reality(tm), I'll allow bringing some shame to the situation.


That means you would need to expose your proprietary code and you will be left with no moat.


All of the code seems to be MIT and AGPLv3. Where is this moat?


For any given good solution, there will be a core 20% of prospects with more money than patience and/or capacity who would rather you just do it for them. Their propensity to trust you to do so goes up if they understand how you're going about it.


It's interesting to see hacker news response time reaching almost 2 seconds for this post.


>The longer someone lives, the more potential value they can contribute to a society.

This is questionable. Highly populous countries have worse living conditions than moderately populous ones, currently.


> Highly populous countries have worse living conditions than moderately populous ones, currently.

https://ourworldindata.org/grapher/population-density-vs-pro...

There does not look to be a strong correlation between population density and income, at least on a log-log scale across countries. But I would guess that these numbers hide a trend for cities to be richer than rural areas (subsistence farming etc).


Highly populous countries were colonized and robbed off their resources until recently.


Better living conditions cost more money per family, which leads to less children.


Isn't it Output Power / Input Power which results Q > 1?


Injecting JS into the page or controlling it using extension APIs is not a secure way to control the browser.


I never mentioned injecting JS into the page, and besides injecting LLM generated code or generally remote code won't be approved by the Chrome Store https://developer.chrome.com/docs/extensions/develop/migrate....

Your claim is analogous to saying that Apple's app store is not secure. We had to go through stringent vetting and testing by Google to list in the Chrome Store. Any basis or reasoning you can provide for your claim?

Regardless, its a wild leap to claim a Chrome Store Chrome Extension is more insecure than this arbitrary binary?


Yeah, sorta feels like docker on a new instance is safer than connecting to actual browsers and injecting js code there… would love to skip cdp protocol though, it’s quite restrictive


Are you making a straw man argument? I am not injecting js code, we solved this problem in a secure way with minimal permissions taken by our Chrome Extension, which runs in safe and secure sandbox within the browser.

Perhaps we are talking past each other, your literally giving instructions to your users to connect to their actual browsers: https://docs.browser-use.com/customize/real-browser Where under the hood your launching Chrome with debugging mode but with the user's credentials and passwords. This browser is then controlled via CDP by a highly insecure browser-use binary running in a container. Your users are bound to get pwned with this setup! https://github.com/browser-use/browser-use/blob/70ae758a3bfa...


I think, in the end, it comes down to hope—how hopeful you are that your blogs will turn out to be beneficial one day.


It would be interesting to consider where humans would find fulfillment when what they have done every day for years is rendered worthless.


I like to think about what I did before I had to work. Young me would never worry about finding things to do and none of those things were economically valuable.


Some would pursue less lucrative passions. Some would spiral into lazy Sunday behavior every day.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: