I've had the same suspicion for various providers - if I had time and motivation I would put together a private benchmark that runs hourly and chart performance over time. If anyone wants to do that I'll upvote your Show HN :)
Sigh. Lines of code is not execution man. Having functional apps is not execution. A full stack, AWS-deployed, multi-stage scaleable microservices wonder is not execution!
LLMs make it a lot easier to build MVPs, but the hard work of VALIDATING problems and their solutions, which IMO was always >80% of the work for a successful founder, is harder than ever. With AI we now get 100 almost-useful solutions for every real problem.
That website tries too hard to write clever marketing copy and does a bad job describing what actually is.
Better description: Pinokio is a free, open-source "AI browser" that simplifies installing, running, and managing complex, open-source AI applications and creative tools (like Stable Diffusion, ComfyUI) with one-click scripts, removing the need for coding or complex command-line setup.
I think in this case browser is meant as a place to browse, e.g. the Google Play store is an app browser. I don't hear it used that way often anymore, but it at least sounds familiar.
And the example given was specific to OpenAI models, yet the title is a blanket statement.
I agree with the author that GPT-5 models are much more fixated on solving exactly the problem given and not as good at taking a step back and thinking about the big picture. The author also needs to take a step back and realize other providers still do this just fine.
Ah you're right, scrolled past that - the most salient contrast in the chart is still just GPT-5 vs GPT-4, and it feels easy to contrive such results by pinning one model's response as "ideal" and making that a benchmark for everything else.
There are more knobs to turn when you have an actual library, and you become a lot less fungible than a random collection of TW classes.
HeroUI faces the same problem, and now their React Native library includes an optional (paid) conpiler solution that makes it faster.
MUI has the same problem but besides templates they have their MUI X data components which aren't limited in complexity to what can be ergonomically copy and pasted to a clipboard.
I expect a lot of business disruption because of AI. Agree it's not the same as employee replacement, but it adds to the sort of fog of war around what effect AI is really having.
reply