
Completely agree in principle; I'd expect this when minimizing entropy over any text, incl. code. However, evals across a variety of domains show that LLMs can reach (and even surpass) expert performance [1].

[1]: https://arxiv.org/abs/2508.17669


