Am I missing it or is there no information about performance? Looking for a toke... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		hamdingers 52 days ago \| parent \| context \| favorite \| on: Asus Ascent GX10 Am I missing it or is there no information about performance? Looking for a tokens/sec

aseipp 52 days ago | [–]

Right now I get 59 tok/sec on GPT-OSS 120B using Unsloth's dynamic 4-bit quants, via llama.cpp https://news.ycombinator.com/item?id=45881049

simlevesque 52 days ago | [–]

He didn't give that info but the transcript linked at the end shows how much time was spent for each query.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact