Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
from
login
From Software Engineer to AI Environment Architect
(
infini-ai-lab.github.io
)
1 point
by
lovecoding_
55 days ago
|
past
SEQUOIA: Exact Llama2-70B on an RTX4090 with half-second per-token latency
(
infini-ai-lab.github.io
)
131 points
by
zinccat
on May 5, 2024
|
past
|
61 comments
Sequoia: Speculative decoding boosting LLM inference by 8-10x
(
infini-ai-lab.github.io
)
3 points
by
fgfm
on March 14, 2024
|
past
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: