N
Hacker Next
new
show
ask
jobs
submit
login
Accelerating Gemma 4: faster inference with multi-token prediction drafters
blog.google
687 points by
amrrs
24 days ago
|
330 comments
add comment