©RillNews
new
show
ask
jobs
submit
login
Accelerating Gemma 4: faster inference with multi-token prediction drafters
blog.google
204 points by
amrrs
3 hours ago
|
77 comments
add comment