Rillnews logo, news as rill©RillNews
  • new
  • show
  • ask
  • jobs
  • submit
login
DSpark: Speculative decoding accelerates LLM inference [pdf]github.com
559 points by aurenvale 6 hours ago | 217 comments
For contacts: 1 (647) 800-3333