Accelerating Large Language Model Decoding with Speculative Sampling
Paper • 2302.01318 • Published • 4
exploring speculative sampling with autoregressive model like: https://proceedings.mlr.press/v139/song21a.html and https://proceedings.mlr.press/v119/