Content Deep Dive
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
Company
Together AI
Date Published
March 12, 2024
Authors
Zhuoming Chen, Avner May, Ruslan Svirschevski, Yuhsun Huang, Max Ryabinin, Zhihao Jia, Beidi Chen
Word count
616
Language
English
Hacker News points
None
URL
www.together.ai/blog/sequoia
Summary
No summary generated yet.