Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
What's this blog post about?
Company
Together AI
Date published
March 12, 2024
Author(s)
Zhuoming Chen, Avner May, Ruslan Svirschevski, Yuhsun Huang, Max Ryabinin, Zhihao Jia, Beidi Chen
Word count
616
Language
English
Hacker News points
None found.