Content Deep Dive
FP8: Efficient model inference with 8-bit floating point numbers
Company
Baseten
Date Published
March 7, 2024
Authors
Pankaj Gupta, Philip Kiely
Word count
1021
Language
English
Hacker News points
2
URL
www.baseten.co/blog/fp8-efficient-model-inference-with-8-bit-floating-point-numbers