4 |
How are people training this LLMs? Dont they need lot of money? |
2024-01-19 |
53 |
Fireworks: Function Calling Model and API |
2023-12-21 |
3 |
Serving Open Source Models 4x faster than vLLM by quantizing with ~no tradeoffs |
2024-01-10 |
3 |
FireAttention – Serving Mixtral and open-source MoE models at 4x speed vs. vLLM |
2024-01-09 |
3 |
Multi-Query Attention Is All You Need |
2023-07-13 |
2 |
Accelerating Code Completion with Fireworks Fast LLM Inference |
2023-10-11 |
1 |
Fireworks.ai: Language Model Serving with Custom LoRA Fine-Tuned Models |
2023-08-18 |