237 |
Is AI the next crypto? Insights from HN comments |
2023-11-08 |
234 |
Mistral 7B Fine-Tune Optimized |
2023-12-20 |
217 |
Using reinforcement learning and $4.80 of GPU time to find the best HN post |
2024-10-28 |
13 |
OpenPipe Mixture of Agents: Outperform GPT-4 at 1/25th the Cost |
2024-06-20 |
3 |
Fine-Tuning Best Practices Series Introduction and Chapter 1: Training Data |
2024-08-29 |
3 |
What we've learned in 3 days of Llama 3 |
2024-04-22 |
2 |
LLM Fine-Tuning Best Practices: Base Models Proprietary/Open Source, Large/Small |
2024-08-28 |
2 |
Fine-Tuning for Production Apps |
2024-09-02 |
1 |
DPO fine-tuning outperforms SFT |
2024-10-02 |
1 |
Mixtral Curious? Comparing Mistral 7B and Mixtral for fine-tuning |
2024-02-29 |
1 |
S-LoRA: Serving Thousands of Models from One GPU for Fun and Profit |
2024-01-18 |
4 |
Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results |
2024-12-30 |