498 |
How to Finetune GPT-Like Large Language Models on a Custom Dataset |
2023-05-25 |
339 |
LoRA from scratch: implementation for LLM finetuning |
2024-01-22 |
258 |
Takeaways from hundreds of LLM finetuning experiments with LoRA |
2023-10-13 |
118 |
Understanding, using, and finetuning Gemma |
2024-02-24 |
104 |
Finetuning LLMs on a Single GPU Using Gradient Accumulation |
2023-03-30 |
7 |
How YOU Can Help Make AI Accessible to Everyone |
2023-04-27 |
2 |
StatQuest: Word Embedding with PyTorch and Lightning |
2023-12-14 |
2 |
Scaling Large (Language) Models with PyTorch Lightning – Lightning AI |
2023-10-05 |
2 |
NeurIPS 2023 LLM Efficiency Challenge Starter Guide |
2023-08-11 |
2 |
The Battle of Language Models: Lit-LLaMA vs. GPT3.5 vs. Bloom Vs |
2023-05-02 |
1 |
Deploy a custom Llama 3 API in 15 lines of code |
2024-08-24 |
1 |
Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch |
2023-07-03 |
1 |
Accelerating Large Language Models with Mixed-Precision Techniques |
2023-05-11 |
1 |
Parameter-Efficient LLM Finetuning with Low-Rank Adaptation (LoRA) |
2023-04-26 |
19 |
Deploy dedicated DeepSeek 32B on L40 GPUs ($8/hour) |
2025-02-01 |
16 |
Lightning AI hub: Production AI in your cloud in minutes |
2025-02-05 |
5 |
I'm crazy: I deployed DeepSeek in my company VPC in <24 hours without Kubernetes |
2025-02-01 |
1 |
Basics of Graph Neural Networks |
2025-03-05 |