24 Hacker News submissions by month with at least  points since the start of

24 submissions with 1 points or greater

HN Points HN Title (Links to original post) Submitted Date
48 Navigating the World of Large Language Models 2024-03-22
16 Is LMDeploy the Ultimate Solution? Why It Outshines VLLM, TRT-LLM, TGI, and MLC 2024-06-20
15 Benchmarking LLM Inference Back Ends: VLLM, LMDeploy, MLC-LLM, TensorRT-LLM, TGI 2024-07-05
5 A List of Top Open-Source Embedding Models 2024-10-30
4 Building RAG with Open-Source and Custom AI Models 2024-05-06
4 Solving ML Model Reproducibility: Lessons Learned from a Covid Hackathon 2022-04-25
3 From Ollama to OpenLLM: Running LLMs in the Cloud 2024-07-18
3 Stable Diffusion 3: Text Master, Prone Problems? 2024-06-18
3 A Guide to Open-Source Image Generation Models 2024-03-28
2 Exploring the World of Open-Source Text-to-Speech Models 2024-09-20
2 Serving LlamaIndex as Rest APIs 2024-06-03
2 Deploying Stable Video Diffusion with BentoSVD 2023-11-28
2 Building a Production-Ready LangChain Application with BentoML and OpenLLM 2023-10-22
2 Monitoring Metrics in BentoML with Prometheus and Grafana 2023-10-20
1 Top Open-Source Vision Language Models 2024-10-11
1 Tuning TensorRT-LLM for Optimal Serving 2024-09-20
1 Compound AI Systems 2024-08-24
1 Building a RAG App with BentoCloud and Milvus Lite 2024-06-14
1 Scaling AI Models Like You Mean It 2024-04-26
1 A Guide to ComfyUI Custom Nodes 2025-01-02
1 Secure and Private DeepSeek Deployment 2025-02-14
2 2024 State of AI Inference Infrastructure Survey Results 2025-02-26
2 The Complete Guide to DeepSeek Models: From V3 to R1 and Beyond 2025-03-07
2 Six Infrastructure Pitfalls Slowing Down Your AI Progress 2025-03-19