[Chart: 64 Hacker News submissions by month]

64 submissions with 50 points or greater

| HN Points | HN Title (links to original post) | Submitted Date |
| --- | --- | --- |
| 586 | Uncensor any LLM with abliteration | 2024-06-13 |
| 415 | Try Stable Diffusion's Img2Img Mode | 2022-08-29 |
| 323 | MonadGPT – What would have happened if ChatGPT was invented in the 17th century? | 2023-11-24 |
| 252 | LLM in a Flash: Efficient LLM Inference with Limited Memory | 2023-12-20 |
| 240 | Microsoft Phi-2 model changes licence to MIT | 2024-01-06 |
| 238 | Falcon 180B | 2023-09-06 |
| 229 | OpenLLaMA 13B Released | 2023-06-18 |
| 218 | T0* – Series of encoder-decoder models trained on a large set of different tasks | 2021-10-18 |
| 214 | Hugging Face Releases Agents | 2023-05-10 |
| 211 | A neural network to auto-complete your thoughts | 2019-09-17 |
| 200 | PaddleOCR: Lightweight, 80 Langauge OCR | 2021-09-09 |
| 197 | Space secrets leak disclosure | 2024-06-01 |
| 185 | BigCode Project Releases StarCoder: A 15B Code LLM | 2023-05-04 |
| 181 | Best 7B LLM on leaderboards made by an amateur following a medium tutorial | 2024-01-05 |
| 180 | AnimeGANv2: Convert Face Portraits into Anime | 2021-11-09 |
| 179 | Stability.ai sent a take down request to Runway ML's SD v1.5 citing IP Leak | 2022-10-20 |
| 175 | We raised $100M for open and collaborative machine learning | 2022-05-09 |
| 168 | Llama 3 8B is almost as good as Wizard 2 8x22B | 2024-04-19 |
| 168 | SantaCoder: A new 1.1B code model for generation and infilling | 2022-12-22 |
| 167 | Nvidia releases NVLM 1.0 72B open weight model | 2024-10-02 |
| 165 | StackLlama: A hands-on guide to train LlaMa with RLHF | 2023-04-06 |
| 163 | Explaining the SDXL Latent Space | 2024-02-05 |
| 160 | BLOOM: The largest open multilingual language model | 2022-07-12 |
| 152 | Hugging Face and Google partner for AI collaboration | 2024-01-25 |
| 137 | Wordalle – Guess the prompt used to generate a set of images from DalleMini | 2022-07-01 |
| 131 | Mistral-8x7B-Chat | 2023-12-10 |
| 131 | A CC-By Open-Source TTS Model with Voice Cloning | 2024-11-04 |
| 127 | FineWeb: Decanting the web for the finest text data at scale | 2024-06-02 |
| 117 | The age of machine learning as code has arrived | 2021-10-22 |
| 115 | Yi-34B-Chat | 2023-11-24 |
| 107 | GPT-3.5 and Wolfram Alpha via LangChain | 2023-01-18 |
| 105 | The Falcon has landed in the Hugging Face ecosystem | 2023-06-05 |
| 103 | HuggingChat: Chat with Open Source Models | 2024-02-21 |
| 102 | Hugging Face and AWS partner to make AI more accessible | 2023-02-21 |
| 101 | HuggingFace Training Cluster as a Service | 2023-09-05 |
| 95 | More than 80 AI models from Qualcomm | 2024-02-28 |
| 95 | Segmind Stable Diffusion – A smaller version of Stable Diffusion XL | 2023-10-25 |
| 94 | LLaMA-Pro-8B | 2024-01-06 |
| 93 | HuggingChat | 2023-04-25 |
| 88 | Yarn-Mistral-7B-128k | 2023-11-11 |
| 82 | Apple/OpenELM: Efficient Open-Source Family Language Models | 2024-04-24 |
| 78 | Sparse LLM Inference on CPU: 75% fewer parameters | 2023-10-19 |
| 77 | Pokemon GAN | 2022-02-14 |
| 75 | YouTube-Commons: Audio transcripts of 2,063,066 YouTube videos, CC-By license | 2024-04-18 |
| 73 | Switch Transformers C – 2048 experts (1.6T params for 3.1 TB) (2022) | 2023-11-20 |
| 69 | Few-Shot Learning in Practice: GPT-Neo & 'HuggingFace' Accelerated Inference API | 2021-06-04 |
| 66 | Multimodal Neurons in Pretrained Text-Only Transformers | 2023-08-04 |
| 66 | Show HN: Simply Reading Analog Gauges – GPT4, CogVLM Can't | 2024-01-22 |
| 61 | Find images from movies based on what you draw | 2021-10-13 |
| 61 | HuggingChat – ChatGPT alternative with open source models | 2023-12-15 |
| 58 | MSFT's WizardLM2 models have been taken down | 2024-04-16 |
| 58 | OpenLLaMA 7B Training Completed to 1T Tokens | 2023-06-07 |
| 57 | Phi-2 | 2023-12-13 |
| 56 | Dolphin-2_6-Phi-2 | 2023-12-24 |
| 55 | Alibaba releases 72B LLM with 32k context length | 2023-11-30 |
| 54 | LiteLlama-460M-1T has 460M parameters trained with 1T tokens | 2024-01-07 |
| 54 | Large Language Models: A New Moore's Law? | 2021-10-27 |
| 52 | Fine-Tuning LLMs to 1.58bit | 2024-09-18 |
| 51 | LLaMA 3 70B Llamafiles | 2024-04-19 |
| 425 | Llama-3.3-70B-Instruct | 2024-12-06 |
| 348 | A Replacement for BERT | 2024-12-19 |
| 52 | Train faster static embedding models with sentence transformers | 2025-01-15 |
| 394 | Open-R1: an open reproduction of DeepSeek-R1 | 2025-01-28 |
| 227 | Kokoro WebGPU: Real-time text-to-speech 100% locally in the browser | 2025-02-07 |