59 Hacker News submissions by month with at least  points since the start of

59 submissions with 1 points or greater

HN Points HN Title (Links to original post) Submitted Date
11 smolagents: A simple library to build AI agents 2025-01-02
10 Phi-4 weights have been released under MIT license 2025-01-08
3 Timeline of AI model releases in 2024 2025-01-01
2 Vdr-2B-multi-v1 a multilingual embedding model for visual document retrieval 2025-01-10
2 Show HN: We collected detailed annotations for text-to-image generation 2025-01-10
2 Hugging Face Smolagents 2025-01-05
2 Hugging Face advocates for Code Agents: agents that write tool calls as code 2025-01-02
2 ModernBERT: Encoder-only Transformer Model Strictly Improving on past work 2025-01-01
52 Train faster static embedding models with sentence transformers 2025-01-15
6 Kokoro-TTS 2025-01-13
2 Flex.1-Alpha – A new modded Flux model that can properly handle being fine tuned 2025-01-19
1 Show HN: An Agentic AI dataset for deepfake detection 2025-01-15
394 Open-R1: an open reproduction of DeepSeek-R1 2025-01-28
227 Kokoro WebGPU: Real-time text-to-speech 100% locally in the browser 2025-02-07
49 Janus-Pro: Autoregressive framework unifying multimodal understanding&generation 2025-01-27
39 DeepSeek-R1-Distill-Qwen-1.5B Surpasses GPT-4o in certain benchmarks 2025-01-20
38 Fully autonomous AI agents should not be developed 2025-02-07
20 Selene Mini: Open-sourced SOTA small language-model-as-a-judge 2025-01-29
19 The smallest VLM ever: 250M parameters 2025-01-23
17 DeepSeek R1 2025-01-20
12 Open-source DeepResearch – Freeing our search agents 2025-02-04
6 Microsoft Phi 4 with R1 Reasoning 2025-02-04
5 Open R1: Update #2 2025-02-11
5 Deepseek VL2 Small 2025-02-08
4 Qwen 2.5 Max 2025-01-28
4 Hugging Face open sources a web-browsing agent that uses VLMs 2025-01-24
4 Deepseek R1 Zero 2025-01-20
3 Fine-Tune Deepseek-R1 with a Synthetic Reasoning Dataset 2025-02-11
3 Hugging Face AI Agents Course 2025-02-10
3 HuggingFace open reproduction of R1 data and training pipeline 2025-01-27
3 DeepSeek-R1 on iPhone? (DeepSeek-R1-Distill-Qwen-1.5B-GGUF) 2025-01-21
2 OpenAI o3 just scored 99.8% on CodeForces using brute-force 2025-02-12
2 FinePersonas 2025-02-10
2 #9: Does AI Remember? The Role of Memory in Agentic Workflows 2025-02-03
2 Mistral-Small-24B-Base-2501 2025-01-30
2 Generate Images, Chat with PDF in WebGPU via DeepSeek Janus Pro 1B 2025-01-28
2 The state of open video generation models 2025-01-28
2 Bespoke-Stratos-17k: Open Reasoning Dataset by Distilling DeepSeek-R1 2025-01-27
2 DeepSeek-R1 WebGPU 2025-01-22
1 FP8 DeepSeek R1 Distilled LLMs for SGLang and VLLM 2025-01-29
33 The Ultra-Scale Playbook: Training LLMs on GPU Clusters 2025-02-19
17 Vector Search with DuckDB 2025-02-26
9 Show HN: A Transformer model that preserves logical equivalence 2025-03-02
6 DeepSeek-R1 without CCP censorship 2025-02-20
6 More Efficient Chain-of-Thought Reasoning Through Certainty Probing 2025-02-18
6 SigLIP 2: A better multilingual vision language encoder 2025-02-22
4 LLaSE-G1 A FOSS speech enhancement model 2025-03-08
4 Qwen/QwQ-32B released on Hugging Face 2025-03-06
4 Wan2.1-T2V-14B 2025-02-25
4 The Curse of Depth in Large Language Models 2025-02-13
3 GEN3C: 3D-Informed World-Consistent Video 2025-03-06
3 Microsoft Releases Phi-4-multimodal [pdf] 2025-02-26
3 WanX open weight sota 14B video model release 2025-02-25
3 Step-Audio-Chat: a 132B end-to-end speech-to-speech model 2025-02-17
2 FastRTC: The Real-Time Communication Library for Python 2025-02-25
2 Show HN: Roast Any Website with AI 2025-02-25
2 SWE-Lancer: Can LLMs Earn $1M from Real-World Freelance Software Engineering? 2025-02-18
2 Desklib AI Detector Ranks No 1 on Raid Benchmark for AI Detection 2025-02-17
2 Forget What You Know about LLMs Evaluations – LLMs Are Like a Chameleon 2025-02-13