52 |
Train faster static embedding models with sentence transformers |
2025-01-15 |
394 |
Open-R1: an open reproduction of DeepSeek-R1 |
2025-01-28 |
227 |
Kokoro WebGPU: Real-time text-to-speech 100% locally in the browser |
2025-02-07 |
49 |
Janus-Pro: Autoregressive framework unifying multimodal understanding&generation |
2025-01-27 |
39 |
DeepSeek-R1-Distill-Qwen-1.5B Surpasses GPT-4o in certain benchmarks |
2025-01-20 |
38 |
Fully autonomous AI agents should not be developed |
2025-02-07 |
33 |
The Ultra-Scale Playbook: Training LLMs on GPU Clusters |
2025-02-19 |