54 Hacker News submissions by month with at least  points since the start of

54 submissions with 1 points or greater

HN Points HN Title (Links to original post) Submitted Date
252 How DALL-E 2 Works 2022-04-19
159 Differentiable Programming – A Simple Introduction 2022-04-12
142 How Imagen Works 2022-06-23
129 LeMUR: LLMs for Audio and Speech 2023-07-27
111 Build Your Own Imagen Text-to-Image Model 2022-08-17
108 The Full Story of Large Language Models and RLHF 2023-05-03
98 Introduction to Diffusion Models for Machine Learning 2022-05-12
95 How RLHF Preference Model Tuning Works (and How Things May Go Wrong) 2023-08-09
66 Using JAX in 2022 2022-02-15
53 Image Generation with Electrostatics 2022-11-30
49 How physics advanced generative AI 2023-04-19
45 What is residual vector quantization? 2023-09-04
16 A Beginner's Guide to TorchStudio, the PyTorch IDE 2022-03-28
11 Universal-1: Robust and accurate multilingual speech-to-text 2024-04-03
11 MediaPipe for Dummies 2022-05-02
8 How to run Stable Diffusion locally 2022-08-24
8 Decoding Strategies – How LLMs Choose the Next Word 2024-08-21
7 Recent Developments in Generative AI for Audio 2023-06-27
7 Emergent Abilities of Large Language Models 2023-03-07
7 Variational Autoencoders for Dummies 2022-01-03
6 What AI Music Generators Can Do and How They Do It 2023-09-24
6 Kaldi Speech Recognition – A Simple Tutorial 2022-01-20
5 AlphaTensor explained – Motivation, method, and assessment 2022-11-22
5 Transcribe audio to text with Cloudflare Workers and AssemblyAI 2023-08-03
5 Conformer-2 AI model for speech recognition 2023-07-20
5 How to evaluate Speech Recognition AI models 2023-06-16
4 How ChatGPT actually works 2023-04-27
4 Automatically determine video sections with AI using Python 2023-11-07
4 Stable Diffusion faster in Keras thanks to XLA 2022-11-30
3 Complete guide to modern generative AI image models 2023-05-10
3 Introduction to Generative AI 2023-05-02
3 AI trends in 2023: Graph Neural Networks 2023-03-29
3 Conformer-1: a robust speech recognition model 2023-03-15
3 Why You Should (or Shouldn't) be Using Google's JAX in 2023 2023-02-11
3 Stable Diffusion 1 vs. 2 – What you need to know 2022-12-06
3 OpenAI Whisper Benchmarks 2022-09-27
3 MinImagen – Build Your Own Imagen Text-to-Image Model 2022-09-08
3 Variational Autoencoders Simply Explained 2022-04-06
2 AssemblyAI (YC S17) raises $50M Series C to build superhuman Speech AI models 2023-12-05
2 RLHF vs. RLAIF for language model alignment 2023-08-22
2 Reinforcement Learning from AI Feedback 2023-08-01
2 Five Things to know about Large Language Models 2023-05-23
2 Introduction to LLMs for Generative AI 2023-05-17
2 Unsupervised Machine Learning for Beginners 2023-02-19
2 How to Build an Audio Intelligence Dashboard 2022-09-21
1 How Microsoft's New Large Vision Model "Florence-2" Works 2024-07-15
1 AssemblyAI announces lower latency, lower cost, more possibilities 2024-01-11
1 AI for Universal Audio Understanding: Qwen-Audio Explained 2023-12-14
1 Retrieval Augmented Generation (Rag) on Audio Data with LangChain 2023-09-26
1 Free Speech-to-Text APIs, AI Models, and Open Source Engines 2022-12-11
1 Getting Started with Hugging Face's Gradio 2022-10-05
1 JavaScript Text-to-Speech – The Easy Way 2022-04-05
3 We Built a Scalable AI Lakehouse at AssemblyAI 2024-11-22
3 Golden Gemini – A new approach in Speech AI 2025-02-04