HN Points | HN Title | Submitted Date |
---|---|---|
14 | Show HN: Auto-generate hard evaluation data for LLMs | 2024-10-02 |
3 | Show HN: Talc (S23) Question and Answer Generation for AI Assistants | 2024-03-18 |
1 | Show HN: Talc – Custom benchmarking for LLM apps | 2023-11-01 |
3 | LLMs are still bad at handling dates | 2023-11-10 |
2 | OpenAI gets a C+ in high school English | 2023-11-17 |
2 | How do Google's code tips fail? | 2023-10-23 |