HN Points | HN Title (Links to submission) | Submitted Date |
---|---|---|
23 | Show HN: Automated red teaming for your LLM app | 2024-06-13 |
16 | How to benchmark Llama2 Uncensored vs. GPT-3.5 on your own inputs | 2023-08-10 |
2 | Automated jailbreaking techniques with DALL-E | 2024-07-01 |
2 | Benchmark Command R vs. GPT/Claude on your own data | 2024-04-09 |
1 | Iterate on LLMs Faster | 2024-05-28 |
1 | DBRX vs. Mixtral vs. GPT: create your own benchmark | 2024-03-31 |
1 | How to benchmark Gemini vs. GPT with your own data | 2023-12-15 |
1 | Benchmark Llama 2 vs. GPT on your own data | 2023-07-24 |