Company
Date Published
Author
Vikram Chatterji
Word count
1117
Language
English
Hacker News points
None

Summary

Galileo has launched GenAI Evaluation, a low-latency, cost-efficient method for evaluating generative AI models. The approach aims to reduce reliance on human-in-the-loop evaluations and costly LLM-based evaluations by returning results in milliseconds without compromising accuracy. The five breakthroughs of Galileo Luna are: outperforming popular evaluation techniques, eliminating the need for ground-truth test sets, reducing cost by up to 97% compared to GPT-3.5, achieving 18% higher accuracy than GPT-3.5 in detecting hallucinations, and enabling real-time evaluations with millisecond latency. The Luna Evaluation Foundation Models are now available to all Galileo customers at no additional cost and power evaluation tasks such as hallucination detection, RAG analytics, and security and privacy.