Evaluating Multi-Call Chains & Product Update

Company

Context.ai

Date Published

April 30, 2024

Author

Henry Scott-Green

Word count

461

Language

English

Hacker News points

None

URL

blog.context.ai/evaluating-multi-call-chains-product-update-april-2024

Summary

We're launching the ecosystem's best support for evaluating multi-call chains, allowing users to evaluate multi-stage workflows with many calls to LLMs and functions, both end-to-end and across any stage of the chain. This feature is fully LangSmith SDK compatible, making it easy to get started. Additionally, we've added test case tagging, JSON schema validation evaluators, comparison diff view, and more UX improvements, including tracing, custom evaluator creation flow, and visual enhancements.

Evaluating Multi-Call Chains & Product Update | April 2024

Summary