Showing tag results for evaluation

Dec 10, 2025

Agent Onboarding Process for Agentic Systems: Maintain accuracy at scale

Vikesh,

Vidhya

A robust agent onboarding process is the backbone of scalable, reliable multi-agent AI systems—enforcing uniqueness, rigorous evaluation, and responsible AI practices at every step.

Sep 25, 2025

Taming Complexity: Intuitive Evaluation Framework for Agentic Chatbots in Business-Critical Environments

Karol,

Marc

This blog post introduces a comprehensive evaluation framework for enterprise chatbots powered by large language models (LLMs), specifically addressing the challenges of assessing Line of Business (LOB) agents in business-critical environments. The authors tackle the fundamental problem that traditional chatbot evaluation metrics fail to capture th...

Oct 11, 2024

Multimodal RAG with Vision: From Experimentation to Implementation

Pratima,

Maitreyi,

Oscar

This blog post delves into the experimentation journey of fine-tuning a multimodal RAG pipeline to best answer user queries that require both textual and image context. We ran our experiments by systematically testing various approaches, adjusting one configuration setting at a time and using clearly defined evaluation metrics to validate the perfo...

Jan 13, 2021

Evaluation Framework for Information Extraction

Omri Mendels

In this blog post we cover the process, requirements, and the design of an evaluation framework for NLP and Information Extraction. We cover the reasoning behind such a framework, and discuss its implementation with examples from a Named Entity Recognition evaluation point of view.