Showing tag results for evaluation

Sep 25, 2025
Post likes count: 4

Taming Complexity: Intuitive Evaluation Framework for Agentic Chatbots in Business-Critical Environments

Karol Żak, Marc Gomez

This blog post introduces a comprehensive evaluation framework for enterprise chatbots powered by large language models (LLMs), specifically addressing the challenges of assessing Line of Business (LOB) agents in business-critical environments. The authors tackle the fundamental problem that traditional chatbot evaluation metrics fail to capture th...

CSE, Machine Learning, Frameworks
Oct 11, 2024
Post likes count: 8

Multimodal RAG with Vision: From Experimentation to Implementation

Pratima Sayyaparaju, Maitreyi Nair, Oscar Fimbres

This blog post delves into the experimentation journey of fine-tuning a multimodal RAG pipeline to best answer user queries that require both textual and image context. We ran our experiments by systematically testing various approaches, adjusting one configuration setting at a time and using clearly defined evaluation metrics to validate the perfo...

CSE, ISE
Jan 13, 2021
Post likes count: 0

Evaluation Framework for Information Extraction

Omri Mendels

In this blog post, we cover the process, requirements, and design of an evaluation framework for NLP and Information Extraction. We explain the reasoning behind such a framework and discuss its implementation, with examples drawn from Named Entity Recognition evaluation.

CSE, Machine Learning