Results for "evaluation"

7 tools
Tool
Category
Pricing
Free?
Updated
LangSmith
Developer platform for debugging and monitoring LangChain-based AI agent applications
Best for: LangChain
AI Agents Freemium Mar 2026
Braintrust
Enterprise AI evaluation platform for logging, testing, and improving agent quality
Best for: evaluation
AI Agents Freemium Mar 2026
Galileo AI
AI quality management platform for automated LLM evaluation and hallucination detection
Best for: evaluation
AI Agents From $1/mo Mar 2026
Arize Phoenix
Open-source AI observability tool with interactive tracing and RAG evaluation
Best for: observability
AI Agents Freemium Mar 2026
Humanloop
Collaborative platform for prompt management, evaluation, and LLM fine-tuning
Best for: prompt-management
AI Agents Freemium Mar 2026
Weights & Biases Weave
W&B toolkit for tracing, evaluating, and improving AI agent applications
Best for: W&B
AI Agents Freemium Mar 2026
Langfuse
Open-source LLM observability platform for tracing, debugging, and monitoring AI agents
Best for: observability
AI Agents Freemium Mar 2026
Compare
Select 2 tools to compare
Compare →