DeepEval logo

DeepEval

Verified Open Source

Open-source LLM evaluation framework for CI/CD pipelines

DeepEval is an open-source evaluation framework for testing and monitoring LLM applications. It provides 14+ evaluation metrics, integrates with pytest, and enables continuous LLM quality testing in CI/CD workflows.

1 views 15.9k stars 1.5k forks 1 post Share LinkedIn

Product Overview

Use Cases

  • LLM Unit Testing
  • CI/CD for AI
  • RAG Quality Testing
  • Hallucination Detection

Ideal For

AI EngineersQA TeamsMLOps Engineers

Architecture Fit

Enterprise ReadySelf HostedCloud NativeAPI FirstMulti-Agent CompatibleKubernetes SupportOpen Source

Technical Details

Deployment Model
self-hosted

Add Reference or Discussion Note

You can leave a discussion note on this product page. The product owner adds new reference links.

Loading sign-in state…

Community Feedback

Loading…

Login to leave feedback on this product.

More tools in Evaluation

View all →