AI & Agent Engineering Platform

Build AI that works—faster. One place for development, observability, and evaluation.

Get Started Self-Host OSS

Powering the world’s leading AI teams

1 Trillion

spans per month

50 Million

evals per month

5 Million

downloads per month

One platform.

Close the loop between AI development and production.

Integrate development and production to enable a data-driven iteration cycle—real production data powers better development, and production observability aligns with trusted evaluations.

Arize AX: Observability built for enterprise.

AX gives your organization the power to manage and improve AI offerings at scale.

Explore Arize AI Observability for:

Development tools to build high-quality agents and AI apps

Prompt optimization

Make agents self-improving with automatic optimization using evaluations and annotations

View Docs

Replay in Playground

Replay, debug, and perfect your prompts with a playground designed for development

View Docs

Prompt Serving and Management

Manage prompts, serve optimizations fast, and empower everyone to changes

View Docs

Evaluation that powers reliable, production-ready AI applications and agents

CI/CD Experiments

Detect prompt and agent regressions early with evaluation-driven CI/CD

View Docs

LLM as a Judge

Power eval-driven development by automatically evaluating prompts and agent actions at scale with LLM-as-a-Judge

View Docs

Human Annotation and Queues

Manage labeling queues, production annotations, and golden dataset creation in one place

View Docs

Observability to debug, trace, and improve your AI agents and applications

Open Standard Tracing

Trace agents and frameworks with speed, flexibility, and simplicity — powered by OTEL

View Docs

Online Evals

Catch problems instantly with AI evaluating AI

View Docs

Monitoring and Dashboards

Monitor AI in real time with the world’s most advanced analytical platform

View Docs

Building & Evaluating AI Agents.

Continue your journey into AI Specialization with advanced learning hubs.

Built on open source & open standards.

As AI engineers, we believe in total control and transparency.
Just the tools you need to do your job, interoperable with the rest of your stack.

No black box eval models.

From evaluation libraries to eval models, it’s all open-source for you to access, assess, and apply as you see fit.

See the evals library

No proprietary frameworks.

Built on top of OpenTelemetry, Arize’s LLM observability is agnostic of vendor, framework, and language—granting you flexibility in an evolving generative landscape.

OpenInference conventions

No data lock-in.

Standard data file formats enable unparalleled interoperability and ease of integration with other tools and systems, so you completely control your data.

Arize Phoenix OSS