Early access · 2026

The operational layer for AI agents.

One dashboard for AI cost and behavior across every coding tool your team uses and every AI feature in your product. Real-time anomaly alerts. Install in under an hour.

agentwatch.io / projects

Session-volume anomaly · meeting-notes

5 Claude Code agents, 30s dispatch loop, running 10d unattended · 6,352 cycles · projected $400+

Critical
Sessions (7d)
34,980+312%
Cost (7d)
$558+289%
Active agents
11+5
Projects · last 7 days
Sorted by sessions
meeting-notes
31,762
$412.08
orcha-web
1,248
$58.41
platform-api
942
$44.17
ml-eval
614
$28.92
docs-site
318
$11.04
sandbox
96
$3.21

Most AI-native companies have no idea what their AI agents are actually doing, or costing.

Claude Code and Cursor run on every engineer’s laptop. AI features run inside your product. No single dashboard sees both. You find out about runaway pipelines and spend spikes from the next invoice, days or weeks too late.

Questions every VP of Engineering, Head of Platform, and VP of Product needs to answer this week, and can’t:

What does our AI cost per customer?
Which engineer's session burned through Opus credits last week?
Did any pipeline run unattended for the last 10 days?

Two surfaces. One dashboard.

Your AI runs in two places. AgentWatch covers both, with the same data model and the same view.

Your Local

Every coding agent on every developer machine.

  • Cross-tool view: Cursor, Claude Code, Codex, Cline, Windsurf
  • Per-developer, per-project, per-tool cost attribution
  • Real-time alerts on runaway loops and abandoned sessions
  • Zero instrumentation. Install in under an hour.

Your Apps

Every AI feature in your product.

  • Per-feature, per-user, per-customer cost attribution
  • Behavioral anomaly detection across production agents
  • One-line SDK in Python and JavaScript
  • Same dashboard as your internal AI usage

One dashboard. Both surfaces. Same data model.

AgentWatch is not LLM observability. We don’t trace every model call or evaluate output quality. We answer cost, attribution, and behavioral anomaly questions across both your internal AI tools and your product’s AI. Most teams need both kinds of tools.

The shift, in numbers

AI tooling went from a line item to a budget category in 18 months. Most teams have no instrumentation for it.

70%

of AI-native engineering teams now run 3+ AI models in production.

23% run 6+. Most have no per-model, per-team, or per-feature cost attribution today.

Source: Datadog, State of AI Engineering 2026

2 out of 3

of platform engineering and SRE leaders will own production AI systems by 2028.

Most aren't equipped to manage the cost and behavior side yet.

Source: Gartner, Innovation Insight: LLM Observability, July 2025

Three steps from install to dashboard.

Run one command on a developer machine. Copy your API key. Open the dashboard.

  1. 1

    Install the collector

    One command. Reads session files from Claude Code, Cursor, Codex, Cline, Windsurf. No code changes.

  2. 2

    Optional: instrument product AI

    One-line SDK per AI call. Python and JavaScript. Same data model as the collector.

  3. 3

    Open the dashboard

    Full AI cost and behavior, internal and product, in one view. Set alerts. Export to Slack.

~/your-project
$npx @agentwatch/cli init
Detected: Claude Code, Cursor, Codex
Reading session files (1,247 found)
Collector running in background
App created
Copy your API key from the panel on the right.

App Created

API key shown once. Copy it now.

aw_live_dd0276952bb3c71b7225ab30a2a4457495113438e6d123741cf12cc0ba42a4e1

Set as AGENTWATCH_API_KEY in your environment.

From install to dashboard in 30 minutes.

No infrastructure. No procurement. No instrumentation work. Run a command on a developer machine, open the dashboard, see your team’s AI usage.

A real user story

Caught a 10-day runaway pipeline in minutes. Not weeks.

Five Claude Code agents on a 30-second loop. Spawned by a meeting-notes pipeline our team built. Ran for 10 days unattended. Generated 31,762 session files across 6,352 dispatch cycles.

AgentWatch fired a Slack alert the moment the anomaly crossed threshold. From the alert, we drilled into the session timeline, identified the runaway agent structure, and set a budget cap to prevent recurrence. Detection time: minutes. Without the collector, the bill would have shown up two weeks later as a $400 mystery.

AgentWatch HQ
eng-alerts
AW
AgentWatchApp12:47 PM

Session-volume anomaly detected

Project meeting-notes is running 5 Claude Code agents on a 30-second loop. Unattended for 10 days. 6,352 dispatch cycles. Projected cost $412.

Agents

5

Sessions

31,762

Projected

$412

🚨3👀2🙏12 replies

The alert that fired in our team’s #eng-alerts channel.

The detection layer

Named anomalies. Named fixes.

AgentWatch watches for a small set of behaviors that go wrong with agents, and gives you a clear move for each one.

Session-volume anomaly

A project's session count diverges from its 7-day baseline. The pattern that catches runaway pipelines.

Loop detection

Same prompt or tool-call signature repeats beyond a threshold within a session. Catches agents stuck on a sub-task.

Spend velocity

Cost-per-hour ramping faster than the project's baseline. Catches expensive-model misuse before the invoice.

Abandoned session

Long-running session with no human input for hours. Catches the 'left it running over the weekend' case.

Course correction, end to end.

What we did when the meeting-notes anomaly fired.

1

Detect

Slack alert fires in #eng-alerts: session-volume anomaly · meeting-notes.

2

Diagnose

Open the session timeline from the alert. Five agents on a 30-second loop.

3

Contain

Pause the project and set a $50/day budget cap from the alert itself.

4

Prevent

Promote the diagnosis to a rule: any project crossing 100 sessions/hour auto-pauses.

Get on the early access list.

We’re working with a small number of design partners in 2026. If your team is running multiple AI tools and feels the cost and visibility gap, we’d like to talk.

We reply within 48 hours. No newsletter, no marketing automation. Just a real reply from one of us.