How the AI works
CauseFlow’s investigation engine is a single orchestrator agent backed by modern large language models. The orchestrator drives the full investigation end-to-end: it reads logs, inspects infrastructure, reviews code changes, queries databases, synthesizes findings, and proposes remediations — all through iterative tool-use against your connected integrations. Internally, heavier reasoning tasks (triage, investigation synthesis, code fix generation, quality verification) run on a higher-capability model; high-volume scanning tasks (log pattern matching, metric sweeps, correlation) run on a faster model tuned for throughput. Model routing is deterministic per task type — no random model selection occurs during an investigation.What agents access
During an investigation, AI agents access only the data sources you have explicitly connected. Access is:- Read-only — agents query your data but never write, delete, or modify anything in your infrastructure
- Least-privilege — each agent receives temporary credentials scoped to the minimum permissions it needs (for example, a log analysis agent receives only CloudWatch log query permissions)
- Credential-scoped — temporary AWS credentials expire after 15 minutes and are revoked at the end of every investigation
| Data source | What is queried | Required integration |
|---|---|---|
| Logs | CloudWatch log groups relevant to the incident | AWS (cross-account role) |
| Metrics | CloudWatch metrics for the affected services | AWS (cross-account role) |
| Infrastructure state | ECS task definitions, EC2 instance health, ALB status | AWS (cross-account role) |
| Code changes | Recent commits, pull requests, file contents for affected repos | GitHub App installation |
| Database health | Query performance, slow queries, active connections | Relay (private network agent) |
| Issue trackers | Linked tickets (Jira, Linear, Shortcut) | OAuth connection via Dashboard |
| APM and error tracking | Error rates, traces (Datadog, Sentry, New Relic) | OAuth connection via Dashboard |
What decisions require a human
Every remediation action requires explicit human approval before it executes. This is the default for all tenants, with no exceptions. When an investigation is complete, CauseFlow proposes a remediation plan. That plan appears in the dashboard as a pending approval. A user with theadmin role must review and approve it. Until approval is granted, no action is taken.
Auto-remediation can be enabled by an admin in Settings > Remediation, which allows specific action types to execute without manual approval. This setting is opt-in and disabled by default.
Additional human decision points:
- Quota confirmation — CauseFlow checks your plan quota before starting an investigation. If quota is exhausted, no investigation starts.
- Known solution review — when a known solution is applied, the incident moves to
awaiting_approvalso a human can verify the match before any action runs. - Abort — any in-progress investigation can be stopped by an admin at any time from the incident detail page.
How we verify quality
CauseFlow applies a verification step after each investigation synthesis to challenge the AI’s own conclusion. This step independently examines the evidence, searches for contradictions, and proposes alternative explanations. The result is stored alongside the root cause report so you can see what was checked. Quality signals used across investigations:- Confidence score — each root cause conclusion includes a 0–1 confidence score. Conclusions below a threshold trigger a warning in the dashboard.
- Feedback loop — after each investigation you can mark the root cause as accurate, inaccurate, or partial. Feedback is incorporated into future pattern matching.
- Pattern confirmation — when positive feedback repeatedly confirms a pattern, that pattern earns a higher weight in future known-solution matching.
Observability
Every investigation produces a per-run trace visible in the CauseFlow dashboard. The trace shows:- Which AI steps ran and in what order
- What data each step queried
- How long each step took
- The evidence each step produced
- The final synthesis and confidence score
admin users from the incident detail page.
CauseFlow does not share your incident data or investigation traces with third parties. Trace data stays within your tenant and is subject to your data residency configuration.
How it works
See the full investigation lifecycle from alert to resolution.
Skills
Customize which AI tools run for your specific incident types.
Security overview
Tenant isolation, encryption, audit trail, and compliance readiness.
Memory and chat
How CauseFlow stores and uses long-term knowledge about your systems.