Detailed Workflow: Incident to Resolution

1. Detection (L5→L2)

Router sends syslog
→ Team Leader receives

2. Triage (L2)

Mistral-7B classifies
→ "routing_protocol"

3. Delegation (L2→L3)

A2A message to
Stability Agent inbox

4. Tool Request (L3→L4)

Agent calls MCP
`ospf_parser` tool

5. Device Query (L4→L5)

MCP SSH to router
Get OSPF data

6. Parse & Return (L4→L3)

TextFSM parses CLI
Returns JSON to agent

7. Analysis (L3)

Qwen-7B analyzes
Proposes fix

8. Response (L3→L2)

A2A response to
Team Leader

9. Approval Queue (L2→L1)

ACP creates request
UI displays card

10. Human Review (L1)

Operator reviews
Clicks [APPROVE]

11. Apply Fix (L4→L5)

MCP config_apply
Execute commands

12. Validate (L5)

OSPF neighbor UP
Incident resolved

Timing Breakdown

Detection:

<1s

Triage (LLM):

4s

Analysis (LLM):

8s

MCP Tools:

3s

AI Analysis:

~15s

Key Features

  • Idempotency via Redis
  • 24hr cache TTL
  • Structured responses
  • Audit trail logging
  • Graceful fallback