PoCAT Documentation Get Started

Common Issues & Solutions

Symptom-based incident matrix and response flow.

Last updated: 2026-05-27 Section: Troubleshooting

Standardizing symptom–cause–action patterns for recurring incidents greatly reduces MTTR.

SymptomLikely causeImmediate actionPermanent fix
404 / route miss spikePriority conflictRollback recent rulesNormalize priorities
401 / 403 spikeToken expiry, NTP driftFix auth and time syncMonitor renewal
Timeouts risingDownstream latencyCB and queue bufferingCapacity and pool tuning
Retry stormProlonged outageLower retry limitsRedesign backoff

Incident response flow

  1. Record blast radius, start time, and related deploy within 5 minutes.
  2. Check error rate, P95, and route miss on dashboards.
  3. Apply temporary measures (rollback, CB, traffic block).
  4. Reconstruct the request chain via Correlation ID to confirm root cause.
  5. File RCA notes and prevention tickets.