← Back to all heuristics

Blame the Process, Not People

Fix the system.

LearningReliability

Heuristic

Focus on system fixes, not individual blame.

Hook

Blame hides root causes.

Problem

When incidents are blamed on individuals, people stop reporting issues and true root causes remain. The system does not improve.

Why it matters

Blameless learning creates psychological safety and drives real system improvements. The same mistakes are less likely to repeat.

Signals you are here

  • People are hesitant to report mistakes
  • Postmortems focus on who, not why
  • Incidents repeat in similar ways
  • Teams avoid experimentation due to fear

Anti-patterns

  • Public shaming or naming during incidents
  • Skipping postmortems after failures
  • Focusing on policy violations instead of system gaps
  • Treating incidents as personal failures

Try this

  • Run blameless postmortems with clear action items
  • Document contributing system factors
  • Invest in fixes that reduce human error
  • Share learnings across teams
  • Track improvements to prevent recurrence

Example

An outage was traced to alert fatigue and unclear ownership. The team fixed alert thresholds and created a rotating review, preventing future misses.

Reflection prompt

Think of your last incident. What system change would prevent it regardless of who was on-call?

More like this

Heuristic

Short Feedback Loops

Fast feedback beats perfect plans.

FlowLearning

Heuristic

Continuous Learning Culture

Learning is uptime.

Learning

Heuristic

Fail Closed, Log Everything, Recover Gracefully

Safe failure beats quiet failure.

ReliabilitySecuritySecurity

Heuristic

Make Infrastructure Disposable

Cattle, not pets.

AutomationReliabilityInfrastructure

Heuristic

State Is Your Enemy, Treat It Carefully

Less state, fewer surprises.

ReliabilityArchitectureInfrastructure

Heuristic

Test Where It Breaks, Not Where It Works

Test the breaks, not the breeze.

AutomationReliabilityDelivery