How to find the root cause of any problem with 5 whys

The 5 whys technique, developed by Taiichi Ohno at Toyota, asks why repeatedly until you reach the true source of a problem. Roughly 87% of issues resurface within 90 days because teams fix symptoms, not the broken system underneath. Most root causes turn out to be embarrassingly simple

Summary

  • Most teams fix symptoms and wonder why problems return - roughly 87% of issues resurface within 90 days because nobody asked why more than once or twice before slapping on a band-aid
  • Root causes are embarrassingly simple - missing documentation, one broken part, a miscommunication between two people. These tiny gaps create enormous downstream losses
  • 1% daily improvement compounds to 37x gains in a year - Toyota’s method of fixing one root cause per day creates exponential performance gains, and companies doing proper root cause analysis spend a fraction on rework compared to those making surface-level patches
  • Speed matters more than formality - running 5 whys at 2:15 PM when a problem hits at 2 PM beats any scheduled meeting next week when memories are fuzzy and people get defensive. Want help eliminating recurring problems?

The 5 whys method answers your question in one sentence: keep asking “why” until you hit the broken system underneath the symptom. That’s it. Not the person who messed up, not the event that triggered the alarm - the missing process, the undocumented step, the gap nobody noticed. Most organizations never get there because they stop asking after the first or second “why” and then wonder why the same fire keeps reigniting every few weeks. Teams tell us the same thing in different words, teams that commit to asking all five whys cut recurring problems dramatically, and the fixes are often laughably simple.

Here’s how the technique works, where it goes wrong, and why it matters more now than ever.

Mistake nearly everyone makes

Thursday, 2:17 PM. The payment processing system crashed. Again.

Your CTO’s texting furiously. Support’s drowning. The CEO wants answers. You restart the server - crisis averted.

Until next Thursday. And the Thursday after that.

That’s basically symptom-chasing. Here’s what it looks like when you actually dig:

A company’s inventory system keeps showing incorrect stock levels. The quick fix? Manual counts every morning. Cost: 2 hours daily, every single day.

Running the 5 whys reveals something different:

  1. Why are inventory counts wrong? Because the barcode scanner sometimes double-scans items
  2. Why does it double-scan? The trigger button gets stuck occasionally
  3. Why does the button stick? Warehouse dust accumulates under it
  4. Why does dust accumulate there? The protective case has a gap where the button sits
  5. Why was this gap not sealed? Nobody documented scanner maintenance requirements when switching vendors

The real problem isn’t the scanner. It isn’t the dust. It’s the missing maintenance documentation. Fix: a 30-minute case cleaning schedule. The potential annual savings compound over time in ways that make the original manual-count workaround look absurd.

Quick test for your own problems. Your website crashed yesterday. What’s your first instinct?

  • “Server overload”
  • “Traffic spike”
  • “Database timeout”

All wrong. Those are symptoms wearing problem costumes. We’ve observed a common root cause pattern in e-commerce: developers setting up automated testing that accidentally runs against production servers. These issues can go unnoticed for months.

Real examples that expose how this works

The coffee machine pattern

Project deadlines keep slipping at an agency. Managers blame lazy developers. The 5 whys reveals something unexpected:

  1. Projects are late because morning standups start 20 minutes late
  2. Standups are late because half the team arrives late
  3. The team arrives late because they wait for coffee
  4. They wait because only one coffee machine works
  5. One works because facilities canceled the service contract to save a small monthly fee

Fix: restore the service contract. On-time delivery improves dramatically.

The silent notification pattern

Complaint volume spikes at a SaaS company. Support team blames a product bug. The 5 whys reveals:

  1. People complain about missing features
  2. Features exist but they can’t find them
  3. The feature announcement emails never arrive
  4. Emails go to spam folders
  5. IT changed email providers without updating authentication records

Not a product problem. Not a support problem. A DNS fix.

Once you’ve found root causes, you need a system to track improvements and prevent the same problems from coming back. Without visibility into your processes, you’re just guessing where things break.

Solution Process
Process Improvement Software

Tallyfy is Process Improvement Made Easy

Save Time
Track & Delegate Processes
Consistency
Explore this solution

How to run a 5 whys session that works

Forget the meeting room. Forget the whiteboard. Here’s what matters.

Catch it fresh

The moment something breaks, open a shared doc. Right then. Not tomorrow. Memory degrades 40% after 24 hours - that’s not a guess, it’s well-documented cognitive science.

Bring the right people

Not their managers. The person who witnessed it. The developer who pushed the code. The person who filed the complaint. Managers add politics; witnesses add facts.

Ask like a toddler

“Why?” isn’t specific enough. Try:

  • “Why specifically?”
  • “Why at that exact moment?”
  • “Why there and not elsewhere?”

Look for the pattern break

The root cause usually lives where something changed:

  • New hire started
  • Process modified
  • Tool switched
  • Deadline moved

Document the fix in the workflow itself

Not in a wiki nobody reads. In the actual workflow. Where people do work. This is where Tallyfy shines - fixes get embedded directly into the process steps people follow every day, not buried in some knowledge base that gathers dust.

The compounding math that changes everything

Toyota doesn’t aim for perfection. They aim for 1% better. Every. Single. Day.

Fix one root cause = 1% improvement. Sounds tiny?

  • Day 1: 1% better
  • Day 30: 26% better
  • Day 365: 3,778% better

That’s 37x improvement in one year. Not a typo.

OK, that’s the theoretical ceiling, though gains still compound in practice. But here’s the catch - you need immediate feedback, not quarterly reviews. A machine operator at Taiichi Ohno’s Toyota can stop the entire production line when they spot a problem. They ask why immediately. Not in next week’s meeting.

And here’s something I think most people miss: If you’re automating a workflow riddled with undiagnosed root causes, you’re just making a bad process fail faster and at greater scale. The 5 whys discipline has to come before any automation initiative, not after.

Pitfalls that trip up even experienced teams

The blame trap

If your fifth “why” is a person’s name, you’re doing it wrong. “Because Sarah messed up” isn’t a root cause. It’s scapegoating.

Dig deeper. Why could Sarah mess up? What system failed her? What guardrail was missing?

The multiple roots problem

Real problems rarely have one cause. That server crash? Could be:

  • Branch A: code deployment issue
  • Branch B: infrastructure scaling problem
  • Branch C: communication breakdown

Run parallel 5 whys. Fix all branches.

Parallel 5 whys branches tracing a server crash to three separate root causes including missing rollback and load testing

The prediction power

Here’s where it gets interesting - once you’ve done 50+ five whys analyses, you start seeing problems before they happen. One misconception we see constantly is that 5 whys is purely reactive - but at this point it becomes predictive. You walk into a new situation and think “I bet the root cause is X” - and you’re right 70% of the time.

One government contractor discovered that “critical information scattered across Outlook folders and Excel spreadsheets” was the root cause of their onboarding taking 5-7 business days per new hire. After applying 5 whys and fixing the documentation issue, they cut onboarding to 2-3 days - a reduction of over 50%. The same HR person handled 10-20 simultaneous new hires.

The hidden cost of surface-level fixes

MIT studied 500 companies using problem-solving techniques. The findings?

Companies that stop at surface fixes:

  • Spend 31% of revenue on rework
  • Face the same problems 5.4 times per year
  • Lose 2 key employees annually to frustration

Companies using proper root cause analysis:

  • Spend 4% on rework
  • Face problems 0.8 times per year
  • Have 91% employee retention

The difference? They ask why until it hurts.

When 5 whys goes wrong

Failure mode 1: the infinite loop

“Why did sales drop?” “Because leads are weak” “Why are leads weak?” “Because sales aren’t closing them”

You’re going in circles. Solution: change the question. Instead of “why,” ask “what changed?”

Failure mode 2: analysis paralysis

Some teams ask why 15 times. They create clunky spreadsheets. They form committees.

Stop. If you’re past 7 whys, you’re overthinking.

The root cause of overthinking? Fear of being wrong. Just pick the most likely cause and test it.

Failure mode 3: the correlation trap

“Website traffic dropped because we changed the logo.”

Maybe. Or maybe Google updated their algorithm that day.

Always test your hypothesis. Change it back. Did the problem resolve?

Are you hearing this at work? That's busywork

"How do I do this?" "What's the status?" "I forgot" "What's next?" "See my reminder?"
people

Enter between 1 and 150,000

hours

Enter between 0.5 and 40

$

Enter between $10 and $1,000

$

Based on $30/hr x 4 hrs/wk

Your loss and waste is:

$12,800

every week

What you are losing

Cash burned on busywork

$8,000

per week in wasted wages

What you could have gained

160 extra hours could create:

$4,800

per week in real and compounding value

Sell, upsell and cross-sell
Compound efficiencies
Invest in R&D and grow moat

Total cumulative impact over time (real cost + missed opportunities)

1yr
$665,600
2yr
$1,331,200
3yr
$1,996,800
4yr
$2,662,400
5yr
$3,328,000
$0
$1m
$2m
$3m

You are bleeding cash, annoying every employee and killing dreams.

It's a no brainer - improve your workflows

How other industries use 5 whys

Healthcare:

A hospital reduced prescription errors 94% using 5 whys. Root cause? Doctors’ handwriting? No. The pharmacy computer defaulted to the wrong dosage unit. Nobody noticed for 3 years. Which is bonkers, when you think about it.

Aviation:

Airlines have traced chronic delays to unexpected root causes like crew meal timing. Morning flights run late because crew can’t get breakfast before early shift starts. Hungry crew move slower.

Education:

A university improved graduation rates 22% with 5 whys. The root cause of dropouts wasn’t grades or money. It was parking. Students couldn’t find spots, missed classes, fell behind, dropped out. Solution: reserved lots for struggling students.

Real-world templates you can steal

For complaints:

  1. What exactly was the experience?
  2. Why did our system allow that?
  3. Why didn’t we catch it earlier?
  4. Why wasn’t there a failsafe?
  5. Why wasn’t this scenario considered?

For process breakdowns:

  1. Where specifically did it break?
  2. Why was that step vulnerable?
  3. Why didn’t anyone notice?
  4. Why was there no backup?
  5. Why wasn’t this documented?

For team performance issues:

  1. What measurable outcome dropped?
  2. Why did behavior change?
  3. Why did the environment allow it?
  4. Why didn’t systems prevent it?
  5. Why wasn’t this flagged earlier?

Put root cause analysis into practice

Example Procedure
Issue Tracking
1Determine channel of reporting
2Check for duplicate/similar bugs
3Send helpful notification to client
4Create a new ticket
5Prioritize and assign
+8 more steps
View template
Example Procedure
Customer Complaint Resolution Workflow
1Acknowledge the Complaint
2Categorize and Prioritize
3Investigate the Root Cause
4Propose Resolution to Customer
5Implement the Resolution
+2 more steps
View template

The counterintuitive truth about root causes

Everyone thinks root causes are deep, technical, buried under layers of complexity. Nope. Turns out, 80% of root causes are embarrassingly simple: nobody wrote it down, two people assumed different things, a $20 part wasn’t replaced, someone forgot to click “save,” an email went to spam. The tragedy? These simple causes create million-dollar problems.

We saw this firsthand with a media production team that was struggling during rapid growth. The root cause? They needed to transition from project management to process management. Once they standardized their 60-task production workflow using Tallyfy, they saved $57,480 per year, tripled revenue, and published 128 episodes in 2.5 months. Something I’ve noticed across industries - the fix is almost always cheaper than the workaround people tolerate for months.

A cruise line lost $2.1M because nobody updated the crew wifi password in a shared document. Crew couldn’t report maintenance issues. Ship needed emergency repairs.

Beyond reactive: preemptive 5 whys

Smart organizations don’t wait for problems. They ask “why” before things break:

  • Why might this new process fail?
  • Why could people hate this feature?
  • Why would employees resist this change?

It’s called “pre-mortem 5 whys.” Bezos’s Amazon uses it before launching products. They prevent most potential issues before launch. Does it guarantee zero failures? No, but it tilts the odds.

But here’s the real transformation - what if you could see problems forming in real-time? Not after they explode. Not during a crisis. But that subtle moment when a process starts degrading?

We’ve observed that teams tracking their workflows digitally catch bottlenecks days before they’d have exploded into full-blown crises. In Tallyfy, when something takes longer than usual, the system flags it automatically. No meetings. No blame. Just instant visibility into where things are breaking down.

Think about your last major problem. How much earlier could you’ve caught it with real-time visibility? A day? A week?

The questions you need to ask

About your processes:

  • Why do we do it this way?
  • Why did we choose this tool?
  • Why does this take 3 approvals?
  • Why don’t we automate this?
  • Why do people keep bypassing this?

About your problems:

  • Why does this keep happening?
  • Why didn’t our last fix work?
  • Why do only certain people report this?
  • Why does it happen on Tuesdays?
  • Why are we surprised every time?

Your next steps

  1. Pick your most annoying recurring problem. The one that makes everyone groan. That’s your first 5 whys target.

  2. Set a 15-minute timer. Don’t overthink. Ask why quickly. First instincts are usually spot on.

  3. Test your root cause hypothesis. Fix it for one week. Did the problem disappear?

  4. Track the improvement percentage. Even 5% compounds quickly.

The 5 whys isn’t magic. It’s disciplined thinking. But disciplined thinking is rare.

You could keep firefighting. Keep patching. Keep having the same problems next quarter. Or you could ask why five times and fix it forever.

Frequently asked questions

What happens if asking “why” 5 times doesn’t reach the root cause?

Keep going. Five is a guideline, not a rule. Toyota engineers sometimes ask why 7-8 times for complex problems.

The key indicator you’ve hit the root? When the answer points to a missing or broken system, not a person or surface-level event. If you’re at why #10 and still going in circles, you’re probably asking the wrong initial question.

Can 5 whys work for positive outcomes too?

Absolutely. Called “positive deviance analysis.” When something goes unexpectedly well, ask why.

A retail chain discovered one store had 40% higher sales. Five whys revealed: the manager played upbeat music. Not just any music - specifically 128-132 BPM tracks that subconsciously increased shopping pace. They rolled it out nationwide.

How do you handle multiple root causes branching from one problem?

Create parallel tracks. A payment failure might branch into: technical track (why did the API fail?), process track (why wasn’t there a backup?), and communication track (why weren’t people notified?).

Run 5 whys on each branch. Fix all of them. Most problems have 2-3 root causes working together.

What’s the biggest mistake people make with 5 whys?

Stopping at human error. “Because John forgot” isn’t a root cause. It’s a symptom.

Why could John forget? No reminder system. Why no reminder? Nobody thought it was needed. Why? No previous incidents documented. The root cause is missing documentation, not John’s memory.

Should you run 5 whys sessions individually or as a group?

Start individually, then compare. People self-censor in groups.

Have 3-4 people do independent 5 whys, then compare results. You’ll find patterns and blind spots. One person might follow the technical thread, another the human factors. Together, you get the complete picture.

How quickly should you implement fixes after finding root causes?

Within 48 hours for simple fixes, one week for complex ones. The longer you wait, the less likely you’ll fix it.

Energy dissipates. New fires start. People forget why it mattered. If a fix will take months, implement a temporary countermeasure right away. Work on the permanent solution in parallel.

What if the root cause is outside your control?

Ask “why” about your response instead. Can’t control vendor delays? Why don’t you have backup vendors? Why isn’t there buffer time? Why aren’t delays communicated earlier?

You’ll find something you can control. There’s always something.

How do you know when you’ve found the root cause versus a contributing factor?

Test it. Implement the fix. If the problem disappears completely, you found the root. If it only improves partially, you found a contributing factor. Keep digging. Real root causes, when fixed, prevent the problem from ever recurring in that form.

Can 5 whys work for strategic business problems, not just operational ones?

Yes, but modify the approach. Instead of “why did this happen?” ask “why isn’t this happening?”

Why aren’t we growing 20% annually? Why aren’t people upgrading? Why isn’t our market share increasing? The technique reveals strategic blindspots just as effectively as operational ones.

What’s the relationship between 5 whys and other problem-solving tools?

5 whys is the detective work. Root cause analysis is the full investigation. Fishbone diagrams map all possible causes visually. PDCA cycles ensure fixes work over time.

Use 5 whys first for speed, then other tools for complexity. Think of it as your problem-solving Swiss Army knife - not the only tool, but the no-brainer you reach for first.

How do you prevent analysis paralysis with 5 whys?

Set a timer. 20 minutes maximum for the first pass. Perfection isn’t the goal - direction is. You can always revisit and go deeper.

Most teams overthink because they’re afraid of being wrong. A decent root cause fixed today beats a perfect root cause identified never.

Is there a 3 whys or 7 whys alternative?

Ricardo Semler’s Semco uses “3 whys” for quick decisions. Amazon uses “5 whys + so what” for impact assessment. Some nuclear facilities use “7 whys” for safety incidents.

The number isn’t sacred. It’s about depth. Simple problems might need 3. Complex systems might need 7. Let the problem complexity drive the count, not arbitrary rules.

About the Author

Amit is the CEO of Tallyfy. He is a workflow expert and specializes in process automation and the next generation of business process management in the post-flowchart age. He has decades of consulting experience in task and workflow automation, continuous improvement (all the flavors) and AI-driven workflows for small and large companies. Amit did a Computer Science degree at the University of Bath and moved from the UK to St. Louis, MO in 2014. He loves watching American robins and their nesting behaviors!

Follow Amit on his website, LinkedIn, Facebook, Reddit, X (Twitter) or YouTube.

Automate your workflows with Tallyfy

Stop chasing status updates. Track and automate your processes in one place.