110 supermarkets. 16 hours a day. Staff sickness taking out whole shifts at a time. Demand patterns that broke overnight and stayed broken for months. I learned more about operational resilience in those nine months than in the previous nine years — and most of it holds up in normal trading too.
This post isn't a war story. I have plenty of those, but they're not particularly useful to an operations director trying to do their job well in 2026. What is useful is the operational lessons that came out of running a regional distribution centre through a period when nothing about the operation worked the way it had been designed to work, and the only choices were either to find new ways to make it work or to fail in front of 110 stores that were depending on us to deliver.
What follows is five lessons that emerged from that period, told with the benefit of about five years of hindsight. They're not Covid-specific. They're things that became visible because the operation was under enough stress to make them visible, but they were true the whole time. They're just easier to ignore in normal trading.
If you run a UK warehouse operation, at any scale, in any sector — these are the operational properties that distinguish the operations that survive bad weeks from the ones that buckle.
The first thing Covid did to operations was strip out the people who knew how things worked. Not in a planned way, with handover and replacement. Suddenly, at random, with no notice. A team leader would be off for two weeks. Then another. Then three of their direct reports. The institutional knowledge of how the operation actually ran — as opposed to how it was documented to run — was being subtracted in chunks.
What I learned, painfully, is that the operations that survived this were the ones where the process was actually written down. Not the ones where the process was "in everyone's head" or "we've always done it this way." The latter operations failed quickly, because "we've always done it this way" turned out to mean "three specific people knew how to do it, and they're all off."
In normal trading, the difference between a properly documented process and an institutionally-known one is invisible. In stress, it's everything. If your operation depends on five specific people knowing how things work, you don't have a process — you have a fragility.
The second lesson is one that I've since seen in almost every operation I've audited, including ones that have never had a crisis. Most warehouse operations are tuned tightly to their normal volume profile. They can handle the average week elegantly. They struggle with the bad week.
What this looks like in practice: the operation is staffed to handle Monday's volume, with maybe 10% headroom. When Monday is genuinely 20% higher than expected, the operation falls behind. Wednesday's volume hits while Monday is still being cleared. Thursday compounds. By Friday, you're running overtime, errors are climbing, and the team is exhausted — not because the average volume changed, but because the variance broke the tight tuning.
The Covid version was extreme — variance of 200-300% on some product lines — but the principle is universal. Operations that work well over time are tuned for variance, not for averages. They have surge capacity in critical zones. They have slack in scheduling. They have process steps that flex rather than queue. Operations that look efficient on a good week are usually fragile on a bad one.
I've written about this elsewhere — it's the most common single cause of sustained overtime I see in audits — but Covid made it brutally obvious. With sickness taking out shift supervisors and team leaders at random, the only thing that kept the late shift from inheriting chaos every day was the discipline of the handover.
Before Covid, our handover was about 15 minutes, fairly informal, mostly verbal. After the third week of someone-different-running-the-shift-every-day, we rebuilt it. Written log. Standard structure. Five minutes, every transition, no exceptions. The single change that produced the biggest reduction in cross-shift problems for the rest of the period.
What I take from that, applied to normal operations: the handover is where the previous shift's problems either get absorbed by the next shift or get left for the next shift to deal with. If it's not written down and structured, the absorption defaults to the people, and the people forget, and the problems compound.
Pre-Covid, our forecasting was solid. We knew what was coming. We staffed accordingly. The plan and the floor mostly agreed.
Then March 2020 happened, and the forecast became actively misleading. Volume patterns broke overnight. Categories that were stable for years became volatile. The forecast was still being produced and still being used to plan staffing, but the underlying assumptions it was built on had become wrong without anyone updating it.
The lesson — which I've since applied to every operation I've worked with — is that a forecast is only useful when the variance between forecast and actual is regularly checked, and the check actually influences planning. When the forecast becomes a number that's produced, circulated, and not interrogated, it goes from being a tool to being a hazard. The operation runs on assumptions that nobody has validated.
This is part of why the forecast-vs-actual loop is one of the highest-impact interventions in PainKiller audits. Most operations are running forecasts that haven't been pressure-tested against reality for months. The Covid version of this was extreme; the normal-trading version is just slower-moving and more invisible.
This is the lesson that took the longest to fully land, and the one I'd argue is the most generalisable to normal operations.
During the worst stretches of Covid, we kept that RDC running because the team kept showing up. People came back from being ill to cover sickness. Team leaders worked double shifts to keep stores supplied. People did jobs they hadn't been trained for, learned them on the job, and made it work. Nobody had to ask them. They just did it.
What I learned from that — and what I now look for in every audit — is that the relationship between an operations team and an operation is something most directors don't actively manage, and it shows. The teams that show up when things are hard are the teams that have been treated as people, not as labour, in normal times. The teams that turn over fast and disengage when there's pressure are the teams that have been treated transactionally.
The operational implication is that retention, engagement, and operational performance are not separate problems. They're the same problem viewed through different lenses. The operations directors who run resilient sites are the ones who understand this and act on it long before there's a crisis.
What this means for your operation now (no crisis required)
Five years on, with a lot of warehouse audits behind me, here's what I take from all of this for operations directors running UK warehouses today:
- Document your process properly. Not in a binder that nobody opens. As live, current, accessible material that someone new could read on day one and understand how things actually work. If the answer to "how do we do X here" is "ask Steve," your operation is one Steve away from failing.
- Build for variance, not averages. Look at your bad weeks, not your good ones, when you're designing capacity. If your operation only works at average volume, you have a problem that hasn't expressed itself yet.
- Take the handover seriously. Written, structured, five minutes, every transition. The cheapest, fastest operational improvement available to most warehouses.
- Pressure-test the forecast weekly. Forecast vs actual, every week, with someone responsible for asking "why was this wrong" rather than "what's the next forecast." A forecast nobody interrogates is worse than no forecast.
- Manage the team relationship deliberately. Retention and engagement are operational metrics, not HR metrics. The directors who treat them that way run sites that perform better, especially when things are hard.
A free 12-page diagnostic guide built from patterns observed across PainKiller audits. Covers the operational signals that distinguish resilient operations from fragile ones. No email required.
Open the guide →A final thought
The thing about resilience is that it's invisible until it's needed. The operations that survived Covid weren't the ones that "responded brilliantly" — they were the ones that had quietly built resilient operating models for years before, often without realising it was resilience they were building. They had written processes. They had handover discipline. They had teams who would show up.
The operations that didn't make it through, or that made it through expensively, weren't worse-managed. They were just tuned tighter to normal trading, with no margin for variance and no documentation to fall back on. They were efficient on paper. They just weren't robust.
If I had one thing to say to operations directors now, with hindsight: resilience is a property you build in normal trading, not in a crisis. By the time you need it, it's too late to build. The interventions that produce it are unglamorous — process documentation, handover discipline, forecast loops, team relationship work. They're also the things that pay off every other week, not just during the bad ones.