Most marketers will tell you that pre-flight testing is essential. Most marketers will also admit, when pressed, that they don’t really do it — at least not on the scale that matters. Pre-flight testing has a public-relations problem inside the industry: every CMO knows it works, almost every team underuses it, and the gap between intention and execution gets wider every quarter.
That gap costs real money. A meaningful share of every media budget — somewhere between 15% and 40%, depending on whose research you read — flows behind creative that simply doesn’t move the needle. The creative isn’t bad in any obvious sense. It’s just not the strongest of the available options. Without testing, “strongest of the available options” is a verdict that nobody on the team is positioned to defend with data.
So why doesn’t testing happen?
The three things that kill ad testing budgets
Most teams that have given up on rigorous pre-flight testing didn’t do so because they stopped believing in it. They stopped because of one of three operational realities:
Cost. A traditional brand-side ad-test costs anywhere from $4,000 to $25,000 per study. Multiply that across an agency-of-record producing 60 creatives a quarter, and the math stops working long before the campaign ships. The result: the most expensive and visible creative gets the most testing — and the long tail of regional, social-first, and mid-funnel creative goes out the door untested.
Speed. Studies typically take 3 to 6 weeks to field, analyse, and turn into a usable report. That’s incompatible with the cadence at which modern teams are actually shipping work. By the time the report lands, the creative has run for two weeks already and the next brief is on the table.
Methodology. Many established platforms still rely on rubrics and panel methodologies developed for 30-second TV. Apply them to a 6-second TikTok hook and the scoring is, generously, less reliable than the gut of a senior creative director. Teams notice. They stop trusting the score.
The reframe
The way out of the trap isn’t more rigorous testing on fewer creatives. It’s the opposite: lighter, faster, more frequent testing on more of them. The creative-effectiveness score doesn’t need to be 99% defensible against statistical attack. It needs to be 80% reliable, comparable across the category, and available in time to influence which creative actually ships.
Three operating principles tend to unlock that:
- Test before commitment. Don’t test the final cut after the brand has signed off — test the concept, the storyboard, even the rough Figma frames. Catching problems before production is where the savings compound.
- Test multiple variants. Comparing variants is more useful than scoring one creative in isolation. The relative deltas between variants are far more reliable than the absolute scores.
- Test on category benchmarks, not industry averages. A 4.2 on emotional resonance in beverage doesn’t mean the same thing as a 4.2 in fintech. Your scoring system needs to know your category.
What actually changes when teams adopt this
The visible change is that more shipping decisions are defended with data. The less-visible — and arguably more important — change is in what gets killed. Teams that test more aggressively kill more variants, earlier, and with less drama. The “let’s just go with it” creative that survives the brief but flunks the audience read gets caught before the media buy is committed.
That single shift — moving from one mandatory test on the hero film to many lightweight tests across the campaign — is what turns a testing budget from a necessary evil into a multiplier on the rest of the spend.
The hidden cost of skipping
The cost of an untested creative isn’t the test you didn’t run. It’s the media you bought against work that could have been better. A £200,000 media plan running behind a creative that scored 3.1 on Emotional Resonance, when a sibling variant scored 4.4, is a £200,000 media plan working roughly two-thirds as hard as it should. That’s not a marginal optimisation — that’s a third of the budget evaporating into a creative choice nobody made deliberately.
Most agencies and brands have an intuitive sense that this is happening. The pushback against testing isn’t usually a denial of the principle; it’s a tactical compromise driven by cost and speed. When the testing layer is fast enough and cheap enough that there’s no excuse to skip, the compromise stops making sense.
A pragmatic checklist
If you’re rebuilding your testing programme, the most useful exercise is to map your current spend against three questions:
- What percentage of our shipped creatives received any pre-flight test before launch?
- For the ones that did, did the test actually influence a go/no-go decision?
- For the ones that didn’t, can we defend the omission against post-campaign analysis?
If the honest answers are “less than half”, “not really”, and “no” — you’re not testing inadequately. You’ve already left the testing market and didn’t notice.
The fix isn’t another panel study. It’s a faster, cheaper, more comparable system applied to every brief that crosses the team. The creative that survives that filter is the creative your media plan should be funding.