Think You Know Your AI Tools? Think Again

May 16, 2026
5 min read

This chapter is for the person responsible for hiring or briefing the AI expert, because the gap between surface-level AI fluency and genuine operational knowledge is where budget disappears and competitive advantage quietly walks out the door. It is also, quietly, leaving serious capability and serious money on the table.

The Tech-Creativity Gap No One Is Talking About

Technical capability is not creative judgment. A technically skilled team can build a workflow that runs correctly, select appropriate models, and produce output reliably at scale. That is real and valuable work. But there is a ceiling to what technical skill alone can deliver, and most businesses are hitting it without realising what they are bumping into.

Brand DNA is not a set of hex codes and a font choice. It is the accumulated understanding of what a business stands for, who it is speaking to, what those people need to feel, and how every piece of communication either reinforces or undermines that relationship over time.

You cannot gloss over that with a logo and call it brand-safe AI output. And you cannot outsource the creative judgment layer to a team whose expertise is infrastructure, however brilliantly they build it.

That standard is only achievable when the people designing the system understand brand as deeply as they understand the technology. When the creative intelligence is baked into the infrastructure, not applied as a final coat of paint.

The question worth asking your AI implementor is straightforward: can you articulate our brand's tone of voice, the emotional response our creative needs to produce, and how you test output against those standards at scale? If that question produces hesitation, you have a technically functional AI operation. Not a brand-safe one.

The Iceberg Problem

The tools most people know represent the visible tip of a model ecosystem of extraordinary scale and depth. Hugging Face, the primary open repository for AI models, currently hosts over one million models. The ones that appear in LinkedIn posts, TikTok tutorials, and agency pitch decks represent a fraction of a fraction of that library.

This matters for two reasons.

First, some of the most capable models available right now are not on anyone's radar because they were not launched with a PR campaign. They were published by research teams, open-sourced under Apache 2.0 licences, and are sitting in public repositories waiting to be used commercially, freely, by anyone who knows to look.

Second, the tools that do have brand recognition, the ones your team is paying per credit, per seat, per generation, are frequently not the best option for the job. They are simply the most marketed option.

Netflix is a useful illustration of exactly this point.

What Netflix Just Quietly Released

On April 3, 2026, Netflix published its first open source AI model on Hugging Face. It is called VOID, Video Object and Interaction Deletion, and what it does is genuinely remarkable.

Most video editing AI tools can remove an object from footage. VOID goes further. It understands physics and causality. Remove a person holding a guitar from a scene, and a lesser tool leaves a floating instrument. VOID understands that the guitar falls. That the displaced air settles. That the scene's logic changes. It rewrites the reality of the footage, not just the surface of it.

In head-to-head testing against Runway, one of the most widely used and most expensively subscribed video AI platforms on the market, VOID was preferred by human evaluators 64.8% of the time. Runway scored 18.4%.

It is licensed under Apache 2.0. You can use it commercially. It is free. The model weights, void_pass1.safetensors and void_pass2.safetensors, are available directly from huggingface.co/netflix/void-model, and it runs inside ComfyUI for those already working with node-based workflows.

The honest caveat: VOID requires a GPU with at least 40GB of VRAM to run inference. This is not a MacBook job. It needs dedicated hardware, an A100 or equivalent. But for any studio, production house, or agency running serious creative work, that hardware either already exists or costs a fraction of what would be spent burning Runway credits at scale over a year.

The point is not that everyone should run VOID tomorrow. The point is this: a model of Hollywood production quality, built by one of the world's most sophisticated media organisations, was released publicly for free, and most of the people paying significant monthly fees for inferior results have no idea it exists.

That is the iceberg.

The Token Burn Problem

Here is a question worth sitting with: at what point does using AI become more expensive than not using it?

The answer arrives faster than most people expect. Professionals using frontier AI models for deep, sustained work, complex image generation, long-form video, multi-pass creative iteration, are increasingly discovering that a serious session's token or credit cost can rival a skilled human's day rate. Not in theory. In practice, on actual invoices.

This is not an argument against AI. It is an argument for precision. For knowing which tasks genuinely benefit from AI capability, which tasks are better served by a human, and which tasks are being handed to AI out of habit, novelty, or the uncomfortable feeling that not using AI means falling behind.

Large frontier models accessed through SaaS platforms are expensive per query, metered by usage, and subject to pricing changes outside your control. For the tasks they are genuinely best suited to, complex reasoning, nuanced generation, sophisticated multi-modal work, they earn their cost. For the vast majority of routine, repetitive, well-defined tasks that make up most of an organisation's actual AI usage, they are a premium product solving a standard problem.

The teams genuinely ahead on this are running a deliberate split. Frontier models where the complexity justifies the cost. Smaller, local, or open source models for the volume work. The operational intelligence is in knowing the difference and building infrastructure that routes each task to the appropriate tool automatically.

SaaS AI Is the Entrance, Not the Building

If your entire AI operation runs through SaaS interfaces, chat windows, web-based image generators, subscription platforms, you are at the entrance to a very large building you have not yet entered.

This is not a criticism of those tools. They are accessible, well-designed, and genuinely capable within their scope. The problem is when they become the ceiling of an organisation's AI ambition rather than the starting point.

The real operational leverage in AI does not live in which platform you subscribe to. It lives in understanding models as infrastructure. In knowing what a SafeTensors file is and why that format matters for stable, reproducible model deployment. In understanding what a LoRA does and why fine-tuning a smaller model on your brand's specific visual language produces more consistent, more on-brand output than prompting a general-purpose frontier model and hoping. In understanding the difference between a 70-billion-parameter model and a 7-billion-parameter model, not just in scale, but in where each one belongs in a production workflow.

Hugging Face is not just a repository. It is a library of competitive advantage that most businesses have not opened the door to. When you understand how to navigate it, how to evaluate a model card, assess a licence, download weights, and integrate a model into a workflow, you are operating at a level that the subscription-only competitor simply cannot access.

The Choice Problem

AI models change. Weekly. Sometimes daily. A model that leads on image realism this month may be outperformed on typography next month by something released quietly on a Tuesday. For visual and video output specifically, the LLM, the LoRA, and the VAE each play a distinct role in what the final output looks and feels like. Get any one of those wrong and the result is a lottery, however good the prompt is.

This is why flexibility is not a nice-to-have in an AI creative system. It is the architecture.

A system locked to a single workflow, or to a vendor's curated selection of models with no transparency about what is running underneath, cannot respond to this pace of change. You are not buying capability. You are buying someone else's fixed moment in time, with no visibility into when it was last updated, what was chosen and why, or whether a better option now exists.

Well-designed AI infrastructure does not lock you in. It is built around the end output required, with model selection treated as a variable rather than a constant. The best operators are continuously evaluating, testing, and swapping components as the landscape evolves. That is not instability. That is the only rational response to a field moving at this speed.

The question worth asking any implementation partner is simply: how does your system adapt as models improve, and who makes that call? If the answer involves a procurement cycle or a platform roadmap outside your control, you are not building capability. You are renting a snapshot.

The Right Question for Your Business

The chapter started with a person who knows the tool names and feels informed. The chapter ends with a different, harder question for that person and the leadership team around them.

Not "which AI tools are we using?" but "do we understand what we are actually using, what it costs at every level, what better and cheaper alternatives exist, and whether the people making creative judgments with it are genuinely qualified to do so, or just technically capable?"

The businesses discovering real competitive advantage in AI right now are not the ones with the most subscriptions or the most impressive vendor relationships. They are the ones who have done the unglamorous work of understanding the landscape below the surface, who know that a Hollywood-grade video model can be free, that the most expensive frontier model is frequently the wrong choice, and that the creative intelligence directing the technology matters as much as the technology itself.

The iceberg is deep. Most of the value is in the part you cannot see from the surface. And most of the damage happens there too.

The Challenge

Ask your AI implementation team, internal or external, three questions this week.

What open source models are we currently evaluating or using, and why or why not? What is our actual cost per meaningful AI output across our current stack, and have we modelled what local or open source alternatives would cost? And finally: who on this team is making creative quality judgments on our brand output, and what is their background in brand, communication, and audience, not just in AI?

The answers will tell you more about your real AI position than any vendor deck or platform demo.

Share this post

Subscribe to our newsletter

By clicking Sign Up you're confirming that you agree with our Terms and Conditions.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.