Topics

If Apple can do the easy part, what is left for an AI wrapper?

If Apple can rewrite, suggest, generate, monitor, and act where users already work, what does your product know about workflow state, review responsibility, and risk that the system cannot?

If Apple can rewrite, suggest, generate, monitor, and act where users already work, what does your product know about workflow state, review responsibility, and risk that the system cannot?

Language: en

If Apple can do the easy part, what is left for an AI wrapper?

The first uncomfortable question

I used to look at a new AI wrapper and ask whether the feature worked. Could it rewrite the paragraph, summarize the page, generate the picture, draft the reply, turn a chat into a reminder, or produce a cleaner version of something the user already had? That was a reasonable first question when the system did not do much of this natively. A small product could feel useful simply by placing a capable model near a daily annoyance and removing a little copy-paste friction. The user did not need the product to understand the whole workflow yet. It was enough that the tool saved ten minutes in a place where the platform offered nothing comparable.

After WWDC26, I would ask a colder question: if the same action appears inside Mail, Messages, Safari, Photos, Shortcuts, or Siri AI, what does this product still know that Apple does not? Not what model does it call, not whether the interface is cleaner, not whether the first output looks impressive. What does the product know about the user, the workflow, the state of the work, the source of truth, the permission boundary, the review moment, or the cost of being wrong? That is the replacement question. It is colder because it removes the comfort of novelty. The product cannot hide behind the fact that AI itself feels new. It has to show knowledge that accumulates only through use, domain fit, and workflow ownership.

That is not a dunk on wrappers. A wrapper can be a very good way to discover a market. You put a useful interface around a model, watch what people repeat, and learn where the real workflow begins. Many durable products start as something thin because thin is how a team learns quickly. The problem is staying there. Apple is pulling a lot of generic intelligence into the daily environment, and generic intelligence was exactly what many early AI products were selling. Once the platform owns the broad action, the wrapper has to move toward workflow knowledge or accept that it is temporary. The honest founder response is not to defend the word wrapper as if it were an insult. It is to ask what the wrapper is teaching. If it teaches only that people like a faster first answer, the lesson is weak. If it teaches where context, review, memory, and accountability matter, the wrapper has done its job. The product should become harder to describe as "just a wrapper" every month. Not because the team hides the model call, but because the surrounding workflow becomes specific enough that the model call is no longer the only interesting part.

What the system quietly takes

Platforms usually take the work that is frequent, broad, low-risk, and annoying to launch a separate app for. Rewriting a sentence. Turning a chat into a reminder. Monitoring a Safari page for a restock or price change. Making a normal image. Suggesting a small next step in Messages. Pulling context from what is already on screen. None of these tasks are worthless. They are valuable precisely because they happen often. But they are too close to the operating system to remain defensible as a separate destination for long. The platform has a natural advantage whenever the task depends less on proprietary workflow state and more on being present at the exact moment of need.

Users are honest about this. If something can happen in the place where they already are, they do not want another account, another permission screen, another subscription, another browser tab, another clipboard handoff, and another moment of explaining context to a tool. A slightly better external tool has to fight the cost of leaving the flow. Sometimes it wins because the result is much better or the workflow is specialized. But for ordinary tasks, "open a separate AI app" becomes a tax the user will stop paying as soon as the system alternative becomes good enough. This does not require Apple to be best in every category. It only requires Apple to be good enough for the broad, frequent, low-risk job. That is usually where platform absorption starts.

Apple also gets to wrap these features in its privacy story. A thin external product now has to explain why it deserves the same text, image, browsing, or message context that a system feature can touch with less friction. For ordinary tasks, "our model is better" may not be enough of an answer, especially if the user cannot see a meaningful difference in the tenth use. The wrapper has to prove that it is not just a detour to the same generic capability. It needs a reason to receive context that is stronger than novelty. That reason might be domain memory, team workflow, evidence preservation, compliance review, brand control, or a specialized evaluation loop. Without one of those, the product is asking the user to leave the flow merely to reach a capability the platform is actively moving into the flow. There is also a consent problem hiding here. The more context an external tool asks for, the more the user wants to know what the tool will remember, share, train on, or expose to teammates. A system feature can often borrow trust from the platform. A startup has to earn that trust with product behavior, not only a privacy page. That means clearer scopes, better deletion, visible sources, and fewer reasons to paste sensitive work into a generic box.

State is the part that does not copy cleanly

The more I look at this, the less I believe the prompt is the durable part. The prompt is easy to move. The interface pattern is easy to copy. A clever output format can become familiar quickly. A model provider can release a stronger default. A platform can put the same transformation closer to the text field, image library, browser tab, or message thread. If the product is mostly a prompt, a button, and a formatted output, the platform does not need to copy the whole company. It only needs to make the core transformation convenient. That is why a thin wrapper can feel fine until the surrounding environment improves. The product does not collapse because it was useless. It collapses because the user no longer needs to leave the place where the work already lives.

What is harder to move is state. Not state as an abstract architecture word, but the lived record around the work: who approved the draft, which customer was involved, what source can be cited, which claim is risky, what version was sent, what budget was approved, what exception was granted, what needs review, what must never be shared, and what happens if the output is wrong. That state accumulates through repeated use. It is not the model answer. It is the memory of the work. A platform feature can generate a competent draft, but it does not automatically know why last week’s draft was rejected, which phrase legal approved, which source the team trusts, or which customer promise cannot be changed.

Take a sales email tool. If it only turns bullet points into polished copy, it is fragile. If it knows the last call, the promised next step, the pricing approval, the CRM stage, the account risk, the tone that account prefers, the legal language that must remain unchanged, and the exact moment a human must review the message, it is no longer just writing email. It is carrying a small piece of the sales workflow. Apple can make drafting easier inside Mail. It is much less likely to become the opinionated memory of one company’s revenue process.

The same pattern shows up in legal drafting, ecommerce images, research memos, clinical intake, hiring notes, and internal support. The output is the visible part, so it receives most of the attention. The defensible part is often the boring trail around it: context, permissions, review, evidence, recovery, and accountability. A research tool that only summarizes a PDF is weak. A research tool that preserves claim provenance, flags uncertain citations, remembers previous objections, and shows what changed after review is becoming a product. The wrapper survives by moving from output generation to workflow state. This is why the tenth use matters so much. By then the user is no longer impressed that text appeared. They want the tool to remember the relevant source, avoid the previous mistake, respect the same approval boundary, and make the next pass easier. That accumulated memory is the part a platform feature does not automatically inherit.

Starting thin can still be smart

I do not want to turn this into "all wrappers are bad." That is too blunt and not how products actually form. Thin products are often how a team finds the deeper work. You start with a narrow surface because you need to learn whether users care at all. You do not yet know which context matters, which step is painful, which output is trusted, which workflow repeats, or which buyer will pay. A wrapper can be a probe. It becomes dangerous only when the team mistakes the probe for the business. The right question for a thin product is therefore temporal: what will this surface teach in the next few months that lets the team move beyond the surface? If there is no answer, the wedge is not really a wedge.

The mistake is confusing the wedge with the destination. If the team learns quickly, the wrapper becomes a doorway into a real product. The team notices that users are not paying for the first draft; they are paying for approval history. They are not paying for the image; they are paying for brand-safe variations. They are not paying for the summary; they are paying for traceable claims and review. If the team does not learn, the whole company stays a text box with a better default prompt. That is the layer Apple is most willing to compress. The signal to watch is whether the roadmap gets more specific over time. If every release is another generic input mode, the product is drifting. If each release captures more of the real workflow, the wrapper is becoming substance.

There is a real counterpoint here: platforms do not absorb every useful workflow cleanly. Apple wants broad features. It does not want to become the system of record for every niche profession, every regulated process, every messy team habit, every procurement exception, every clinical review path, or every sales operating detail. That is exactly where a wrapper has to move. It survives by becoming more specific than Apple wants to be and more accountable than a generic system feature needs to be. Specificity is not decoration. It is the escape route from platform compression. The mistake is to add surface-level specificity, like a few templates named after industries, while the product still behaves like a general text box. Real specificity changes the data model, the permission model, the evaluation set, the review flow, and the recovery path. It changes what the product refuses to do as much as what it can produce.

The harder replacement question

I would take any AI wrapper and remove the model from its pitch for one minute. What remains? If the answer is a prompt library, a prettier editor, a saved output page, and a few export buttons, I would be nervous. Those can be useful, but they are not a durable reason to leave the system flow. If the answer includes user memory, source trails, collaboration, evaluation, permissions, version history, domain-specific constraints, and a reason the result can be trusted, then there may be a product. The model may still be central, but it is no longer the only thing being sold. This test is intentionally harsh because the market will be harsh. Platforms will not politely avoid the easy layer because startups got there first.

I would also look at the tenth use, not the first. Many wrappers feel impressive the first time because they compress a task the user already disliked. By the tenth use, the user expects memory. They expect fewer corrections, better defaults, less explanation, cleaner handoff, and a visible sense that the product has learned the shape of their work. If the product resets to a generic prompt every time, it is renting novelty from the model layer. Novelty can create trial. It rarely creates trust by itself.

The other test is substitution. If Apple’s built-in feature replaced the model call tomorrow, what would be lost? A durable product should have a painful answer to that question. The user would lose source history, workflow state, team approval, evaluation data, audit logs, policy boundaries, customer context, or specialized recovery. A fragile product will quietly admit that almost nothing disappears except a slightly different interface. That does not mean it has no value today. It means the clock is loud. I would push the question even further: if the model became cheaper, faster, and available everywhere, would this product become stronger or more irrelevant? Durable products should become stronger because their workflow state can use better intelligence. Thin wrappers become more irrelevant because the scarce thing they were selling has become ambient. A strong product should welcome better models because they improve the workflow. A weak wrapper fears better models because they erase the wrapper. This is the cleanest way to separate an entry point from a company. If the product becomes more valuable when intelligence is cheaper, faster, and more available, then it probably owns context, state, trust, or distribution that the model alone does not own. If the product becomes less valuable, it was selling access to the model layer in a moment before the platform normalized that access.

Where I land

WWDC26 does not kill AI products. It kills a comfortable excuse: that a clean UI around a capable model is enough time to build a moat. The platform is becoming better at generic assistance in the places where users already live. That does not erase the need for specialized products, but it raises the standard for what those products have to know. A wrapper that never moved beyond the first output will feel more exposed as system features improve. The uncomfortable but useful version of the lesson is this: if the product becomes less valuable when intelligence becomes more available, it was probably selling scarcity in the model layer rather than value in the workflow.

So I would still build AI products. I would just be much more suspicious of products whose value stops at generation. The better question now is simple and a little brutal: what does this product know about the user, the workflow, and the risk that the platform cannot know deeply? If the answer includes state, permission, evidence, review, and recovery, there is a path. If the answer is only "we call a strong model and make the first answer convenient," the platform can make that layer smaller very quickly.

If the answer is real, the wrapper was only the beginning. It was the temporary surface that helped the team find a durable product. If the answer is thin, Apple probably just made the clock louder. The next stage of AI product strategy is not about pretending wrappers never worked. It is about refusing to stop where the platform is most likely to arrive. A wrapper that becomes a workflow can survive the platform becoming smarter. A wrapper that remains a prompt around a generic action has to keep outrunning a company that controls the operating system, the default apps, the privacy story, and the place where the user already is. That is not impossible, but it is a brutal place to build without deeper state. The practical move is to ask what should be true after repeated use that was not true on day one. Does the product remember the user’s review decisions? Does it preserve sources? Does it learn the approval boundary? Does it make collaboration easier? Does it know which outputs were corrected and why? Does it make risky work easier to inspect? Does it provide a recovery path when the model is wrong? Those are the questions that turn a wrapper into infrastructure for a workflow. If the product cannot answer them, the platform does not need to kill it dramatically. It only needs to make the generic job convenient enough that users stop leaving their flow. The route out is not mystery. Pick a domain where context matters, make that context structured, keep the evidence trail, test the repeated cases, and build the review path before pretending the first answer is the product. A thin wrapper can still be the doorway, but the doorway has to lead somewhere. If month six looks like month one with more templates, the product is probably waiting for the platform to compress it. A stronger month six looks different: fewer generic prompts, more workflow state, clearer permissions, better memory, sharper refusal, and a result that improves because the product knows the history of the work. That is the difference between a wrapper that found a market and a wrapper that is still renting attention from the model layer. I would also look at what the product refuses to generate. A generic wrapper usually tries to answer everything because breadth looks useful. A workflow product knows when the source is weak, when approval is missing, when the request conflicts with policy, or when the user should inspect the result inside a fuller surface. That refusal is not a limitation. It is evidence that the product has moved beyond generic generation and now understands the cost of being wrong.