The Agents Have Arrived. Now What?

There is a peculiar moment in every technological revolution when the equipment finally works and everyone suddenly realizes that nobody wrote down the actual reason for buying it.

We have now reached that moment with AI agents.

On one side stands OpenClaw, bristling with GitHub stars, promising to live on our own hardware, preferably a discreet Apple Mac mini humming somewhere behind the bookshelves like a very clever crustacean in a witness protection program. On the other side stands Google Gemini Spark, polished, cloud-resident, professionally lit, and probably already aware that your dentist appointment conflicts with your annual tax panic. One is the home-built workshop assistant. The other is the tireless corporate concierge. Both are sold as the future. Both can write emails, inspect calendars, organize files, monitor things, summon other tools, and generally give the impression that the twenty-first century has finally grown little mechanical legs.

The obvious question is not whether this is impressive. It is impressive. The obvious question is: what the hell do we do with it?

The first wave of agent enthusiasm has been delightfully adolescent. We ask agents to summarize our inbox, as though the chief tragedy of modern life were that we had not yet found a sufficiently expensive way to read newsletters. We ask them to schedule meetings, which is useful, although it also risks creating a world in which agents schedule meetings with other agents to discuss whether the human stakeholders are still required. We ask them to monitor subscriptions, chase receipts, draft replies, compare hotels, file documents, and tell us which of our many neglected projects deserves fresh neglect this week.

This is all practical. It is also faintly depressing. We have built small tireless digital servants and immediately assigned them the work of a moderately competent office intern.

Perhaps that is inevitable. New tools first imitate old workflows. The first movies looked like filmed theatre. The first websites looked like brochures that had fallen into a modem. The first AI agents will look like secretaries who never ask for coffee breaks and occasionally hallucinate the location of the stationery cupboard.

But agents become interesting only when we stop treating them as chatbots with elbows.

A chatbot answers. An agent continues. That difference looks small until you apply it to real life. A chatbot can tell you that your server logs look suspicious. An agent can watch them, compare them with yesterday, open a ticket, draft a mitigation, run the safe diagnostic command, refuse the unsafe one, and wake you only when the fire has reached furniture height. A chatbot can suggest a blog outline. An agent can notice that three half-written drafts all orbit the same argument, assemble the fragments, check links, prepare image prompts, propose a slug, and leave the final judgment to the human who still has taste, prejudice, and a pulse.

That is the better model: not “do my work,” but “hold the thread.”

Modern work is not hard because each individual action is impossible. It is hard because everything leaks. Email leaks into calendars. Calendars leak into notes. Notes leak into repositories. Repositories leak into documentation. Documentation leaks into forgotten promises made in meetings nobody wanted to attend. The human mind is good at insight and bad at persistent clerical continuity. We remember the grand concept and forget the file name. We understand the architecture and lose the spreadsheet. We know exactly what must be done, except for the twelve small humiliating steps between intention and completion.

Agents are continuity machines.

This is why the OpenClaw versus Spark distinction matters. The local agent is closer to a workshop animal. It can touch your files, scripts, local models, private notes, odd directory structures, half-finished experiments, and possibly that one shell script you named final_really_final_2.sh. It belongs to the tradition of personal computing: powerful, dangerous, intimate, and occasionally repaired with curses. Its virtue is sovereignty. Its vice is that sovereignty usually comes with configuration files.

The cloud agent, by contrast, lives in the well-lit kingdom of managed identity, integrations, policy dashboards, and subscription tiers. It knows where your Google Drive is. It knows your calendar. It can glide through Gmail with the serene confidence of something that has never had to mount an external disk. Its virtue is convenience. Its vice is that convenience usually arrives carrying a clipboard and a privacy policy.

Neither model is morally pure. The local agent can be reckless with your own machine. The cloud agent can be exquisitely governed while quietly becoming the central nervous system of your life. The future will not be local versus cloud. It will be a bureaucratic aquarium of agents with different passports: some on your Mac mini, some in Google’s cloud, some in your phone, some embedded in applications, some sulking in Docker containers, all claiming to act “under your direction.”

That phrase is doing a lot of work.

The real discipline will be delegation design. Not prompt engineering. Not agent vibes. Delegation design. What may the agent observe? What may it decide? What may it prepare? What may it execute? When must it ask? What is reversible? What is logged? What is forbidden even if technically possible? An agent without boundaries is not an assistant. It is a raccoon with API keys.

The most useful agents will not be the ones that pretend to be autonomous executives. They will be the ones that are boringly accountable. Show me what you saw. Show me what you inferred. Show me what you changed. Show me what you refused to do. Leave tracks. Accept correction. Learn preferences, but do not become a theological authority on my personality because I once wrote “fine” in an email.

So what do we do with agents?

We give them the work that currently dies in the swamp between attention and action. We let them maintain project memory. We let them watch systems. We let them prepare drafts, fixtures, reports, pull requests, reading lists, invoices, and test matrices. We let them remind us of contradictions. We let them be suspicious of stale documents. We let them compare what we said last month with what we are about to ship today. We let them keep score.

And then we keep the dangerous part for ourselves.

The agent may assemble the evidence. The human must decide what matters. The agent may draft the email. The human must own the tone. The agent may find the pattern. The human must decide whether the pattern is meaningful or merely decorative. The agent may manage the machinery. The human must remain responsible for the direction of travel.

Otherwise we will not have built assistants. We will have built an extremely elaborate way to become middle managers of our own lives.

Still, there is something magnificent in the absurdity. Somewhere, a tiny local lobster-agent is waiting on a Mac mini. Somewhere else, Google’s cloud butler is polishing its permission prompts. They are ready to summarize, schedule, browse, file, monitor, compare, and politely ask whether we want to proceed.

The embarrassing truth is that they may be more ready than we are.

The Agents Have Arrived. Now What?

No comments yet

Continue reading

Does an AI Need a Soul to Care About Cows?

The Brake They Would Like Someone Else to Build

The Museum of Recent Intelligence