Claude computer use works, with caveats: here’s what 24 hours of real testing found.
Claude computer use shipped March 23, 2026, and I woke up that morning to find a spreadsheet populated, three browser tabs open with research, and a summary document waiting in my text editor. No macros. No Zapier chain. Just Claude, operating my Mac like a (slightly cautious) remote assistant.
This is Anthropic’s computer use feature, and it shipped as a research preview on March 23, 2026 for Claude Pro and Max subscribers. It’s the first time a leading AI assistant has offered mainstream users the ability to hand over actual desktop control: not through a plugin, not through an API integration, but by literally clicking around your screen.
I’ve been testing it for about 24 hours now. Here’s where it stands.
Quick Verdict
| Aspect | Assessment |
|---|---|
| Overall Score | 3.7/5: genuinely useful, clearly unfinished |
| Best For | Repetitive desktop tasks you’d normally script or delegate |
| Availability | Claude Pro ($20/mo) and Max ($100/mo) subscribers only |
| Reliability | ~70-75% task completion on structured workflows |
| Speed | Slower than you’d do it manually. Faster than teaching someone else to do it |
| Privacy Risk | Medium: Claude sees your screen. Anthropic says nothing is stored |

Bottom line: This is the most practical computer-use implementation from any major AI lab. It works. It’s also slow, occasionally confused, and not something I’d run unsupervised on anything important. But for the right tasks, it’s already saving me real time.
The setup is simpler than I expected. You open Claude on your phone (or the desktop app, but the phone workflow is the point), type a task description, and Claude takes control of your Mac or PC. It can open applications, navigate web pages, type text, click buttons, fill form fields, and move between windows.
No step-by-step scripting required. You describe the outcome (“find the Q1 revenue numbers from the finance dashboard and put them into the budget spreadsheet”) and Claude figures out the clicks.
Under the hood, it’s taking screenshots of your display at regular intervals, interpreting what it sees, deciding what action to take next, executing that action, then taking another screenshot to verify the result. It’s a perception-action loop running on your actual desktop, not a sandboxed simulation.
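To make that loop concrete, here is a minimal sketch in Python. It is not Anthropic’s implementation: `FakeDesktop`, `Action`, `decide`, and `run_loop` are hypothetical names invented for illustration, and the "screen" is a plain dictionary rather than real pixels. It only shows the shape of the cycle: look, decide, act, look again.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Action:
    kind: str      # hypothetical action type, e.g. "type"
    target: str    # which on-screen field the action applies to
    text: str = ""


@dataclass
class FakeDesktop:
    """Stand-in for a real screen: field name -> current value (assumption)."""
    fields: dict = field(default_factory=dict)

    def screenshot(self) -> dict:
        # A real agent would capture pixels; here we return a copy of the state.
        return dict(self.fields)

    def execute(self, action: Action) -> None:
        if action.kind == "type":
            self.fields[action.target] = action.text


def decide(screen: dict, goal: dict) -> Optional[Action]:
    """Pick the next action: fill the first field that does not match the goal."""
    for name, wanted in goal.items():
        if screen.get(name) != wanted:
            return Action("type", name, wanted)
    return None  # the screen already matches the goal


def run_loop(desktop: FakeDesktop, goal: dict, max_steps: int = 20) -> int:
    """Screenshot -> decide -> execute, repeated until the goal is verified."""
    steps = 0
    while steps < max_steps:
        screen = desktop.screenshot()   # perceive
        action = decide(screen, goal)   # interpret and choose
        if action is None:
            return steps                # fresh screenshot confirms the result
        desktop.execute(action)         # act
        steps += 1
    raise RuntimeError("step budget exhausted before the goal was reached")


if __name__ == "__main__":
    desktop = FakeDesktop({"name": ""})
    goal = {"name": "Ada", "email": "ada@example.com"}
    print(run_loop(desktop, goal), desktop.fields)
```

The step budget is the design choice worth noticing. A loop that acts on what it sees can cycle forever if it misreads the screen, so even a toy version needs a hard stop.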
What Claude computer use can do right now:
What it can’t do (yet): anything requiring drag-and-drop precision, most keyboard shortcuts, complex image editing, or tasks that need sub-second timing.
I ran Claude through a dozen real tasks over the past day. Not benchmarks: stuff I’d actually need done on a Monday morning.
I asked Claude to find pricing information for five project management tools from their websites and fill in a comparison spreadsheet I’d already set up in Google Sheets.
Result: It got four out of five correct. Missed one because the pricing page had a dynamic element that wasn’t visible in the screenshot capture. Took about 8 minutes for what would’ve taken me 15. Not faster for one-off tasks, but I was drinking coffee instead of clicking.
I had a batch of 12 entries to submit through an internal tool with a web interface. Each entry had the same five fields, different values. I pasted the data into my message and told Claude to fill them in.
Result: 11 out of 12 correct. It stumbled on a dropdown menu that required scrolling, entered the wrong value, then corrected itself on the next entry. Total time: about 20 minutes. I would’ve hated every second of doing this manually.
I told Claude to open TextEdit, write a brief summary of a PDF I had open in Preview, and save the file to my Desktop.
Result: It worked. The summary was decent (not great, since it was working from screenshots of the PDF, not parsing the actual text). Saved the file correctly. The whole thing felt like watching someone remote-desktop into my machine.
I asked Claude to check my email for a specific thread, extract a few data points from it, add them to a Numbers spreadsheet, then open Slack and send a message summarizing the update.
Result: Partial success. It found the email, got the data, updated the spreadsheet correctly. Then it opened Slack but typed the message into the wrong channel. I caught it before it sent. This is the kind of near-miss that makes supervision non-optional right now.
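For the two tests that report counts (four of five pricing lookups, 11 of 12 form entries), the item-level accuracy is easy to work out. The sketch below does only that arithmetic. The dictionary labels are mine, and these per-item figures are not the same measurement as a whole-task completion rate, where one wrong step can fail the entire task.

```python
from fractions import Fraction

# (correct, total) for the two tests that report explicit counts.
results = {
    "pricing research": (4, 5),
    "form entry": (11, 12),
}


def item_accuracy(correct: int, total: int) -> Fraction:
    """Share of individual items handled correctly."""
    return Fraction(correct, total)


for name, (correct, total) in results.items():
    print(f"{name}: {float(item_accuracy(correct, total)):.1%}")

# Pool both tests: 15 correct items out of 17.
combined = item_accuracy(
    sum(c for c, _ in results.values()),
    sum(t for _, t in results.values()),
)
print(f"combined: {float(combined):.1%}")
```

Seventeen items is a tiny sample, so treat the output as a description of one day of testing, not a benchmark.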
I need to be direct about the limitations, because the demo videos make this look smoother than it is.
It’s slow. Each screenshot-analyze-act cycle takes a few seconds. A task that involves 30 clicks might take 3-4 minutes. You’re not going to watch Claude do something faster than you could. The value is doing it while you’re not at your desk, or while you’re focused on something else.
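The numbers above imply a per-cycle cost you can back out yourself: 30 clicks in 3-4 minutes works out to roughly 6 to 8 seconds per screenshot-analyze-act cycle. The sketch below is just that multiplication. The 6 and 8 second values are back-calculated assumptions, not measurements, and `estimate_seconds` is a hypothetical helper.

```python
def estimate_seconds(actions: int, seconds_per_cycle: float) -> float:
    """Rough task duration: one screenshot-analyze-act cycle per action."""
    return actions * seconds_per_cycle


# Assumed per-cycle bounds, derived from "30 clicks in 3-4 minutes".
low = estimate_seconds(30, 6.0)
high = estimate_seconds(30, 8.0)
print(f"{low / 60:.0f}-{high / 60:.0f} minutes")
```

The same helper is a quick way to sanity-check whether a task is worth delegating: count the clicks, multiply, and compare against how long you would take by hand.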
Visual interpretation failures happen. Claude reads your screen through screenshots. If a UI element is ambiguous, overlapping, or requires scrolling to reveal, it can misclick. Dark mode and non-standard UI layouts trip it up more than standard interfaces.
No undo awareness. If Claude makes a mistake (wrong cell in a spreadsheet, wrong field in a form), it doesn’t always recognize the error. It just keeps going. This is the single biggest reason you can’t walk away and trust it completely.
Browser-heavy tasks are inconsistent. Modern web apps with dynamic content, modals, cookie banners, and authentication flows create edge cases that Claude handles unpredictably. Simple static websites? Fine. A SaaS dashboard with a dozen floating elements? Coin flip.
Privacy is a real consideration. Claude is seeing your screen. All of it. Anthropic states that screenshots aren’t stored or used for training, but you should think carefully about running this with sensitive data visible. I closed my password manager and banking tabs before every test. You should too.
This isn’t the first computer-use agent. But it’s the first from a tier-one AI lab that’s available to regular subscribers rather than developers or enterprise buyers.
| Feature | Claude Computer Use | Manus My Computer | GPT-5.4 Computer Use |
|---|---|---|---|
| Availability | Pro/Max subscribers ($20-100/mo) | Separate subscription | Not yet launched for consumers |
| Setup | Built into Claude app | Separate agent install | N/A |
| Reliability | ~70-75% on structured tasks | ~65-70% | N/A |
| Mobile trigger | Yes, send from phone | No | N/A |
| Speed | Slow but steady | Similar | N/A |
| Platform | Mac and Windows | Mac, Windows, Linux | Expected Mac/Windows |
The mobile trigger angle is what separates Claude’s implementation. Manus and OpenClaw both require you to be at your computer to set up the task. Claude lets you message from your phone while you’re on the train and come back to a completed task. That’s a different product category.
Anthropic shipped a second feature alongside computer use: Claude Code Channels. This connects Claude Code (Anthropic’s developer tool for writing and editing code) to messaging platforms like Telegram and Discord.
The use case: you’re away from your dev machine, you realize you need to fix something or run a script, and you message Claude Code through Telegram. It executes on your machine. No VPN, no SSH, no remote desktop.
I’m a Claude Code user, and this immediately made sense to me. The number of times I’ve wanted to trigger a build or run a test from my phone is embarrassing. This solves that specific itch.
It’s a developer-only feature and requires Claude Code to be running on your machine, so it’s not for everyone. But if you’re already in that ecosystem, it’s a natural extension of the agentic AI shift that dominated GTC last week.
Yes, try it if:
Not yet if:
Computer use is the logical next step in a trend that’s been building all month. Agentic AI stopped being a concept and became the default at GTC 2026. Google shipped Gemini agents that act on your behalf across Gmail and Drive. OpenAI has been building toward its own computer-use offering.
What makes Anthropic’s move interesting is the packaging. This isn’t a research paper. It isn’t a limited beta for enterprise partners. It’s a feature that any Claude Pro subscriber can turn on today. That’s a deliberate choice to put autonomous computer control in the hands of millions of paying users, not just developers and researchers.
Whether that’s bold or premature depends on how the next few weeks go. If the error rate drops and the speed improves, this becomes the feature that justifies the subscription. If people lose work because Claude deleted the wrong file or sent a message to the wrong person, Anthropic will hear about it loudly.
My bet: it’ll land somewhere in between. Good enough to be useful for specific tasks, not reliable enough to trust blindly. That’s exactly where most AI agent tools sit right now, and exactly the trajectory that leads to something much more reliable in 6-12 months.
Claude computer use works, and it’s the first mainstream implementation of something the AI industry has been promising for years. It’s also slow, imperfect, and not ready for unsupervised operation on anything that matters.
That sounds like a lukewarm endorsement. It isn’t. The fact that I can message Claude from my phone and come back to a populated spreadsheet (even one I need to double-check) represents a genuine shift in what an AI assistant can do. Not a theoretical shift. A practical one, available today, included in a subscription I already pay for.
Give it a test this week. Start with something you’d normally put off because it’s tedious. If Claude handles it, you’ll immediately see the value. If it doesn’t, you’ve lost five minutes and learned where the ceiling currently sits.
Either way, computer use is where AI assistants are headed. Anthropic just got there first with something you can actually use.
Claude computer use is a feature that lets Claude autonomously control your Mac or PC: opening apps, browsing the web, typing text, clicking buttons, and moving data between applications. You describe a task in natural language, and Claude executes it by visually interpreting your screen and taking actions, without requiring step-by-step instructions or pre-built integrations.
It’s included with Claude Pro ($20/month) and Claude Max ($100/month) subscriptions at no additional charge. It launched March 23, 2026 as a research preview. Free-tier Claude users do not have access.
Claude takes screenshots of your display to interpret what’s on screen. Anthropic states these screenshots are not stored or used for training. However, Claude can see everything visible on your screen during a session, including sensitive information. Close password managers, banking tabs, and private documents before starting a computer use session.
Claude’s implementation is the first from a major AI lab available to regular consumers (not just developers). Its key differentiator is the mobile trigger: you can send tasks from your phone. Manus My Computer and OpenClaw both require you to be at your computer to initiate tasks. Reliability is comparable across all three, in the 65-75% range for structured workflows.
Claude Code Channels, launched alongside computer use on March 23, 2026, connects Claude Code (Anthropic’s developer tool) to messaging platforms like Telegram and Discord. Developers can trigger code execution, builds, and scripts from their phone without needing SSH or remote desktop access to their development machine.
For specific, repetitive desktop tasks (data entry, web research, form filling) it can handle the work at a basic level. For anything requiring judgment, context about your preferences, or real-time communication with other people, a human assistant is still necessary. Think of it as a complement to your workflow, not a replacement for a person.
Last updated: March 24, 2026. Based on hands-on testing during the first 24 hours of the research preview. Features and reliability may change as Anthropic updates the preview.