Skip to main content

Microsoft Copilot Studio Brings Computer-Using Agents to the Enterprise

For years, IT and DevOps teams have wrestled with the same stubborn problem: how do you automate workflows in systems that were never built for automation? Legacy apps, vendor portals, and proprietary line-of-business platforms rarely offer APIs. That means someone, usually a human, ends up clicking through screens, entering data, and completing transactions by hand.

Microsoft has a direct answer to that problem. Computer use in Microsoft Copilot Studio is now generally available, with expanded availability rolling out to all commercial geographies in Microsoft Power Platform.

What Computer-Using Agents Actually Do

The simplest way to think about it: computer use gives an agent the same tools a person has — a browser, a screen, a keyboard, and the ability to read what’s on the page and take the next logical step.

That’s a meaningful shift. Most automation tools rely on brittle, selector-based scripts that break the moment a UI changes. Instead, the computer uses a tool that relies on vision and reasoning to navigate live UIs — adapting when layouts shift, fields move, or workflows branch.

In practical terms, this means agents can now handle workflows that previously required manual workarounds or expensive integration projects. For organizations with deep investments in proprietary platforms or third-party portals, workflows that previously required either a multi-quarter integration project or an army of contractors clicking through screens can now be handed to an agent. For enterprise IT teams, this can also reduce pressure to modernize or rebuild legacy systems before automation can begin.

A Real-World Example

Microsoft points to Graebel, a global mobility and relocation services company, as an early example of what this looks like in production.

Working with GET AI and Microsoft, Graebel built and deployed the Graebel Service Order Agent in Microsoft Copilot Studio. The agent monitors designated mailboxes and interprets unstructured service-order emails using Azure Content Understanding, extracting key data into a structured form with confidence scoring. It validates each request against Graebel’s business rules, service logic, and compliance requirements before any action is taken. The agent then operates Global Connect directly through its UI — navigating screens, entering data, and completing transactions exactly as a trained human operator would, without APIs or platform redevelopment. Exceptions and low-confidence cases are escalated through human-in-the-loop workflows, preserving governance and service quality.

That last point matters. Keeping humans in the loop for exceptions isn’t a limitation — it’s the right design for enterprise workflows where compliance and accuracy aren’t optional.

Security, Governance, and Model Choice

One concern that often follows news like this: what about security? Microsoft is keenly aware of that question. The platform is built within the existing Power Platform governance framework, so agents inherit the same security controls, compliance policies, and audit capabilities that organizations already have.

Microsoft is actively investing in advanced governance, operations, and scale for computer-using agents, with customer feedback directly informing the roadmap.

Copilot Studio now also supports model selection for agents, with models from both OpenAI and Anthropic available. That flexibility matters for teams that want to match the right model to the right task — whether they’re optimizing for reasoning depth, latency, or cost.

What DevOps Teams Should Pay Attention To

Computer-using agents aren’t a replacement for well-designed APIs or modern integrations. But they do fill a long-standing gap in enterprise environments. Not every system gets modernized on schedule. Not every vendor exposes the data access your team needs. And not every business process maps neatly to an existing automation workflow.

Getting started is straightforward. Teams can create or open an agent in Microsoft Copilot Studio, navigate to Tools, select Add tool, and then add a new computer-use capability. From there, the task can be described in natural language.

Computer-using agents extend automation from API-reachable systems to anything with a UI, the larger surface in most enterprises. General availability in Copilot Studio repositions the conversation from integration backlog to action governance, and model choice across OpenAI and Anthropic confirms that competitive value sits at the operating layer,” according to Mitch Ashley, VP and Practice Lead, Software Lifecycle Engineering, The Futurum Group.

“Procurement should treat this as a control plane question. UI-driven agents inherit human-level reach across legacy and SaaS workflows, so authorization, audit, and exception handling must operate at the action level. Without that visibility, autonomy stalls short of what the technology enables.”

The broader trend here is clear. Agentic AI is moving beyond conversation and into action. For DevOps teams managing a mix of modern and legacy environments, that’s worth paying close attention to.



from DevOps.com https://ift.tt/I36egoE

Comments

Popular posts from this blog

Cursor’s New SDK Turns AI Coding Agents Into Deployable Infrastructure

For most of its life, Cursor has been an IDE. A very good one. But with the public beta of the Cursor SDK, the company is making a different kind of move — one that should get the attention of DevOps teams. The Cursor SDK is a TypeScript library that gives engineers programmatic access to the same runtime, models, and agent harness that power Cursor’s desktop app, CLI, and web interface. In short, the agents that used to live inside an editor can now be invoked from anywhere in your stack. That’s a meaningful shift in how AI coding tools fit into software delivery pipelines. From the Editor to the Pipeline If you’ve used Cursor before, the workflow is familiar — you interact with an agent in real time, asking it to write functions, fix bugs, or review code. The SDK breaks that dependency on interactive use. Now you can call those same agents programmatically, from a CI/CD trigger, a backend service, or embedded inside another tool. Getting started is a single inst...

Mistral Moves Coding Agents to the Cloud — and Gets Out of Your Way

For the past year or so, AI coding agents have been tethered to your local machine. You kick off a task, watch the terminal, and babysit every step. It works — but it’s not exactly hands-free. Mistral just changed that. On April 29, the Paris-based AI company announced remote coding agents for its Vibe platform, powered by a new model called Mistral Medium 3.5. The idea is simple: Instead of running coding sessions on your laptop, they now run in the cloud — asynchronously, in parallel, and without you watching over them. What’s Actually New Coding sessions can now work through long tasks while you’re away. Many can run in parallel, and you no longer become the bottleneck at every step the agent takes. That’s the core pitch. You start a task from the Mistral Vibe CLI or directly from Le Chat — Mistral’s AI assistant — and the agent handles the rest. When it’s done, it opens a pull request on GitHub and notifies you, so you review the result inste...

OpenAI Debuts Symphony to Orchestrate Coding Agents at Scale

OpenAI has unveiled Symphony, an open-source specification that shifts how software development teams deploy AI in workflows, moving from interactive coding assistance toward continuous orchestration of autonomous agents. Symphony reframes project management tools as operational hubs for AI-driven coding. Rather than prompting an assistant for individual tasks, developers assign work through issue trackers, allowing agents to execute tasks in parallel and deliver outputs for human review. The change reflects a trend in enterprise AI in which systems are increasingly embedded into production pipelines rather than used as standalone tools. Symphony emerged from internal experimentation at   OpenAI , where engineers attempted to scale the use of   Codex   across multiple concurrent sessions. While the agents proved capable, human operators became the limiting factor. Engineers found they could only manage a handful of sessions before coordination overhead offset pro...