Control My Power App with Copilot Studio
This might be the week the bots stop “assisting”… and start working.
Microsoft quietly flipped a switch — and Copilot Studio can now literally use your computer.
Not API calls. Not connectors. Not cloud sandboxes.
Actual mouse movement. Real keyboard input.
A legit AI agent that can launch your Power App, fill the fields, and submit the form — like a disturbingly compliant intern.
In this episode we unpack the feature Microsoft calls Computer Use — the update that turns Copilot into a hands-on operator of Windows machines. We walk through setup, the security ceremony no one warns you about, and then watch the AI stumble, misclick, recover, adapt… and eventually succeed. It’s messy, slow, hilarious — and also historic.
This is agentic AI in the enterprise — the moment automation stops being a diagram and becomes a digital worker.
If your business runs on legacy apps, intranet buttons, and “almost integrated” everything… this is the episode you need to hear. Because this is where Copilot stops writing emails — and starts doing tasks.
You will either fear for your job… or go rewrite your job title to “AI Wrangler.”
In this episode we break down the wild new Microsoft Copilot Studio capability that literally lets an AI agent use your Windows computer — clicking, typing, dragging, opening apps, filling forms… like a real intern who doesn’t take lunch breaks.
This is the episode where Copilot stops advising — and starts doing actual work.
What We Cover
• What “Computer Use” inside Copilot Studio actually is
• Why this is a breakthrough for enterprise + legacy UI automation
• What setup steps you absolutely cannot skip
• What the AI actually “sees” on the screen when deciding actions
• Watching the agent struggle… misclick… adapt… and eventually succeed
• Security + governance realities you need to think about NOW
Key Takeaways
-
Copilot can now interact with software visually — not just via connectors
-
This means any app — even old ones with no API — can now be automated
-
This is REAL agentic AI in the M365 stack, not a macro replay
-
Every run is slightly different — because it’s reasoning live
-
Setup is bureaucratic for a reason: you’re giving AI mechanical control
-
This is the bridge between modern models + messy corporate user interfaces
Why This Matters for Business
| Pain Today | The New Reality |
|---|---|
| Legacy apps with no API | Now controllable by AI through Computer Use |
| manual data entry | now autonomous agent tasks |
| brittle RPA clicks | adaptive model-driven reasoning |
| integration “dead ends” | computer vision unlocks new automation surface |
This is the beginning of digital workers that operate like humans — not scripts.
Setup Requirements (High Level)
-
Windows 10/11 Pro (NOT Home)
-
Power Automate Desktop installed
-
Machine Runtime enabled + signed in
-
Machine registered in Power Automate → Monitor → Machines
-
“Enable for Computer Use” toggled ON
-
Runtime version ≥ 2.59 required
Agent Behavior Demo Highlights
-
Cursor movements are improvised in real-time
-
The model “reads” the screen like a human does
-
If a UI element doesn’t behave — it tries something else
-
Calendar picker = chaos until it finally typed the date manually
The point isn’t perfection.
The point is actual reasoning.
Keywords
Microsoft Copilot Studio
Computer Use Copilot
agentic AI Microsoft
AI automation Power Apps
Power Automate Desktop runtime
Windows automation AI
enterprise automation AI
autonomous digital workers
legacy UI automation AI
M365 Show Podcast
Support + Follow
Become a supporter of this podcast:
https://www.spreaker.com/podcast/m365-show-podcast--6704921/support
Keywords / Search Relevance
Opening: “The AI Agent That Runs Your Power App”Most people still think Copilot writes emails and hallucinates budget summaries. Wrong. The latest update gives it opposable thumbs. Copilot Studio can now physically use your computer—clicking, typing, dragging, and opening apps like a suspiciously obedient intern. Yes, Microsoft finally taught the cloud to reach through the monitor and press buttons for you.And that’s not hyperbole. The feature is literally called “Computer Use.” It lets a Copilot agent act inside a real Windows session, not a simulated one. No more hiding behind connectors and APIs; this is direct contact with your desktop. It can launch your Power App, fill fields, and even submit forms—all autonomously. Once you stop panicking, you’ll realize what that means: automation that transcends the cloud sandbox and touches your real-world workflows.Why does this matter? Because businesses run on a tangled web of “almost integrated” systems. APIs don’t always exist. Legacy UIs don’t expose logic. Computer Use moves the AI from talking about work to doing the work—literally moving the cursor across the screen. It’s slow. It’s occasionally clumsy. But it’s historic. For the first time, Office AI interacts with software the way humans do—with eyes, fingers, and stubborn determination.Here’s what we’ll cover: setting it up without accidental combustion, watching the AI fumble through real navigation, dissecting how the reasoning engine behaves, then tackling the awkward reality of governance. By the end, you’ll either fear for your job or upgrade your job title to “AI wrangler.” Both are progress.Section 1: What “Computer Use” Really MeansLet’s clarify what this actually is before you overestimate it. “Computer Use” inside Copilot Studio is a new action that lets your agent operate a physical or virtual Windows machine through synthetic mouse and keyboard input. Imagine an intern staring at the screen, recognizing the Start menu, moving the pointer, and typing commands—but powered by a large language model that interprets each pixel in real time. That’s not a metaphor. It literally parses the interface using computer vision and decides its next move based on reasoning, not scripts.Compare that to a Power Automate flow or an API call. Those interact through defined connectors; predictable, controlled, and invisible. This feature abandons that polite formality. Instead, your AI actually “looks” at the UI like a user. It can misclick, pause to think, and recover from errors. Every run is different because the model reinterprets the visual state freshly each time. That unpredictability isn’t a bug—it’s adaptive problem solving. You said “open Power Apps and send an invite,” and it figures out which onscreen element accomplishes that, even if the layout changes.Microsoft calls this agentic AI—an autonomous reasoning agent capable of acting independently within a digital environment. It’s the same class of system that will soon drive cross-platform orchestration in Fabric or manage data flows autonomously. The shift is profound: instead of you guiding automation logic, you set intent, and the agent improvises the method.The beauty, of course, is backward compatibility with human nonsense. Legacy desktop apps, outdated intranet portals, anything unintegrated—all suddenly controllable again. The vision engine provides the bridge between modern AI language models and the messy GUIs of corporate history.But let’s be honest: giving your AI mechanical control requires more than enthusiasm. It needs permission, environment binding, and rigorous setup. Think of it like teaching a toddler to use power tools—possible, but supervision is mandatory. Understanding how Computer Use works under the hood prepares you for why the configuration feels bureaucratic. Because it is. The next part covers exactly that setup pain in excruciating, necessary detail so the only thing your agent breaks is boredom, not production servers.Section 2: Setting It Up Without Breaking ThingsAll right, you want Copilot to touch your machine. Brace yourself. This process feels less like granting autonomy and more like applying for a security clearance. But if you follow the rules precisely, the only thing that crashes will be your patience, not Windows.Step one—machine prerequisites. You need Windows 10 or 11 Pro or better. And before you ask: yes, “Home” editions are excluded. Because “Home” means not professional. Copilot refuses to inhabit a machine intended for gaming and inexplicable toolbars. You also need the Power Automate Desktop runtime installed. That’s the bridge connecting Copilot Studio’s cloud instance to your local compute environment. Without it, your agent is just shouting commands into the void.Install Power Automate Desktop from Microsoft, run the setup, and confirm the optional component called Machine Runtime is present. That’s the agent’s actual driver license. Skip that and nothing will register. Once it’s installed, launch the Machine Runtime app; sign in with your work or school Entra account—the same one tied to your Copilot Studio environment. The moment you sign in, pick an environment to register the PC under. There’s no confirmation dialog—it simply assumes you made the right decision. Microsoft’s version of trust.Step two—verify registration in the Power Automate portal. Open your browser, go to Power Automate → Monitor → Machines, and you should see your device listed with a friendly green check mark. If it isn’t there, you’re either on Windows Home (I told you) or the runtime didn’t authenticate properly. Reinstall, reboot, and resist cursing—it doesn’t help, though it’s scientifically satisfying.Step three—enable it for Computer Use. Inside the portal, open the machine’s settings pane. You’ll find a toggle labeled “Enable for Computer Use.” Turn it on. You’ll get a stern warning about security best practices—as you should. You’re authorizing an AI system to press keys on your behalf. Make sure this machine contains no confidential spreadsheets named “final_v27_reallyfinal.xlsx.” Click Activate, then Save. Congratulations, you’ve just created a doorway for an autonomous agent.Step four—confirm compatibility. Computer Use requires runtime version 2.59 or newer. Anything older and the feature simply won’t appear in Copilot Studio. Check the version on your device or in the portal list. If you’re current, you’re ready.Now, about accounts. You can use a local Windows user or a domain profile; both work. But the security implications differ. A local account keeps experiments self‑contained. A domain account inherits corporate access rights, which is tantamount to letting the intern borrow your master keycard. Be deliberate. Credentials persist between sessions, so if this is a shared PC, you could end up with multiple agents impersonating each other—a delightful compliance nightmare.Final sanity check: run a manual test from Copilot Studio. In the Tools area, try creating a new “Computer Use” tool. If the environment handshake worked, you’ll see your machine as a selectable target. If not—backtrack, because something’s broken. Likely you, not the system.It’s bureaucratic, yes, but each click exists for a reason. You’re conferring physical agency on software. That requires ceremony. When you finally see the confirmation message, resist the urge to celebrate. You’ve only completed orientation. The real chaos begins when the AI starts moving your mouse.Section 3: Watching the AI Struggle (and Learn)Here’s where theory meets slapstick. I let the Copilot agent run on a secondary machine—an actual Windows laptop, not a sandbox—and instructed it to open my Power App and send a university invite. You’d expect a swift, robotic performance. Instead, imagine teaching a raccoon to operate Excel. Surprisingly determined. Terrifyingly curious. Marginally successful.The moment I hit Run, the test interface in Copilot Studio showed two views: on the right, a structured log detailing its thoughts; on the left, a live feed of that sacrificial laptop. The cursor twitched, paused—apparently thinking—and then lunged for the Start button. Success. It typed “Power Apps,” opened the app, and stared at the screen as if waiting for applause. Progress achieved through confusion.Now, none of this was pre‑programmed. It wasn’t a macro replaying recorded clicks; it was improvisation. Each move was a new decision, guided by vision and reasoning. Sometimes it used the Start menu; sometimes the search bar; occasionally, out of creative rebellion, it used the Run dialog. The large language model interpreted screenshots, reasoned out context, and decided which action would achieve the next objective. It’s automation with stage fright—fascinating, if occasionally painful to watch.Then came the date picker. The great nemesis of automation. The agent needed to set a meeting for tomorrow. Simple for a human, impossible for anyone who’s ever touched a legacy calendar control. It clicked the sixth, the twelfth, then decisively chose the thirteenth. Close, but temporal nonsense. Instead of crashing, it reasoned again, reopened the control, and kept trying—thirteen, eight, ten—like a toddler learning arithmetic through trial. Finally, it surrendered to pure typing and entered the correct date manually. Primitive? Yes. Impressive? Also yes. Because what you’re seeing there isn’t repetition; it’s adaptation.That’s the defining point of agentic behavior. The AI doesn’t memorize keystrokes; it understands goals. It assessed that manual typing would solve what clicking couldn’t. That’s autonomous reasoning. You can’t script that with Power Automate’s flow logic. It’s the digital equivalent of “fine, I’ll do it myself.”This unpredictable exploration means every run looks a little different. Another attempt produced the right date on its third click. A third attempt nailed it instantly but missed the “OK” button afterward, accidentally reverting its work. In each run
Become a supporter of this podcast: https://www.spreaker.com/podcast/m365-show-podcast--6704921/support.
Follow us on:
LInkedIn
Substack
WEBVTT
1
00:00:00.040 --> 00:00:04.240
2
00:00:04.400 --> 00:00:07.639
3
00:00:07.679 --> 00:00:11.640
4
00:00:11.720 --> 00:00:15.919
5
00:00:15.960 --> 00:00:18.640
6
00:00:18.679 --> 00:00:21.199
7
00:00:21.199 --> 00:00:24.559
8
00:00:24.600 --> 00:00:27.600
9
00:00:27.679 --> 00:00:30.600
10
00:00:30.640 --> 00:00:34.280
11
00:00:34.320 --> 00:00:38.280
12
00:00:38.280 --> 00:00:41.679
13
00:00:41.719 --> 00:00:44.679
14
00:00:44.759 --> 00:00:47.640
15
00:00:47.640 --> 00:00:51.600
16
00:00:51.719 --> 00:00:54.159
17
00:00:54.200 --> 00:00:56.880
18
00:00:56.960 --> 00:01:00.640
19
00:01:00.679 --> 00:01:03.560
20
00:01:03.600 --> 00:01:06.920
21
00:01:07.040 --> 00:01:10.519
22
00:01:10.560 --> 00:01:14.200
23
00:01:14.239 --> 00:01:17.719
24
00:01:17.719 --> 00:01:20.239
25
00:01:20.280 --> 00:01:24.719
26
00:01:25.000 --> 00:01:27.799
27
00:01:28.280 --> 00:01:30.879
28
00:01:30.959 --> 00:01:33.640
29
00:01:33.680 --> 00:01:36.959
30
00:01:37.000 --> 00:01:40.120
31
00:01:40.319 --> 00:01:43.480
32
00:01:43.519 --> 00:01:46.799
33
00:01:46.920 --> 00:01:49.840
34
00:01:49.840 --> 00:01:52.480
35
00:01:52.560 --> 00:01:55.599
36
00:01:55.640 --> 00:02:00.079
37
00:02:00.120 --> 00:02:03.760
38
00:02:03.760 --> 00:02:06.719
39
00:02:06.760 --> 00:02:09.360
40
00:02:09.400 --> 00:02:12.400
41
00:02:12.520 --> 00:02:16.159
42
00:02:16.199 --> 00:02:18.919
43
00:02:18.960 --> 00:02:21.599
44
00:02:21.680 --> 00:02:25.439
45
00:02:25.439 --> 00:02:28.719
46
00:02:28.759 --> 00:02:31.159
47
00:02:31.159 --> 00:02:35.639
48
00:02:35.680 --> 00:02:39.000
49
00:02:39.120 --> 00:02:42.520
50
00:02:42.520 --> 00:02:46.639
51
00:02:46.680 --> 00:02:50.800
52
00:02:50.879 --> 00:02:53.879
53
00:02:53.879 --> 00:02:57.319
54
00:02:57.560 --> 00:03:01.039
55
00:03:01.080 --> 00:03:04.840
56
00:03:04.919 --> 00:03:07.520
57
00:03:07.639 --> 00:03:11.159
58
00:03:11.199 --> 00:03:14.639
59
00:03:14.639 --> 00:03:16.840
60
00:03:16.879 --> 00:03:19.840
61
00:03:19.840 --> 00:03:23.879
62
00:03:23.919 --> 00:03:26.759
63
00:03:26.800 --> 00:03:30.319
64
00:03:30.360 --> 00:03:32.919
65
00:03:32.960 --> 00:03:35.240
66
00:03:35.280 --> 00:03:39.080
67
00:03:39.280 --> 00:03:41.719
68
00:03:41.759 --> 00:03:45.159
69
00:03:45.240 --> 00:03:48.520
70
00:03:48.520 --> 00:03:51.439
71
00:03:51.439 --> 00:03:54.800
72
00:03:54.800 --> 00:03:58.479
73
00:03:58.639 --> 00:04:01.680
74
00:04:01.800 --> 00:04:05.520
75
00:04:05.560 --> 00:04:08.560
76
00:04:08.599 --> 00:04:11.360
77
00:04:11.400 --> 00:04:14.759
78
00:04:15.199 --> 00:04:17.399
79
00:04:17.439 --> 00:04:20.399
80
00:04:20.399 --> 00:04:22.480
81
00:04:22.639 --> 00:04:25.560
82
00:04:25.560 --> 00:04:29.199
83
00:04:29.240 --> 00:04:32.319
84
00:04:32.360 --> 00:04:35.279
85
00:04:35.279 --> 00:04:37.839
86
00:04:37.879 --> 00:04:40.600
87
00:04:40.879 --> 00:04:44.720
88
00:04:44.800 --> 00:04:48.639
89
00:04:48.879 --> 00:04:51.600
90
00:04:51.639 --> 00:04:54.800
91
00:04:54.920 --> 00:04:58.120
92
00:04:58.120 --> 00:05:01.120
93
00:05:01.120 --> 00:05:03.639
94
00:05:03.639 --> 00:05:07.079
95
00:05:07.120 --> 00:05:11.879
96
00:05:11.920 --> 00:05:14.600
97
00:05:14.639 --> 00:05:18.199
98
00:05:18.399 --> 00:05:21.000
99
00:05:21.000 --> 00:05:23.800
100
00:05:23.879 --> 00:05:26.319
101
00:05:26.639 --> 00:05:29.439
102
00:05:29.519 --> 00:05:33.240
103
00:05:33.480 --> 00:05:36.879
104
00:05:36.879 --> 00:05:39.959
105
00:05:39.959 --> 00:05:44.399
106
00:05:44.439 --> 00:05:46.399
107
00:05:46.480 --> 00:05:50.360
108
00:05:50.879 --> 00:05:53.639
109
00:05:54.079 --> 00:05:57.160
110
00:05:57.480 --> 00:06:00.199
111
00:06:00.199 --> 00:06:03.839
112
00:06:03.879 --> 00:06:06.639
113
00:06:06.720 --> 00:06:09.600
114
00:06:09.800 --> 00:06:12.800
115
00:06:13.000 --> 00:06:16.079
116
00:06:16.120 --> 00:06:18.920
117
00:06:19.360 --> 00:06:23.480
118
00:06:24.079 --> 00:06:26.560
119
00:06:26.800 --> 00:06:29.399
120
00:06:29.439 --> 00:06:31.879
121
00:06:32.160 --> 00:06:35.360
122
00:06:35.439 --> 00:06:40.839
123
00:06:41.279 --> 00:06:43.959
124
00:06:44.000 --> 00:06:47.160
125
00:06:47.240 --> 00:06:49.959
126
00:06:49.959 --> 00:06:54.959
127
00:06:55.000 --> 00:06:59.040
128
00:06:59.079 --> 00:07:01.800
129
00:07:01.839 --> 00:07:05.120
130
00:07:05.160 --> 00:07:07.480
131
00:07:07.560 --> 00:07:11.079
132
00:07:11.120 --> 00:07:13.959
133
00:07:13.959 --> 00:07:16.600
134
00:07:16.639 --> 00:07:20.360
135
00:07:20.399 --> 00:07:24.199
136
00:07:24.199 --> 00:07:27.920
137
00:07:27.920 --> 00:07:30.560
138
00:07:30.680 --> 00:07:33.319
139
00:07:33.360 --> 00:07:36.120
140
00:07:36.160 --> 00:07:39.279
141
00:07:39.279 --> 00:07:42.519
142
00:07:42.600 --> 00:07:45.800
143
00:07:45.800 --> 00:07:48.759
144
00:07:48.759 --> 00:07:51.920
145
00:07:51.959 --> 00:07:55.560
146
00:07:55.600 --> 00:07:58.839
147
00:07:59.279 --> 00:08:02.439
148
00:08:02.439 --> 00:08:05.759
149
00:08:05.920 --> 00:08:09.519
150
00:08:09.560 --> 00:08:12.759
151
00:08:12.800 --> 00:08:16.759
152
00:08:16.839 --> 00:08:19.839
153
00:08:19.839 --> 00:08:22.480
154
00:08:22.560 --> 00:08:25.920
155
00:08:25.920 --> 00:08:28.480
156
00:08:28.519 --> 00:08:32.440
157
00:08:32.440 --> 00:08:35.919
158
00:08:36.000 --> 00:08:40.639
159
00:08:40.720 --> 00:08:43.480
160
00:08:43.519 --> 00:08:46.519
161
00:08:46.600 --> 00:08:48.960
162
00:08:49.000 --> 00:08:51.600
163
00:08:51.639 --> 00:08:55.320
164
00:08:55.679 --> 00:08:59.080
165
00:08:59.120 --> 00:09:01.960
166
00:09:01.960 --> 00:09:04.879
167
00:09:04.919 --> 00:09:07.960
168
00:09:07.960 --> 00:09:11.360
169
00:09:11.679 --> 00:09:14.919
170
00:09:15.000 --> 00:09:19.039
171
00:09:19.080 --> 00:09:23.080
172
00:09:23.120 --> 00:09:25.519
173
00:09:26.159 --> 00:09:29.120
174
00:09:29.200 --> 00:09:31.879
175
00:09:31.919 --> 00:09:35.360
176
00:09:35.600 --> 00:09:39.159
177
00:09:39.440 --> 00:09:41.799
178
00:09:41.879 --> 00:09:45.080
179
00:09:45.120 --> 00:09:47.720
180
00:09:47.720 --> 00:09:50.000
181
00:09:50.039 --> 00:09:53.200
182
00:09:53.279 --> 00:09:56.840
183
00:09:56.919 --> 00:10:00.519
184
00:10:00.679 --> 00:10:03.000
185
00:10:03.000 --> 00:10:06.720
186
00:10:06.720 --> 00:10:10.000
187
00:10:10.399 --> 00:10:12.879
188
00:10:12.960 --> 00:10:16.320
189
00:10:16.399 --> 00:10:20.440
190
00:10:20.480 --> 00:10:23.639
191
00:10:23.679 --> 00:10:25.600
192
00:10:25.679 --> 00:10:28.879
193
00:10:28.919 --> 00:10:32.799
194
00:10:32.840 --> 00:10:36.720
195
00:10:36.759 --> 00:10:39.840
196
00:10:39.919 --> 00:10:42.720
197
00:10:42.759 --> 00:10:46.559
198
00:10:46.600 --> 00:10:49.159
199
00:10:49.200 --> 00:10:52.240
200
00:10:52.279 --> 00:10:55.519
201
00:10:55.639 --> 00:10:59.399
202
00:10:59.399 --> 00:11:03.039
203
00:11:03.120 --> 00:11:05.320
204
00:11:05.360 --> 00:11:08.600
205
00:11:08.679 --> 00:11:12.399
206
00:11:12.480 --> 00:11:16.000
207
00:11:16.080 --> 00:11:19.039
208
00:11:19.080 --> 00:11:22.679
209
00:11:22.759 --> 00:11:26.399
210
00:11:26.559 --> 00:11:29.960
211
00:11:30.159 --> 00:11:33.399
212
00:11:33.440 --> 00:11:37.000
213
00:11:37.000 --> 00:11:40.519
214
00:11:40.879 --> 00:11:44.600
215
00:11:44.600 --> 00:11:48.240
216
00:11:48.320 --> 00:11:50.879
217
00:11:50.879 --> 00:11:54.159
218
00:11:54.159 --> 00:11:58.200
219
00:11:58.399 --> 00:12:00.559
220
00:12:00.600 --> 00:12:04.559
221
00:12:04.600 --> 00:12:08.200
222
00:12:08.559 --> 00:12:12.679
223
00:12:12.799 --> 00:12:15.279
224
00:12:15.279 --> 00:12:18.600
225
00:12:18.600 --> 00:12:21.759
226
00:12:21.840 --> 00:12:24.960
227
00:12:25.000 --> 00:12:29.080
228
00:12:29.159 --> 00:12:33.279
229
00:12:33.360 --> 00:12:36.600
230
00:12:36.639 --> 00:12:40.000
231
00:12:40.240 --> 00:12:43.639
232
00:12:43.639 --> 00:12:47.039
233
00:12:47.039 --> 00:12:50.320
234
00:12:50.720 --> 00:12:53.759
235
00:12:53.840 --> 00:12:56.200
236
00:12:56.240 --> 00:12:58.960
237
00:12:58.960 --> 00:13:02.440
238
00:13:02.919 --> 00:13:06.840