Copilot Is Broken Until You Do THIS
Out-of-the-box Microsoft Copilot sounds like a genius—but in real enterprises it’s a dangerously confident intern. In this episode, we expose where default Copilot quietly fails on the questions that actually matter: “Can I share this file?”, “Who’s on-call right now?”, “Is this HIPAA-safe?” You’ll see how generic, Graph-only Copilot ignores your DLP exceptions, regional SOPs, escalation paths, and legal memos—and why that’s how incidents are born.
Then we show you the fix: plug your own specialist engine agent straight into Microsoft 365 Copilot Chat with a simple manifest upgrade. You’ll learn the retrieval + actions + guardrails pattern, how to wire Azure AI Search, internal APIs, and tenant controls, and the exact schema 1.22 tweaks (copilotAgents + customEngineAgents) that flip Copilot from smooth-talking generalist to hard-edged policy enforcer.
The before vs. after is brutal: vague essays and hallucinated “best practices” turn into crisp, cited decisions, “Page now” buttons, and answers your security team will actually sign off on. Same UI, different brain—the intern just graduated.
Out-of-the-box Microsoft Copilot sounds confident—but in real organizations, it frequently gives generic, incomplete, or misleading answers about internal rules, DLP policies, regional SOPs, and compliance workflows. The problem isn’t the model. The problem is that Copilot doesn’t know your company’s rules, exceptions, or processes. In this episode, you’ll learn the exact fix: bring your own custom engine agent—your own specialist—into Microsoft 365 Copilot Chat using a simple manifest upgrade. We break down why default Copilot fails, what custom agents can do that Copilot can’t, the architecture behind retrieval + actions + guardrails, and the two-minute manifest tweak that unlocks Copilot Chat. If you want to eliminate hallucinations, increase policy accuracy, and make Copilot a real enterprise asset instead of a polite intern, this is your playbook. What You’ll Learn in This Episode 1. The Real Reason Copilot Feels “Broken” in Enterprises Despite the hype, default Copilot cannot:
- Interpret your company’s DLP exceptions
- Apply region-specific SOPs
- Follow internal escalation rules
- Know your compliance restrictions
- Understand your security classifications
- Execute your internal decision trees
Because Copilot is grounded in public knowledge + Microsoft Graph, it becomes a generalist—great at broad help, terrible at local nuance. We explore real examples:
- “Can I share this customer spreadsheet externally?” → Generic answer, missing your DLP exception list
- “Who handles a Sev-2 outage in EMEA after 6 p.m.?” → Generic ITIL nonsense
- “Can we send HIPAA updates via Outlook campaigns?” → A polite hallucination that ignores legal rules
These answers sound authoritative—but they’re dangerously incomplete. You’ll learn why users trust these confident responses, how incidents happen, and why “Copilot hallucination” is often just “missing internal policy context.” 2. Why Your Organization Needs a Specialist, Not a Generalist A custom engine agent fixes the gap by giving Copilot: ✔ Your rules ✔ Your policies ✔ Your SOPs ✔ Your exceptions ✔ Your approvals ✔ Your internal APIs ✔ Your decision logic ✔ Your citations A specialist agent is not a plugin and not a fancy prompt. It’s a governed, orchestrated agent with:
- Your retrieval index (Azure AI Search)
- Your actions (internal APIs, policy lookups, exception verification)
- Your guardrails (tenant controls + data scopes)
- Your reasoning (Semantic Kernel / LangChain orchestration)
Copilot becomes the user interface.
Your agent becomes the brain. 3. Where Default Copilot Fails (With Real Examples) We break down three high-risk categories: A. Data Loss Prevention (DLP) Questions Copilot knows Microsoft’s DLP theory but not your:
- Project-code exceptions
- Allowed domains
- Threshold rules
- Special carve-outs
- Vendor sharing restrictions
Without a specialist agent, it answers confidently—and wrong. B. Regional + Role-Specific SOPs Users ask: “It’s 19:10 CET. Sev-2 in EMEA. Who do I page?” Default Copilot:
- Quotes ITIL
- Suggests calling “the on-call team”
- Misses the actual after-hours vendor
- Misses the 20-minute SLA
- Misses the escalation chain
Your agent can answer with:
- The correct vendor
- The correct channel
- The SLA
- A “Page Now” action
- The exact SOP citation
C. Compliance & Legal Requirements Default Copilot can’t recall:
- HIPAA communication rules
- GDPR region-specific handling
- SOC2 audit requirements
- Legal memos
- Confidentiality exceptions
Your agent can fetch the real memo and produce a compliant answer with citations. 4. The Architecture Behind a Real Enterprise-Ready Agent This episode walks you step-by-step through the specialist architecture: Retrieval (your knowledge)
- Azure AI Search
- Hybrid search: vector + keyword
- Chunking optimized for policy documents
- Entity extraction for project codes, regions, severities, etc.
Orchestration (your reasoning)
- Semantic Kernel planners
- LangChain tools + chains
- Typed outputs instead of prose
- Deterministic response patterns
Actions (your operational truth) Examples:
- ValidateProjectCode
- CheckOnCallSchedule
- LookupDlpException
- VerifyComplianceChannel
These actions collapse uncertainty into simple, accurate decisions. Guardrails (your safety mechanisms)
- Tenant controls
- Data-scope boundaries
- RAI filters
- Logging, observability, redaction
This is what makes security teams approve the rollout. 5. The Two-Minute Manifest Tweak That Changes Everything This is the core of the episode—the part most people miss. To make your specialist appear inside Copilot Chat, you must: 1. Upgrade the manifest schema to 1.22 Copilot Chat looks for capabilities that only exist in schema 1.22. 2. Add copilotAgents and customEngineAgents capabilities No tag → no Copilot
Tag present → Copilot routes queries to your agent 3. Add conversation starters (up to 12) These teach users what your agent knows:
- “Ask about DLP sharing exceptions”
- “Check EMEA after-hours escalation path”
- “Verify HIPAA-approved communication channels”
4. Define actions clearly These help Copilot’s planner understand:
- What your agent can do
- How to call your APIs
- What parameters exist
- What work it can automate
5. Package & deploy Once deployed, your agent shows up in Copilot Chat’s sidebar and can be invoked inline. This simple manifest tweak turns Copilot from a generalist into a specialist gateway. 6. Before vs. After: The Proof We walk through the dramatic differences: Before (Default Copilot)
- Generic answers
- Hallucinated best practices
- Wrong SOP routing
- Missing DLP exceptions
- No links to internal processes
- High-risk compliance answers
After (Your Specialist Engine Agent)
- Precise decisions
- API-verified logic
- Only your approved policies
- Region-specific, time-specific answers
- Action buttons
- Full citations with permalinks
- Reduced hallucinations
- Faster time-to-answer
Same UI.
Different brain. 7. Governance, Lifecycle, and Scaling to the Enterprise We cover the operational side:
- How to version your agent
- How to evaluate hallucinations weekly
- How to tie outputs to citations
- How to manage environment boundaries
Wh
Become a supporter of this podcast: https://www.spreaker.com/podcast/m365-show-podcast--6704921/support.
Follow us on:
LInkedIn
Substack
1
00:00:00,000 --> 00:00:02,440
You think Copilot is smart because it speaks confidently.
2
00:00:02,440 --> 00:00:04,520
The truth, it's a pleasant generalist that
3
00:00:04,520 --> 00:00:07,720
stalls the second your question touches your company's rules.
4
00:00:07,720 --> 00:00:11,000
You paid for a digital butler, you got an intern with Wi-Fi.
5
00:00:11,000 --> 00:00:12,120
Here's the fix.
6
00:00:12,120 --> 00:00:16,160
Bring your custom engine agent into Microsoft 365 Copilot chat.
7
00:00:16,160 --> 00:00:18,520
I'll show you the exact two-minute manifest tweak that
8
00:00:18,520 --> 00:00:21,600
unlocks it plus the guardrails to keep security teams calm.
9
00:00:21,600 --> 00:00:23,320
We'll test it against real policies,
10
00:00:23,320 --> 00:00:26,360
data loss prevention, regional SOPs and compliance,
11
00:00:26,360 --> 00:00:28,880
so you see speed, accuracy, and fewer hallucinations
12
00:00:28,880 --> 00:00:29,880
immediately.
13
00:00:29,880 --> 00:00:32,960
Let's upgrade the intern into a specialist.
14
00:00:32,960 --> 00:00:36,080
The problem, where default Copilot fails in the real world.
15
00:00:36,080 --> 00:00:36,800
Picture this.
16
00:00:36,800 --> 00:00:39,160
You ask Copilot, can I share a customer spreadsheet
17
00:00:39,160 --> 00:00:40,600
with an external vendor?
18
00:00:40,600 --> 00:00:43,040
It gives a polished paragraph on best practices,
19
00:00:43,040 --> 00:00:45,800
sprinkles in generic Microsoft 365 guidance,
20
00:00:45,800 --> 00:00:48,120
and spoiler alert says nothing about your tenant's
21
00:00:48,120 --> 00:00:49,800
data loss prevention policy.
22
00:00:49,800 --> 00:00:51,400
The average user notes shares the file
23
00:00:51,400 --> 00:00:52,600
and triggers a policy alert.
24
00:00:52,600 --> 00:00:54,840
Yes, they require restarts, pause it,
25
00:00:54,840 --> 00:00:56,960
because Microsoft is not performing magic tricks.
26
00:00:56,960 --> 00:01:00,120
Copilot cannot intuit your internal rules unless you inject them.
27
00:01:00,120 --> 00:01:01,760
Now compare that to what you actually need.
28
00:01:01,760 --> 00:01:04,120
Your security team defined a DLP policy
29
00:01:04,120 --> 00:01:07,240
with very specific conditions, customer PII over a threshold,
30
00:01:07,240 --> 00:01:09,520
financial identifiers, sharing destinations
31
00:01:09,520 --> 00:01:13,080
limited by region and exceptions for a named project code.
32
00:01:13,080 --> 00:01:15,320
Default Copilot doesn't know your exception list.
33
00:01:15,320 --> 00:01:17,320
It doesn't know your project Orion carve out.
34
00:01:17,320 --> 00:01:18,840
It can describe DLP in theory,
35
00:01:18,840 --> 00:01:20,520
it can't apply your DLP in practice.
36
00:01:20,520 --> 00:01:22,200
That gap is where incidents happen.
37
00:01:22,200 --> 00:01:24,880
Let's run a second scenario, regional SOPs.
38
00:01:24,880 --> 00:01:26,800
You ask, what's the approved escalation
39
00:01:26,800 --> 00:01:30,360
path for a servo2 outage in EMIRA after 6pm?
40
00:01:30,360 --> 00:01:33,640
Default Copilot happily cites generic ITIL principles,
41
00:01:33,640 --> 00:01:37,040
then recommends contacting the on-call team, fascinating,
42
00:01:37,040 --> 00:01:37,960
which team?
43
00:01:37,960 --> 00:01:41,320
In your org, EMIRA after hours roots to a regional vendor,
44
00:01:41,320 --> 00:01:44,320
not the core SRE rotation, there's a name distribution list,
45
00:01:44,320 --> 00:01:47,320
a paging policy, and a 20 minute acknowledgement SLA tied
46
00:01:47,320 --> 00:01:48,440
to a penalty clause.
47
00:01:48,440 --> 00:01:51,280
The generic answer is not only useless, it's dangerous.
48
00:01:51,280 --> 00:01:54,000
The clock is ticking and you're emailing the wrong queue.
49
00:01:54,000 --> 00:01:56,720
Third scenario, compliance.
50
00:01:56,720 --> 00:01:58,840
Are we allowed to email HIPAA related updates
51
00:01:58,840 --> 00:02:01,120
to patients using Outlook campaigns?
52
00:02:01,120 --> 00:02:03,840
Default Copilot will serenade you with reassuring words
53
00:02:03,840 --> 00:02:05,880
about encryption and secure handling.
54
00:02:05,880 --> 00:02:07,880
It will not recall your legal department's memo
55
00:02:07,880 --> 00:02:09,880
that mandates patient notices must go
56
00:02:09,880 --> 00:02:12,520
through the patient portal with multifactor re-auth,
57
00:02:12,520 --> 00:02:13,720
not bulk email.
58
00:02:13,720 --> 00:02:16,600
Again, general knowledge, masquerading, as authority,
59
00:02:16,600 --> 00:02:18,720
if you wanted a TED talk, you wouldn't be here.
60
00:02:18,720 --> 00:02:19,840
Why does this keep happening?
61
00:02:19,840 --> 00:02:22,240
Because out of the box, Copilot is optimized for breadth,
62
00:02:22,240 --> 00:02:23,040
not your depth.
63
00:02:23,040 --> 00:02:25,480
It's grounded by Microsoft Graph and Public Knowledge,
64
00:02:25,480 --> 00:02:27,400
which is the right default for most people,
65
00:02:27,400 --> 00:02:29,600
but the wrong default for enterprise precision.
66
00:02:29,600 --> 00:02:31,320
Think of Microsoft Graph as the skeleton,
67
00:02:31,320 --> 00:02:33,120
structure without your muscle memory.
68
00:02:33,120 --> 00:02:35,400
Your policies, SOPs and compliance interpretations
69
00:02:35,400 --> 00:02:36,600
are the tendons and nerves.
70
00:02:36,600 --> 00:02:39,120
Without them, the system moves, but clumsily.
71
00:02:39,120 --> 00:02:41,800
And the thing most people miss, hallucinations feel helpful.
72
00:02:41,800 --> 00:02:43,960
The language is coherent, the verbs are confident.
73
00:02:43,960 --> 00:02:45,080
So users trusted.
74
00:02:45,080 --> 00:02:47,920
The problem isn't that Copilot is malicious, it's oblivious.
75
00:02:47,920 --> 00:02:49,280
It doesn't know your tenant controls,
76
00:02:49,280 --> 00:02:51,040
your RRI filters, your data scopes,
77
00:02:51,040 --> 00:02:53,240
or the political landmines inside your company.
78
00:02:53,240 --> 00:02:54,880
It's not just missing a database,
79
00:02:54,880 --> 00:02:57,040
it's missing the spine of your governance.
80
00:02:57,040 --> 00:02:59,120
I watched a team waste an hour last week
81
00:02:59,120 --> 00:03:01,280
chasing a recommended approval path
82
00:03:01,280 --> 00:03:03,720
Copilot fabricated from adjacent truths.
83
00:03:03,720 --> 00:03:06,040
They had the right documents, buried in SharePoint,
84
00:03:06,040 --> 00:03:08,600
written in legalese, updated quarterly.
85
00:03:08,600 --> 00:03:10,560
Default Copilot couldn't follow the breadcrumbs
86
00:03:10,560 --> 00:03:12,440
because it didn't know which crumbs mattered.
87
00:03:12,440 --> 00:03:15,320
When this finally clicks, you stop expecting the generalist
88
00:03:15,320 --> 00:03:16,560
to behave like a specialist
89
00:03:16,560 --> 00:03:18,400
and start wiring in the specialist.
90
00:03:18,400 --> 00:03:19,920
Here's the brutal audit.
91
00:03:19,920 --> 00:03:22,520
Anywhere your question depends on local nuance.
92
00:03:22,520 --> 00:03:27,160
Exception codes, region-specific SOPs, conditional approvals,
93
00:03:27,160 --> 00:03:30,080
trade-restricted SKUs, retention schedules,
94
00:03:30,080 --> 00:03:32,080
or compensation rules.
95
00:03:32,080 --> 00:03:35,360
Default Copilot will either generalize or hallucinate.
96
00:03:35,360 --> 00:03:38,600
Both waste time, one creates risk.
97
00:03:38,600 --> 00:03:41,080
The truth, until you bring your own engine,
98
00:03:41,080 --> 00:03:43,680
your retrieval, your actions, your policies,
99
00:03:43,680 --> 00:03:45,800
Copilot will skate on the surface.
100
00:03:45,800 --> 00:03:48,160
You need an agent that can pass your policy text,
101
00:03:48,160 --> 00:03:50,400
apply your decision tree, respect your tenant boundaries,
102
00:03:50,400 --> 00:03:52,560
and answer with citations from your corpus.
103
00:03:52,560 --> 00:03:54,280
Otherwise, you're paying for premium tires
104
00:03:54,280 --> 00:03:55,640
and driving on ice.
105
00:03:55,640 --> 00:03:57,760
The fix is not a motivational speech.
106
00:03:57,760 --> 00:04:00,360
It's a two-minute manifest change that invites your specialist
107
00:04:00,360 --> 00:04:01,680
into Copilot chat.
108
00:04:01,680 --> 00:04:04,320
And yes, we're doing that next on the better method,
109
00:04:04,320 --> 00:04:06,400
custom engine agents as specialists.
110
00:04:06,400 --> 00:04:08,640
Enter the specialist, a custom engine agent.
111
00:04:08,640 --> 00:04:10,800
Not a cute plug-in, not a glorified prompt,
112
00:04:10,800 --> 00:04:13,760
a programmable colleague that speaks your organization's dialect
113
00:04:13,760 --> 00:04:15,240
because you taught it the language.
114
00:04:15,240 --> 00:04:18,240
The truth, this is how you stop Copilot from freelancing
115
00:04:18,240 --> 00:04:20,160
and started practicing policy.
116
00:04:20,160 --> 00:04:21,120
What is it precisely?
117
00:04:21,120 --> 00:04:22,800
A custom engine agent is an agent you
118
00:04:22,800 --> 00:04:25,640
build with real orchestration, semantic kernel or lang chain,
119
00:04:25,640 --> 00:04:27,440
wired into your data and your rules.
120
00:04:27,440 --> 00:04:30,480
It plans, calls, tools, keeps state,
121
00:04:30,480 --> 00:04:32,760
and returns answers grounded in your corpus.
122
00:04:32,760 --> 00:04:35,400
Think of default Copilot as the operating room.
123
00:04:35,400 --> 00:04:38,400
Your custom agent is the surgeon you credential to work there.
124
00:04:38,400 --> 00:04:39,760
No credentials, no scalpel.
125
00:04:39,760 --> 00:04:40,840
Why this works is simple.
126
00:04:40,840 --> 00:04:42,800
The thing most people miss is that authority
127
00:04:42,800 --> 00:04:45,960
comes from three ingredients, retrieval, actions, and guardrails.
128
00:04:45,960 --> 00:04:48,960
Retrieval gives the agent memory beyond vibes.
129
00:04:48,960 --> 00:04:50,640
Actions let it do real work.
130
00:04:50,640 --> 00:04:54,080
Hit an API, query a system, generate a report.
131
00:04:54,080 --> 00:04:56,880
Guardrails keep it inside the lines, tenant controls,
132
00:04:56,880 --> 00:04:59,320
responsible AI filters, and data scopes,
133
00:04:59,320 --> 00:05:01,480
so it never wanders into places it shouldn't.
134
00:05:01,480 --> 00:05:03,680
Let me show you exactly how the specialist is assembled.
135
00:05:03,680 --> 00:05:04,760
First, retrieval.
136
00:05:04,760 --> 00:05:07,280
You connect Azure AI search to your governed content,
137
00:05:07,280 --> 00:05:10,800
SOPs, policy pages, legal memos, DLP rule descriptions,
138
00:05:10,800 --> 00:05:11,640
and change logs.
139
00:05:11,640 --> 00:05:13,960
Use hybrid search, vector plus keyword,
140
00:05:13,960 --> 00:05:16,000
because your users ask messy questions
141
00:05:16,000 --> 00:05:18,160
and your documents use precise terms.
142
00:05:18,160 --> 00:05:21,400
The agent decomposes the query, pulls targeted chunks,
143
00:05:21,400 --> 00:05:23,200
and returns answers with citations.
144
00:05:23,200 --> 00:05:24,760
If you remember nothing else, citations
145
00:05:24,760 --> 00:05:26,920
are the antidote to hallucinations.
146
00:05:26,920 --> 00:05:29,960
Second, reasoning and orchestration.
147
00:05:29,960 --> 00:05:34,680
With semantic kernel, you define plug-ins, skills that encapsulate tasks.
148
00:05:34,680 --> 00:05:37,720
Get DLP exception list, resolve regional escalation,
149
00:05:37,720 --> 00:05:39,320
check retention schedule.
150
00:05:39,320 --> 00:05:41,720
The planner can chain these based on the question.
151
00:05:41,720 --> 00:05:44,160
Lang chain does the same with tools and chains
152
00:05:44,160 --> 00:05:46,440
if you prefer more open architecture.
153
00:05:46,440 --> 00:05:50,040
Either way, you get a repeatable flow instead of improv theater.
154
00:05:50,040 --> 00:05:51,600
Third, actions.
155
00:05:51,600 --> 00:05:53,600
Retrieval is nice, action is value.
156
00:05:53,600 --> 00:05:57,160
Your agent can call internal APIs to validate an exception code,
157
00:05:57,160 --> 00:05:59,200
check who's on call in a mere tonight,
158
00:05:59,200 --> 00:06:02,400
or verify that a patient notification channel is approved.
159
00:06:02,400 --> 00:06:06,960
The shortcut nobody teaches, tiny boring actions drive massive accuracy.
160
00:06:06,960 --> 00:06:09,240
A single validate project code tool
161
00:06:09,240 --> 00:06:13,080
can collapse paragraphs of uncertainty into a one-line answer.
162
00:06:13,080 --> 00:06:15,000
Now guardrails, before we continue,
163
00:06:15,000 --> 00:06:19,200
you need to understand this, or you'll get blocked by security in 20 minutes.
164
00:06:19,200 --> 00:06:21,320
Tenant controls determine where the agent can run
165
00:06:21,320 --> 00:06:22,880
and what data it can touch.
166
00:06:22,880 --> 00:06:25,400
Data scopes, restrict indices, and repositories
167
00:06:25,400 --> 00:06:27,960
by department, region, and sensitivity.
168
00:06:27,960 --> 00:06:30,760
Our eye filters handle content safety and prompt hygiene.
169
00:06:30,760 --> 00:06:33,040
Compare that to letting a new hire roam your file shares
170
00:06:33,040 --> 00:06:36,160
because they ask nicely, no, you scope them, you monitor them,
171
00:06:36,160 --> 00:06:37,240
you log them.
172
00:06:37,240 --> 00:06:37,960
Same here.
173
00:06:37,960 --> 00:06:40,720
Observability and access boundaries are not optional.
174
00:06:40,720 --> 00:06:43,000
Once you nail that, everything else clicks.
175
00:06:43,000 --> 00:06:44,640
The agent doesn't guess at DLP.
176
00:06:44,640 --> 00:06:48,080
It reads your DLP artifact, checks the exception list, and returns.
177
00:06:48,080 --> 00:06:50,280
No, external sharing is blocked for PII
178
00:06:50,280 --> 00:06:52,520
unless Project Orion Exemption is active.
179
00:06:52,520 --> 00:06:53,800
Open a request here.
180
00:06:53,800 --> 00:06:57,920
For regional SOPs, it uses time, region, and severity as parameters,
181
00:06:57,920 --> 00:07:02,840
then cites the exact page that names the after-hours vendor and the 20-minute SLA.
182
00:07:02,840 --> 00:07:06,200
For HIPPA notices, it calls the policy endpoint and replies,
183
00:07:06,200 --> 00:07:10,360
use the patient portal, MFA-inforced, bulk email is prohibited,
184
00:07:10,360 --> 00:07:13,200
with the legal memo link, Confidence with Receipts.
185
00:07:13,200 --> 00:07:15,480
The game changer nobody talks about is ownership.
186
00:07:15,480 --> 00:07:17,960
With a custom engine agent, you control the model choice,
187
00:07:17,960 --> 00:07:20,680
the retrieval settings, the action contracts, and the logging.
188
00:07:20,680 --> 00:07:24,360
You can swap GPT variants, tune chunking, re-rank results,
189
00:07:24,360 --> 00:07:25,520
and instrument every step.
190
00:07:25,520 --> 00:07:28,360
That's how you get speed, accuracy, and reduced hallucinations
191
00:07:28,360 --> 00:07:30,000
without praying to the demo gods.
192
00:07:30,000 --> 00:07:32,520
And yes, you still want co-pilot's UX and discovery.
193
00:07:32,520 --> 00:07:33,520
That's where we're headed.
194
00:07:33,520 --> 00:07:36,840
The two-minute manifest tweak that gives your specialist a seat in co-pilot chat,
195
00:07:36,840 --> 00:07:41,000
so users ask one assistant and the right brain answers, you build a surgeon,
196
00:07:41,000 --> 00:07:43,520
then you badge it into the operating room.
197
00:07:43,520 --> 00:07:46,160
The two-minute manifest tweak that unlocks co-pilot chat.
198
00:07:46,160 --> 00:07:48,080
Okay, surgeon built, now we badge them in.
199
00:07:48,080 --> 00:07:50,080
This is the two-minute tweak everyone skips,
200
00:07:50,080 --> 00:07:52,840
then wonders why their agent only shows up in teams.
201
00:07:52,840 --> 00:07:56,440
It's not magic, it's the manifest, the apps passport, update the passport,
202
00:07:56,440 --> 00:07:58,800
the border agent waves you through, refuse,
203
00:07:58,800 --> 00:08:01,160
and you stay in the lobby with the average users.
204
00:08:01,160 --> 00:08:02,720
Step one, bump the schema.
205
00:08:02,720 --> 00:08:05,280
Open your app manifest JSON, find the schema reference,
206
00:08:05,280 --> 00:08:09,200
and update it to version 1.22, not 120, not latest.
207
00:08:09,200 --> 00:08:13,600
1.22, the truth, co-pilot chat looks for capabilities that only exist in that schema.
208
00:08:13,600 --> 00:08:17,040
If the schema's older, your agent can be perfect and still get ignored,
209
00:08:17,040 --> 00:08:19,040
you called it the cloud.
210
00:08:19,040 --> 00:08:21,880
Incorrect, it's a distributed rules engine,
211
00:08:21,880 --> 00:08:24,640
and the schema is how you speak those rules.
212
00:08:24,640 --> 00:08:27,880
Step two, tell co-pilot you are, in fact, a co-pilot agent.
213
00:08:27,880 --> 00:08:31,360
In the capabilities section for your bot at the co-pilot agent tags,
214
00:08:31,360 --> 00:08:34,360
you'll see a capabilities array include co-pilot agents,
215
00:08:34,360 --> 00:08:38,440
and if you're bringing your own orchestration, declare custom engine agents.
216
00:08:38,440 --> 00:08:39,680
This is the identity badge.
217
00:08:39,680 --> 00:08:43,600
Without it, co-pilot assumes you're just here for team's small talk and adaptive cards.
218
00:08:43,600 --> 00:08:46,920
With it, co-pilot routes relevant chats to your specialist.
219
00:08:46,920 --> 00:08:49,400
Inside the same experience, users already live in.
220
00:08:49,400 --> 00:08:51,160
Step three, conversation starters.
221
00:08:51,160 --> 00:08:54,400
Still in 1.22, you can define up to 12 localized prompts.
222
00:08:54,400 --> 00:08:55,600
Use them.
223
00:08:55,600 --> 00:08:57,840
Ask about DLP sharing exceptions.
224
00:08:57,840 --> 00:09:00,120
Check Emia after our escalation.
225
00:09:00,120 --> 00:09:02,440
Verify HIPAA notification channel.
226
00:09:02,440 --> 00:09:03,760
These aren't decorative.
227
00:09:03,760 --> 00:09:07,840
They are discovery beacons that teach users what the agent is good at.
228
00:09:07,840 --> 00:09:10,920
Fewer aimless prompts, fewer hallucinations higher accuracy.
229
00:09:10,920 --> 00:09:14,160
Yes, this is how you train humans without making them read a manual.
230
00:09:14,160 --> 00:09:16,640
Step four, actions.
231
00:09:16,640 --> 00:09:21,240
The updated schema lets you enumerate richer actions your agent can invoke.
232
00:09:21,240 --> 00:09:24,080
Your internal APIs, your policy lookup endpoints,
233
00:09:24,080 --> 00:09:26,960
declare them cleanly with parameters and descriptions.
234
00:09:26,960 --> 00:09:28,640
Co-pilot uses this to plan.
235
00:09:28,640 --> 00:09:30,280
If you leave it blank, it guesses.
236
00:09:30,280 --> 00:09:32,480
When software guesses, people file tickets.
237
00:09:32,480 --> 00:09:33,680
Now, packaging.
238
00:09:33,680 --> 00:09:37,600
Zip the app with the updated manifest and deploy it the way you normally do.
239
00:09:37,600 --> 00:09:39,920
Teams admin center or your app catalog.
240
00:09:39,920 --> 00:09:41,760
Here's the test most people miss.
241
00:09:41,760 --> 00:09:45,240
In co-pilot chat, start typing a scenario your agent owns.
242
00:09:45,240 --> 00:09:49,840
You should see your agent offered as a participant or invoked in line based on triggers.
243
00:09:49,840 --> 00:09:54,480
If you don't, you either forgot the tags, bot ID mismatch, or your tenant controls are blocking
244
00:09:54,480 --> 00:09:55,480
the channel.
245
00:09:55,480 --> 00:09:57,480
Yes, tenant controls exist.
246
00:09:57,480 --> 00:09:59,080
No you can't wish them away.
247
00:09:59,080 --> 00:10:00,400
Verification is simple.
248
00:10:00,400 --> 00:10:02,280
Ask the three scenarios we said earlier.
249
00:10:02,280 --> 00:10:06,240
You want answers with citations from your corpus, not generic platitudes.
250
00:10:06,240 --> 00:10:07,480
Watch for latency.
251
00:10:07,480 --> 00:10:12,680
Schema 1.22 plus the right capabilities cuts routing friction so responses land faster.
252
00:10:12,680 --> 00:10:17,720
If it still feels sluggish, that's your retrieval settings, not the manifest.
253
00:10:17,720 --> 00:10:19,360
Common mistakes to avoid.
254
00:10:19,360 --> 00:10:23,840
Leaving the app scope to teams only, forgetting localization, omitting security attestations
255
00:10:23,840 --> 00:10:26,080
and deploying to the wrong environment.
256
00:10:26,080 --> 00:10:28,640
Also don't add every capability because you can.
257
00:10:28,640 --> 00:10:29,880
Precision beats maximalism.
258
00:10:29,880 --> 00:10:32,760
Declare what you actually support and log everything.
259
00:10:32,760 --> 00:10:36,960
Once you nail this, your specialist sits inside co-pilot chat like it always belonged
260
00:10:36,960 --> 00:10:37,960
there.
261
00:10:37,960 --> 00:10:39,480
Same UI, different brain.
262
00:10:39,480 --> 00:10:44,440
And yes, we're about to wrap guard rails around it so your security team doesn't faint.
263
00:10:44,440 --> 00:10:45,440
Implementation.
264
00:10:45,440 --> 00:10:48,200
Build, deploy, verify, end to end.
265
00:10:48,200 --> 00:10:49,200
Let's do the end to end.
266
00:10:49,200 --> 00:10:50,200
No hand waving.
267
00:10:50,200 --> 00:10:55,080
You'll build the specialist with real retrieval and actions, package it with the 1.22 manifest,
268
00:10:55,080 --> 00:10:59,360
deploy to your tenant and verify against DLP, regional SOPs and compliance.
269
00:10:59,360 --> 00:11:02,120
You'll see speed, accuracy, and fewer hallucinations.
270
00:11:02,120 --> 00:11:04,280
Not because we prayed, but because we engineered.
271
00:11:04,280 --> 00:11:05,280
Build.
272
00:11:05,280 --> 00:11:06,280
Part 1.
273
00:11:06,280 --> 00:11:07,280
Build.
274
00:11:07,280 --> 00:11:10,360
Create an Azure AI search index for your governed content.
275
00:11:10,360 --> 00:11:16,240
DLP policy docs, exception procedures, regional SOPs, legal memos, and change logs, use hybrid
276
00:11:16,240 --> 00:11:19,200
search, vector plus keyword, and enable semantic ranking.
277
00:11:19,200 --> 00:11:23,520
Chunk documents by headings with overlaps of context survives boundaries, store citations
278
00:11:23,520 --> 00:11:28,420
with persistent IDs, configure enrichment to extract key entities like project codes,
279
00:11:28,420 --> 00:11:33,480
regions, severities and distribution lists because spoiler alert your users ask in those terms.
280
00:11:33,480 --> 00:11:37,440
Index updates, automate them, a stale index is how yesterday's policy becomes today's
281
00:11:37,440 --> 00:11:38,440
incident.
282
00:11:38,440 --> 00:11:39,440
Build.
283
00:11:39,440 --> 00:11:40,440
Part 2.
284
00:11:40,440 --> 00:11:41,840
Orchestration that doesn't improvise.
285
00:11:41,840 --> 00:11:46,480
In semantic kernel, define plugins, get DLP exceptions, project code, resolve escalation,
286
00:11:46,480 --> 00:11:50,320
region severity, local time, check notice channel policy area.
287
00:11:50,320 --> 00:11:56,040
Give each plugin narrow testable behavior and return structured outputs, objects, not essays.
288
00:11:56,040 --> 00:12:00,240
Wrap as your AI search as a retriever with agente query planning turned on.
289
00:12:00,240 --> 00:12:05,480
Decompose complex prompts into subquaries, fan out, gather top-k, re-rank, synthesize
290
00:12:05,480 --> 00:12:08,040
with citations, set a conservative temperature.
291
00:12:08,040 --> 00:12:09,560
You want answers, not poetry.
292
00:12:09,560 --> 00:12:14,120
If you prefer lang chain tools and a graph-based agent do the same job, just apply the same discipline,
293
00:12:14,120 --> 00:12:17,000
typed outputs, defensive defaults, exhaustive logging.
294
00:12:17,000 --> 00:12:18,000
Build.
295
00:12:18,000 --> 00:12:19,000
Part 3.
296
00:12:19,000 --> 00:12:21,080
Actions that collapse uncertainty.
297
00:12:21,080 --> 00:12:22,720
Connect to your internal APIs.
298
00:12:22,720 --> 00:12:27,800
The DLP exception registry, the on-call schedule service and the compliance policy endpoint,
299
00:12:27,800 --> 00:12:31,960
protect them with managed identities and scope permissions to the agent service principle.
300
00:12:31,960 --> 00:12:35,680
Add a tiny boring tool, validate project code, project code.
301
00:12:35,680 --> 00:12:37,720
It returns true or false and a reason.
302
00:12:37,720 --> 00:12:41,560
That single boolean cuts paragraphs of hedging into a definitive sentence.
303
00:12:41,560 --> 00:12:45,600
The thing most people miss is that one or two high signal calls beat five pages of speculative
304
00:12:45,600 --> 00:12:50,280
reasoning, guardrails because you like your job, enable tenant controls, so the app is
305
00:12:50,280 --> 00:12:53,080
available only in approved environments.
306
00:12:53,080 --> 00:12:55,000
Restrict data scopes.
307
00:12:55,000 --> 00:12:59,240
The agent's retriever touches only the indices it needs, no wandering through HR files
308
00:12:59,240 --> 00:13:01,440
because someone typed benefits.
309
00:13:01,440 --> 00:13:05,720
Turn on Rye filters appropriate for your content, profanity, sensitive terms and jailbreak
310
00:13:05,720 --> 00:13:06,720
defense.
311
00:13:06,720 --> 00:13:10,960
Instrument everything, trace IDs across retrieval, tools and synthesis, log prompts and responses
312
00:13:10,960 --> 00:13:14,240
with reduction, emit metrics for latency and hit rates.
313
00:13:14,240 --> 00:13:18,720
If security asks for an audit trail and you shrug, you will lose this program in the meeting.
314
00:13:18,720 --> 00:13:23,400
Now package, update the manifest schema to 1.22 at copilot agents and if you're bringing
315
00:13:23,400 --> 00:13:27,200
your own orchestration, custom engine agents and capabilities.
316
00:13:27,200 --> 00:13:33,200
Define conversation status, 12 max localized, declare actions with parameters and descriptions.
317
00:13:33,200 --> 00:13:35,640
Include security attestations if your org requires them.
318
00:13:35,640 --> 00:13:37,560
Map your bot ID correctly.
319
00:13:37,560 --> 00:13:40,920
Production is not your devgoid, zip the app with icon privacy and terms.
320
00:13:40,920 --> 00:13:41,920
This is not busy work.
321
00:13:41,920 --> 00:13:46,160
This is the difference between visible in copilot and why can't I find it.
322
00:13:46,160 --> 00:13:49,160
Deploy via team's admin center or your app catalog.
323
00:13:49,160 --> 00:13:53,960
Go up to a pilot group first, operations and compliance champions who will actually use it.
324
00:13:53,960 --> 00:13:56,640
Verify tenant-wide settings for copilot extensibility.
325
00:13:56,640 --> 00:14:00,480
If your admin disabled external plugins and agent channels, your specialist will sit outside
326
00:14:00,480 --> 00:14:03,440
like a locked out surgeon, install for pilot users.
327
00:14:03,440 --> 00:14:07,840
In copilot chat, your agent should appear as an available participant and as inline invocation
328
00:14:07,840 --> 00:14:08,920
on relevant prompts.
329
00:14:08,920 --> 00:14:13,760
If it doesn't, check capabilities, app permission policies and bot messaging endpoints.
330
00:14:13,760 --> 00:14:16,640
Copilot doesn't guess your intent, it reads your manifest.
331
00:14:16,640 --> 00:14:18,040
Time for the three proof tests.
332
00:14:18,040 --> 00:14:23,000
One, DLP, ask, can I share this customer spreadsheet with our external vendor?
333
00:14:23,000 --> 00:14:25,080
Provide a fake path and a project code.
334
00:14:25,080 --> 00:14:26,280
Expect, no.
335
00:14:26,280 --> 00:14:30,480
External sharing of PII is blocked unless project Orion exemption is active.
336
00:14:30,480 --> 00:14:32,000
Status, inactive.
337
00:14:32,000 --> 00:14:33,200
Request exemption here.
338
00:14:33,200 --> 00:14:36,960
With a link and citations to the DLP policy and exception runbook.
339
00:14:36,960 --> 00:14:38,240
That is accuracy plus action.
340
00:14:38,240 --> 00:14:42,160
If it hedges, your validate project code tool is missing or your retriever couldn't find
341
00:14:42,160 --> 00:14:43,160
the exception policy.
342
00:14:43,160 --> 00:14:44,960
Two, regional SO piece.
343
00:14:44,960 --> 00:14:46,720
It's 1910 CET, save two in MIR.
344
00:14:46,720 --> 00:14:47,960
Who do I page?
345
00:14:47,960 --> 00:14:51,800
To the vendor's on call name, the escalation channel and the 20 minute SLA with citations
346
00:14:51,800 --> 00:14:54,680
to the SOP page and an action button to page now.
347
00:14:54,680 --> 00:14:59,000
Latency should be crisp, agentic retrieval with hybrid search and planar beats meandering
348
00:14:59,000 --> 00:15:00,080
chat.
349
00:15:00,080 --> 00:15:03,520
If it suggests contact the on call team, congratulations.
350
00:15:03,520 --> 00:15:05,480
You routed back to generic copilot.
351
00:15:05,480 --> 00:15:08,400
Check your capability tags and channel availability.
352
00:15:08,400 --> 00:15:10,040
Three, compliance.
353
00:15:10,040 --> 00:15:12,480
Can we email HIPAA updates with Outlook campaigns?
354
00:15:12,480 --> 00:15:13,480
Expect, no.
355
00:15:13,480 --> 00:15:14,800
Use patient portal with MFA?
356
00:15:14,800 --> 00:15:18,400
Bulk email prohibited with the legal memo citation and the service link.
357
00:15:18,400 --> 00:15:22,740
If you get a milk toast lecture on encryption, your index is wrong or your policy doc isn't
358
00:15:22,740 --> 00:15:24,240
authoritative in ranking.
359
00:15:24,240 --> 00:15:25,240
Tune re-ranking.
360
00:15:25,240 --> 00:15:27,880
Raise weight on compliance source of truth.
361
00:15:27,880 --> 00:15:28,880
Measure outcomes.
362
00:15:28,880 --> 00:15:32,680
Speed, compare default copilot's time to answer to the specialists.
363
00:15:32,680 --> 00:15:36,800
You should see fewer back and forths and lower end to end latency because the planar pulls
364
00:15:36,800 --> 00:15:38,200
exactly what's needed.
365
00:15:38,200 --> 00:15:41,360
Accuracy, sample answers for citations and correctness.
366
00:15:41,360 --> 00:15:42,760
Hallucinations.
367
00:15:42,760 --> 00:15:43,760
Track answers.
368
00:15:43,760 --> 00:15:46,680
Ranking citations and reduce them over time.
369
00:15:46,680 --> 00:15:50,240
Raise retrieval K, improve chunking at a disambiguation prompt.
370
00:15:50,240 --> 00:15:51,240
Itterate weekly.
371
00:15:51,240 --> 00:15:52,920
This is software not a statue.
372
00:15:52,920 --> 00:15:53,920
Final checklist.
373
00:15:53,920 --> 00:15:55,560
Schema 1.22 present.
374
00:15:55,560 --> 00:15:57,760
Copilot, agents and custom.
375
00:15:57,760 --> 00:15:59,400
Engine agents declared.
376
00:15:59,400 --> 00:16:01,000
Conversation status localized.
377
00:16:01,000 --> 00:16:02,480
Actions defined with parameters.
378
00:16:02,480 --> 00:16:03,960
Tenant controls configured.
379
00:16:03,960 --> 00:16:05,160
Data scopes enforced.
380
00:16:05,160 --> 00:16:06,560
RI filters active.
381
00:16:06,560 --> 00:16:07,560
Locks flowing.
382
00:16:07,560 --> 00:16:08,800
And three scenario tests.
383
00:16:08,800 --> 00:16:10,720
Passing with citations and links.
384
00:16:10,720 --> 00:16:12,720
Test that's green rollout beyond the pilot.
385
00:16:12,720 --> 00:16:14,480
Same UI, different brain.
386
00:16:14,480 --> 00:16:16,480
The intern just graduated.
387
00:16:16,480 --> 00:16:17,480
Proof?
388
00:16:17,480 --> 00:16:18,480
DLP policy.
389
00:16:18,480 --> 00:16:19,480
Regional SOPs.
390
00:16:19,480 --> 00:16:20,480
Compliance.
391
00:16:20,480 --> 00:16:21,480
Before versus after.
392
00:16:21,480 --> 00:16:22,840
Let's do the side by side.
393
00:16:22,840 --> 00:16:23,840
Same questions.
394
00:16:23,840 --> 00:16:25,160
Same users.
395
00:16:25,160 --> 00:16:26,920
Different brain.
396
00:16:26,920 --> 00:16:27,920
Case 1.
397
00:16:27,920 --> 00:16:29,240
DLP policy.
398
00:16:29,240 --> 00:16:30,800
Before default copilot.
399
00:16:30,800 --> 00:16:34,280
You ask, can I share a customer spreadsheet with an external vendor?
400
00:16:34,280 --> 00:16:38,680
It produces a smooth essay about safe sharing, labels and general DLP concepts.
401
00:16:38,680 --> 00:16:42,440
No mention of your exception list, no project Orion, no link to your runbook.
402
00:16:42,440 --> 00:16:47,120
The user nods, shares anyway, triggers an alert, and now security is babysitting cleanup.
403
00:16:47,120 --> 00:16:50,320
After the specialist, same prompt plus a project code.
404
00:16:50,320 --> 00:16:51,320
Response.
405
00:16:51,320 --> 00:16:55,520
No PII external sharing is blocked unless exemption project Orion is active.
406
00:16:55,520 --> 00:16:58,120
Current status inactive, submit exemption at requests.
407
00:16:58,120 --> 00:16:59,120
DLP exemption.
408
00:16:59,120 --> 00:17:00,120
Approval.
409
00:17:00,120 --> 00:17:01,120
Data protection office.
410
00:17:01,120 --> 00:17:02,120
C. DLP policy.
411
00:17:02,120 --> 00:17:03,640
Favore 3.2 and exception.
412
00:17:03,640 --> 00:17:04,640
Runbook.
413
00:17:04,640 --> 00:17:05,640
Favore 0.1.
414
00:17:05,640 --> 00:17:06,640
Two citations.
415
00:17:06,640 --> 00:17:07,640
One action.
416
00:17:07,640 --> 00:17:08,640
Zero poetry.
417
00:17:08,640 --> 00:17:10,680
Agent doesn't negotiate with ambiguity.
418
00:17:10,680 --> 00:17:15,000
It calls validate project code, checks the exception registry and answers with receipts.
419
00:17:15,000 --> 00:17:19,840
Accuracy improves because retrieval narrows to your indexed DLP artifacts and hallucinations
420
00:17:19,840 --> 00:17:23,120
plummet because the answer template requires citations to ship.
421
00:17:23,120 --> 00:17:24,120
Case 2.
422
00:17:24,120 --> 00:17:25,760
Regional SOPs after hours.
423
00:17:25,760 --> 00:17:26,760
Before default copilot.
424
00:17:26,760 --> 00:17:28,960
It's 1910 CET-72 in Emia.
425
00:17:28,960 --> 00:17:29,960
Who do I page?
426
00:17:29,960 --> 00:17:33,160
It responds with ITIL platitudes and contact the on-call team.
427
00:17:33,160 --> 00:17:34,160
Which one?
428
00:17:34,160 --> 00:17:35,160
How?
429
00:17:35,160 --> 00:17:36,160
Silence.
430
00:17:36,160 --> 00:17:37,160
Minutes pass.
431
00:17:37,160 --> 00:17:38,160
After.
432
00:17:38,160 --> 00:17:39,160
The specialist.
433
00:17:39,160 --> 00:17:42,920
It's 1910 CET-72 in Emia.
434
00:17:42,920 --> 00:17:43,920
Response.
435
00:17:43,920 --> 00:17:48,560
Page Emia vendor ops via Emia vendor SV2R-D.
436
00:17:48,560 --> 00:17:49,560
SLA.
437
00:17:49,560 --> 00:17:51,320
Acknowledge in 20 minutes.
438
00:17:51,320 --> 00:17:53,280
Escalate to duty manager if no arc.
439
00:17:53,280 --> 00:17:56,400
Use page now to send incident template.
440
00:17:56,400 --> 00:17:57,400
Citations.
441
00:17:57,400 --> 00:18:00,560
Emia SOP R2.3 and after hours playbook are for 1.1.
442
00:18:00,560 --> 00:18:01,560
The model didn't guess.
443
00:18:01,560 --> 00:18:02,560
It read your SOP.
444
00:18:02,560 --> 00:18:07,440
Use time and region as parameters and surface the exact channel plus the SLA.
445
00:18:07,440 --> 00:18:09,040
Speed 1 message to action.
446
00:18:09,040 --> 00:18:10,040
Accuracy.
447
00:18:10,040 --> 00:18:12,680
The name distribution list and escalation rule.
448
00:18:12,680 --> 00:18:13,840
Reduced hallucinations.
449
00:18:13,840 --> 00:18:17,040
No invented teams because the tool verified on call data.
450
00:18:17,040 --> 00:18:18,040
Case 3.
451
00:18:18,040 --> 00:18:19,040
HIPAA communications.
452
00:18:19,040 --> 00:18:20,640
Before default copilot.
453
00:18:20,640 --> 00:18:23,840
Can we email HIPAA updates using Outlook campaigns?
454
00:18:23,840 --> 00:18:25,240
It drifts into encryption.
455
00:18:25,240 --> 00:18:26,240
Data handling.
456
00:18:26,240 --> 00:18:28,080
Maybe suggest secure email.
457
00:18:28,080 --> 00:18:32,280
The average user interprets that as sure if careful as legal screams later.
458
00:18:32,280 --> 00:18:33,960
After the specialist.
459
00:18:33,960 --> 00:18:34,960
No.
460
00:18:34,960 --> 00:18:37,280
Patient notices require the patient portal with MFA.
461
00:18:37,280 --> 00:18:39,120
Bulk email is prohibited.
462
00:18:39,120 --> 00:18:42,760
Citations legal memo HIPAA comes our 5 and patient notice policy R2.
463
00:18:42,760 --> 00:18:43,760
Link.
464
00:18:43,760 --> 00:18:45,200
Open portal notice workflow.
465
00:18:45,200 --> 00:18:49,640
The agent calls the policy endpoint retrieves the governing memo and returns a crisp decision
466
00:18:49,640 --> 00:18:51,200
with a compliant path.
467
00:18:51,200 --> 00:18:52,200
Notice the pattern.
468
00:18:52,200 --> 00:18:53,920
Before is plausible narration.
469
00:18:53,920 --> 00:18:54,920
After is constrained.
470
00:18:54,920 --> 00:18:55,920
Decisioning.
471
00:18:55,920 --> 00:18:56,920
With sources and actions.
472
00:18:56,920 --> 00:19:00,920
That's why speed accuracy and fewer hallucinations show up together.
473
00:19:00,920 --> 00:19:04,560
It comes from a genetic retrieval plus tiny high signal tools.
474
00:19:04,560 --> 00:19:08,120
Accuracy comes from scoping to your indices and verifying with APIs.
475
00:19:08,120 --> 00:19:11,960
hallucinations drop because every final sentence has to stand on a citation.
476
00:19:11,960 --> 00:19:13,880
Let's quantify without the hand waving.
477
00:19:13,880 --> 00:19:14,880
Latency.
478
00:19:14,880 --> 00:19:18,680
Default copilot often needs 2 or 3 clarifying turns to stumble towards specificity.
479
00:19:18,680 --> 00:19:22,860
The specialist lands in 1 because the planner decomposes the prompt policy exception
480
00:19:22,860 --> 00:19:25,360
routing and fetches exactly those chunks.
481
00:19:25,360 --> 00:19:26,360
Token waste shrinks.
482
00:19:26,360 --> 00:19:28,400
The clock stops sooner.
483
00:19:28,400 --> 00:19:29,400
Detection rate.
484
00:19:29,400 --> 00:19:31,960
With default users correct the assistant or each other.
485
00:19:31,960 --> 00:19:35,400
With the specialist corrections become rare because the automation points to the single
486
00:19:35,400 --> 00:19:37,120
source of truth.
487
00:19:37,120 --> 00:19:38,520
Incident deflection.
488
00:19:38,520 --> 00:19:41,920
Support tickets about who do I page or is this allowed e?
489
00:19:41,920 --> 00:19:45,760
Decrease because the action buttons take people directly into the sanctioned workflow.
490
00:19:45,760 --> 00:19:47,840
Failure modes because reality exists.
491
00:19:47,840 --> 00:19:50,040
If after looks like before generic answers.
492
00:19:50,040 --> 00:19:52,560
You miss scoped data or forgot capability tags.
493
00:19:52,560 --> 00:19:54,640
So copilot never invoked your agent.
494
00:19:54,640 --> 00:19:58,200
If citations point to stale docs your index refresh is broken.
495
00:19:58,200 --> 00:20:01,600
If latency spikes your retriever is fishing too broadly.
496
00:20:01,600 --> 00:20:04,120
Tune hybrid search weights and re-ranker.
497
00:20:04,120 --> 00:20:07,560
And if a user still finds a hallucination, raise the citation threshold.
498
00:20:07,560 --> 00:20:09,280
No citation, no claim.
499
00:20:09,280 --> 00:20:10,280
Same UI.
500
00:20:10,280 --> 00:20:11,960
Different brain, that's the proof.
501
00:20:11,960 --> 00:20:14,480
Operations, life cycle, governance and scale.
502
00:20:14,480 --> 00:20:15,640
You build one specialist.
503
00:20:15,640 --> 00:20:17,760
Now act like an adult and run the program.
504
00:20:17,760 --> 00:20:19,160
Life cycle first.
505
00:20:19,160 --> 00:20:20,560
Version your agent like software.
506
00:20:20,560 --> 00:20:22,640
Vnext in dev, promotion to test.
507
00:20:22,640 --> 00:20:24,760
Then production via release tags.
508
00:20:24,760 --> 00:20:27,160
Componentize topics in action so you can reuse them.
509
00:20:27,160 --> 00:20:29,040
No copy-based policies.
510
00:20:29,040 --> 00:20:30,440
Automate evaluation.
511
00:20:30,440 --> 00:20:33,800
Regression prompts for DLP, SOP and compliance.
512
00:20:33,800 --> 00:20:36,560
Fail the build on citation drift or rising latency.
513
00:20:36,560 --> 00:20:39,160
Governance, scope identities with least privilege.
514
00:20:39,160 --> 00:20:42,160
Tie indices to data classifications and regions.
515
00:20:42,160 --> 00:20:44,600
EMEA agent doesn't see APAC records.
516
00:20:44,600 --> 00:20:47,760
Turn on audit, trace IDs, action logs retrieval sources.
517
00:20:47,760 --> 00:20:52,080
Security once at a stations, document models, actions, data flows and RAI filters.
518
00:20:52,080 --> 00:20:54,360
Publish risk notes for every new capability.
519
00:20:54,360 --> 00:20:56,440
Scale, treat adoption like product.
520
00:20:56,440 --> 00:20:58,720
Lead conversation status by persona.
521
00:20:58,720 --> 00:21:01,760
Monitor question coverage and add actions where user stall.
522
00:21:01,760 --> 00:21:07,520
Cost control, cap tool invocations, cash retrieval, right size models.
523
00:21:07,520 --> 00:21:12,440
And yes, set tenant controls by environment so experiments don't hit payroll.
524
00:21:12,440 --> 00:21:13,440
Key takeaway.
525
00:21:13,440 --> 00:21:18,280
Copilot becomes reliable only when your specialist agent brings retrieval, actions and
526
00:21:18,280 --> 00:21:20,320
guardrails into copilot chat.
527
00:21:20,320 --> 00:21:25,800
If this saved you time, repay the debt, subscribe, tap follow, enable notifications and
528
00:21:25,800 --> 00:21:29,240
watch the next deep dive on automated agent evaluation.
529
00:21:29,240 --> 00:21:31,120
Scheduled like a well-behaved con job.