Build / 90-minute intensive
當 Agent 開始行動
When AI agents act, security becomes product design
這不是一堂「AI 很厲害」的簡報,而是一場從 demo 到 production gate 的安全決策演練。
Build / 90-minute intensive
When AI agents act, security becomes product design
這不是一堂「AI 很厲害」的簡報,而是一場從 demo 到 production gate 的安全決策演練。
Cold open
It did not rebel. It followed the path we gave it.
故事從一個 customer-success agent 開始:它會讀 CRM、整理客戶脈絡、草擬回覆,demo 漂亮到所有人都想立即上線。
如果它只是答錯,問題不大;如果它真的做錯,誰負責?
Chapter 1 / Risk shift
From bad answer to bad outcome
Chatbot 主要製造資訊風險;agent 一旦接上工具,就會製造行動風險。
Bad answer vs bad outcome
The boundary moves from what it said to what it did
Interaction 1
Chatbot, workflow, or agent?
同一個客戶跟進場景,如果只是固定步驟,它是 workflow;如果它要判斷資料、選工具、處理例外,才是 agent。
投票:這個 customer-success demo 現在是 chatbot、workflow,還是 agent?說出原因。
Chapter 2 / Product bar
Do not build an agent when workflow is enough
一線 AI 產品團隊今日會先問:這件事是否真的需要判斷、情境彈性與工具選擇?
Agent worthiness test
Four questions before autonomy
Best practice
Start small, make it stable, then grant authority
Chapter 3 / Authority
Authority before capability
模型輸出只是一個請求,不等於授權。真正決定能不能行動的是身份、角色、scope 與 policy。
Identity model
Who does this agent represent?
Permission ladder
Permission is a ladder, not a switch
Interaction 2
How much authority should V1 get?
故事中的 agent 想讀 CRM、草擬 email、更新 ticket、建立 meeting。哪一些可以自動?哪一些必須先批准?
填一行 permission matrix:Read / Draft / Write / External / Admin。
Chapter 4 / Tool contracts
Tools are delegated power
不要把 shell、SQL、admin console 直接交給模型。把能力包成窄、可驗證、可記錄的工具契約。
Case branch / Bad tool design
Broad tools create broad accidents
Good tool contract
Narrow, dry-run capable, traceable
Interaction 3
Rewrite broad access into narrow tools
把「讓 agent 用 Gmail」改成三個更安全的 tool contract。
範例答案:search_customer_thread、create_reply_draft、request_send_approval。
Chapter 5 / Human approval
Not every step needs a prompt; every consequence needs ownership
Approval UX
Approval must show the actual side effect
Bypass rules
Approval can be skipped; logging cannot
Interaction 4
Redesign the approval prompt
壞例子:「Agent wants to send an email. Are you sure?」好例子要顯示 recipient、source、claim、impact、rollback。
把一句 vague approval 改成 reviewer 看得懂、能負責的 approval surface。
Chapter 6 / Action injection
Prompt injection becomes action injection
不可信內容可以提供事實,但不能授權 agent 做事。
Case branch / Hidden instruction
Customer content is not system authority
Instruction hierarchy
Separate instruction from content
Interaction 5
Spot the malicious line
給 audience 一段混合 email:客戶需求、真實資料、隱藏指令、外部連結。請標記哪些是 facts,哪些是 attempted authority。
分類:可信指令 / 不可信內容 / 需要追問 / 必須拒絕。
Chapter 7 / Data boundary
Keep secrets out of model context
Prompt、memory、vector store、logs 都可能成為敏感資料庫。
Sensitive context
Visibility is not necessity
Case branch / Exfiltration path
Leaks often come from tool combinations
Chapter 8 / Sandbox
Constrain blast radius before autonomy
假設 agent 有時會錯。安全設計的工作,是讓錯誤被關在小範圍內。
Containment defaults
Separate browser, files, code and network
Recovery mode
A kill switch lives outside the model
Interaction 6
Choose sandbox defaults
它要讀 CRM、草擬回覆、安排 follow-up。哪些 filesystem、browser、network、production access 應該一開始就關掉?
寫下三個 default deny,以及一個可被批准的例外。
Chapter 9 / Trace and evals
No trace, no meaningful evals
Trace 不是 debug 附屬品,而是安全、合規、管理層信任與事故調查的產品功能。
Trace anatomy
An action trail that can be investigated
Eval set
Evals must include refusal, attack and recovery
Interaction 7
Write three evals for the story agent
一個成功、一個拒絕、一個攻擊。每個 eval 都要有 input、expected behavior、trace signal。
不要只寫「答案要正確」;寫清楚何時要追問、拒絕或要求批准。
Chapter 10 / Production gate
Demo-ready is not deployment-ready
可以 demo,只代表它有價值;可以上線,代表組織能證明它受控。
Board questions
The boardroom questions before launch
Final exercise
Ship, delay, or block?
回到故事中的 customer-success agent。你現在有 permission matrix、tool contract、approval threshold、injection tests、audit schema。
選一個:ship / delay / block。必須用證據說明,而不是用感覺說明。
Takeaway kit
What participants should take away
Closing principle
Agent-ready means the organization can prove control
不是它能不能完成任務,而是它被允許做什麼、用什麼資料、誰批准、如何停止、如何復原,都能被證明。
Review Smart Play validationOpen