Day 12 — Guardrails & Responsible AI

Day 12. Once an agent is in front of real users, "guardrails" stop being theory — every output has to be safe, on-policy, and free of PII leakage. Picking this up now so it's wired in before scale, not after.

Topics to cover:

1. Bedrock Guardrails — content filters, denied topics, PII redaction, contextual grounding checks
2. Prompt-level guardrails — system prompt patterns, refusal templates, jailbreak resistance
3. Output filtering — post-generation safety checks, regex / model-as-judge, escalation paths
4. PII handling — detection, masking, audit logs, GDPR / DPDP basics
5. Eval for safety — building a red-team test set, LLM-as-judge for harmful outputs

Plan: Chandana on Bedrock Guardrails + PII / KMS integration, me on prompt-level + output-level guardrails + safety eval, shipping a guardrailed version of one of our existing agents.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Day 12 — Guardrails & Responsible AI #15

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Day 12 — Guardrails & Responsible AI #15

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions