Whether it's user input or model output, Emil classifies content across 14 safety categories and five risk axes, then blocks, flags, or redacts based on the policy you set.
Age-appropriate and audience-specific policy presets
Built for production
Per-tenant policies and thresholds you control
Fail-closed option so unsafe content is never let through on error
Audit trail of every decision, content never retained
Questions
What categories does Emil moderate?
Fourteen safety categories spanning violence, self-harm, hate, sexual content, illegal activity and more, scored across five risk axes — safety, privacy, security, compliance, and brand.
Can I set different rules per product?
Yes. Policies and score thresholds are per-tenant, with presets for general, brand-safe, family-safe, healthcare, finance, education, and more.
What happens if a classifier errors?
You choose the fail mode. Fail-closed blocks the request when a classifier is unavailable, so unsafe content is never let through on a best-effort basis.
Is content stored?
No. Text is classified in memory; only the decision, score, and a short redacted preview are kept for the audit trail.