ODOCK.AI
Security & GuardrailsSecurity Engine

Security Engine

Understand SafetySec as Odock's modular prompt and response security engine.

Security Engine

The Odock security engine is called SafetySec in the runtime. It is a modular engine for prompt safety, response safety, redaction, leakage detection, and repeated-risk awareness.

SafetySec is separate from policy limits. Policies control traffic shape and resource boundaries. SafetySec inspects prompt and response content when text is available.

What SafetySec Does

CapabilityPurpose
Prompt-injection detectionDetect attempts to override instructions, reveal hidden context, or bypass policy.
Jailbreak-pattern detectionDetect jailbreak phrasing and roleplay-style bypass attempts.
Sensitive redactionRedact secrets and PII-like values in requests and responses.
Data-leakage detectionDetect sensitive material in model output or echoed from the request.
Repeated-risk awarenessTreat repeated suspicious behavior differently from isolated low-risk events.
Safety evidenceProduce evidence that helps users understand whether a request was allowed, redacted, observed, or blocked.

Engine Architecture

SafetySec is lifecycle-aware:

MomentTypical use
Before upstream workPrompt checks, sensitive input handling, request-side redaction.
After upstream workResponse checks, output redaction, leakage protection.
After response handlingNon-blocking audit, metrics, or analysis where configured.

The exact internal plan is deployment-managed. The public model is that SafetySec modules run at the points where their required context exists.

How It Differs From Guardrail Policies

Policy guardrailsSecurity engine
Configured through resource policies, access grants, budgets, and quotas.Configured through security modules in the Odock deployment.
Enforce network, traffic, payload, token, cost, and resource access boundaries.Inspects prompt and response text for safety and sensitive data.
Often blocks before upstream calls.Can redact before the upstream call or block/redact after response.
Works from request metadata and token envelope.Works from request/response content and session history.

For policy guardrails, see Guardrails.

User Impact

When SafetySec blocks, the caller receives a structured gateway error instead of an upstream response. When it redacts, the request or response continues with sensitive values replaced by redaction markers.

Because SafetySec can work before and after the provider call, it can protect both sides:

  • before upstream: prevent sensitive input and suspicious prompts from leaving through the provider request
  • after upstream: prevent sensitive output or leakage from reaching the caller

Continue with Security workflow.

On this page