Prompt Guard/Moderation #1409

_Description_: Users often want to enforce prompt guard or moderation policies to protect applications that use LLMs. This feature would suit enterprise users who build an internal LLM/GenAI gateway with Envoy AI Gateway.

> Describe the desired behavior, what scenario it enables and how it
> would be used.

<img width="1125" height="517" alt="Image" src="https://github.com/user-attachments/assets/cb9a5911-b4e8-4de3-9fc5-ab86bc65b86b" />

[optional *Relevant Links*:]
- Concept:
  - [Navigating the LLM Landscape: Uber’s Innovation with GenAI Gateway](https://www.uber.com/blog/genai-gateway/)
- Gateway Implementations
  - [kgateway - Prompt Guards](https://kgateway.dev/docs/latest/ai/prompt-guards)
- Other Guard/Moderation Implementations
  - [LiteLLM](https://docs.litellm.ai/docs/proxy/guardrails/quick_start)
  - [Bifrost](https://docs.getbifrost.ai/enterprise/guardrails)
- Guard/Moderation API
  - [OpenAI - Moderation](https://platform.openai.com/docs/guides/moderation)
  - [Anthropic - Content moderation](https://docs.claude.com/en/docs/about-claude/use-case-guides/content-moderation)
  - [Gemini for safety filtering and content moderation](https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/gemini-for-filtering-and-moderation)

> Any extra documentation required to understand the issue.

From an API perspective, I imagine the feature could either be integrated into `AIGatewayRoute` or introduce a new API (CRD) for configuring the guard/moderation policy, since such policies are complex.

`BackendTrafficPolicy` or `SecurityPolicy` could also be candidates for integration. However, these are not Envoy AI Gateway's own APIs, so I think we should design more _generic_ APIs that can call external APIs in this case.

Note: although Envoy AI Gateway does not support this feature natively, users could achieve it today by implementing [External Processing](https://gateway.envoyproxy.io/v1.4/tasks/extensibility/ext-proc/) or [External Authorization](https://gateway.envoyproxy.io/docs/tasks/security/ext-auth).
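
To make the workaround above more concrete, here is a minimal sketch of the kind of decision logic an external processor or authorization service might apply to a request body. It is not an Envoy AI Gateway API: the function names, the deny patterns, and the assumption that the body follows the chat-completions `messages` shape are all hypothetical, and a real guard would more likely call one of the moderation APIs linked above than rely on regular expressions.

```python
import json
import re
from typing import List, Tuple

# Hypothetical deny patterns; a real policy would come from configuration.
DENY_PATTERNS = [
    re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),  # US SSN-like number
    re.compile(r"ignore (all )?previous instructions", re.IGNORECASE),
]


def extract_prompt_text(body: bytes) -> List[str]:
    """Collect text from the `messages` array of a chat-completions style body."""
    payload = json.loads(body)
    texts = []
    for message in payload.get("messages", []):
        content = message.get("content")
        if isinstance(content, str):
            texts.append(content)
        elif isinstance(content, list):
            # Multi-part content: keep only the text parts.
            for part in content:
                if isinstance(part, dict) and part.get("type") == "text":
                    texts.append(part.get("text", ""))
    return texts


def guard_request(body: bytes) -> Tuple[bool, str]:
    """Return (allowed, reason); reject when any deny pattern matches."""
    for text in extract_prompt_text(body):
        for pattern in DENY_PATTERNS:
            if pattern.search(text):
                return False, "blocked by pattern: " + pattern.pattern
    return True, ""
```

In an External Processing or External Authorization setup, the service would run a check like `guard_request` on the buffered request body and return a rejection to Envoy when `allowed` is false. A native feature would presumably move this configuration into a CRD so that users do not have to run their own service.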

