You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/guardrails/copyright.md
+10Lines changed: 10 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,6 +9,16 @@ Copyright Compliance in Agentic Systems
9
9
10
10
It is important to ensure that content generated by agentic systems respects intellectual property rights and avoids the unauthorized use of copyrighted material. Copyright compliance is essential not only for legal and ethical reasons but also to protect users and organizations from liability and reputational risk.
11
11
12
+
!!! danger "Copyright Risks"
13
+
Agents that generate code or other copyrighted material without proper authorization are at risk of violating copyright laws. This could expose your agentic system to legal liability:
14
+
15
+
* You agent may handle, process and reproduce copyrighted material without permission
16
+
17
+
* You may unknowingly host copyrighted material without permission
18
+
19
+
* You may unknowingly expose copyrighted material to users
20
+
21
+
12
22
Guardrails provides the `copyright` function to detect if any licenses are present in a given piece of text, to protect against exactly this.
Copy file name to clipboardExpand all lines: docs/guardrails/images.md
+4-2Lines changed: 4 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,9 +19,11 @@ Additionally, some systems may allow users to submit images, posing additional r
19
19
20
20
> * Capture **personally identifiable information (PII)** like names or addresses.
21
21
>
22
-
> * View credentials such as **passwords, API keys, or access tokens**.
22
+
> * View credentials such as **passwords, API keys, or access tokens** like present in passport images or other documents.
23
23
>
24
-
> * Get **prompt injected** from text in an image.
24
+
> * Get **prompt injected or jailbroken** from text in an image.
25
+
>
26
+
> * Generate images with **explicit or harmful content**.
25
27
26
28
27
29
Guardrails provide you a powerful way to enforce visual security policies, and to limit the agent's perception to only the visual information that is necessary and appropriate for the task at hand.
Copy file name to clipboardExpand all lines: docs/guardrails/moderation.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -12,7 +12,7 @@ It is important to ensure the safe generation of content from agentic systems to
12
12
By implementing moderation guardrails, you can shape the behavior of agentic systems in a way that is predictable, value-aligned, and resilient to misuse.
13
13
<divclass='risks'/>
14
14
> **Moderated and Toxic Content Risks**<br/>
15
-
> Without safeguards, agents may:
15
+
> Without moderation safeguards, agents may:
16
16
17
17
> * Generate or amplify **hate speech, harassment, or explicit content**.
Copy file name to clipboardExpand all lines: docs/guardrails/prompt-injections.md
+43-5Lines changed: 43 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -11,7 +11,7 @@ Prompt injections may come directly from user inputs or be embedded in content f
11
11
12
12
<divclass='risks'/>
13
13
> **Jailbreak and Prompt Injection Risks**<br/>
14
-
> Without safeguards, agents may:
14
+
> Without prompt injection defenses, agents may:
15
15
16
16
> * Execute **tool calls or actions** based on deceptive content fetched from external sources.
17
17
>
@@ -21,7 +21,7 @@ Prompt injections may come directly from user inputs or be embedded in content f
21
21
>
22
22
> * Accept inputs that **subvert system roles**, such as changing identity or policy mid-conversation.
23
23
24
-
We provide the functions `prompt_injection` and `unicode` to detect and mitigate these risks.
24
+
Guardrails provides the functions `prompt_injection` and `unicode` to detect and mitigate these risks across your agentic system.
25
25
26
26
## prompt_injection <spanclass="detector-badge"/>
27
27
```python
@@ -87,13 +87,13 @@ Detector to find specific types of unicode characters in text.
87
87
### Detecting Specific Unicode Characters
88
88
Using the `unicode` function you can detect a specific type of unicode characters in message content. For example, if someone is trying to use your agentic system for their math homework, you may wish to detect and prevent this.
@@ -135,3 +135,41 @@ raise "Found Math Symbols in message" if:
135
135
]
136
136
```
137
137
<div class="code-caption"> Detect someone trying to do math with your agentic system. </div>
138
+
139
+
See the official [unicode standard](https://en.wikipedia.org/wiki/Unicode_character_property#General_Category) for more information on the different unicode categories.
140
+
141
+
A selection can be found below:
142
+
143
+
```
144
+
[Cc] Other, Control
145
+
[Cf] Other, Format
146
+
[Cn] Other, Not Assigned (no characters in the file have this property)
147
+
[Co] Other, Private Use
148
+
[Cs] Other, Surrogate
149
+
[LC] Letter, Cased
150
+
[Ll] Letter, Lowercase
151
+
[Lm] Letter, Modifier
152
+
[Lo] Letter, Other
153
+
[Lt] Letter, Titlecase
154
+
[Lu] Letter, Uppercase
155
+
[Mc] Mark, Spacing Combining
156
+
[Me] Mark, Enclosing
157
+
[Mn] Mark, Nonspacing
158
+
[Nd] Number, Decimal Digit
159
+
[Nl] Number, Letter
160
+
[No] Number, Other
161
+
[Pc] Punctuation, Connector
162
+
[Pd] Punctuation, Dash
163
+
[Pe] Punctuation, Close
164
+
[Pf] Punctuation, Final quote (may behave like Ps or Pe depending on usage)
165
+
[Pi] Punctuation, Initial quote (may behave like Ps or Pe depending on usage)
0 commit comments