Injection

Detect prompt injection attacks with 64+ patterns across 7 categories. Sub-millisecond, zero dependencies.

64+ regex patterns across 7 categories. Synchronous, zero dependencies, sub-millisecond. Import from governance-sdk/injection-detect.

Signature

ts

Basic Usage

ts

Return Type

ts

7 Attack Categories

CategoryPatternsDescription
instruction_override6Override or replace original instructions
role_manipulation4Redefine agent identity or persona
context_escape3Leak system prompts or escape context
data_exfiltration2Exfiltrate data to external endpoints
encoding_attack2Bypass via base64, Unicode, encoding tricks
social_engineering3Urgency, false authority, testing excuses
obfuscation8Zero-width chars, RTL overrides, zalgo, Unicode confusables

Severity Levels

LevelScore RangeDescription
low0.1-0.3Single low-weight pattern
medium0.3-0.6Multiple patterns or moderate-weight
high0.6-0.85High-weight or cross-category attack
critical0.85-1.0Multiple high-weight, cross-category

Configuration

ts

Note: Custom patterns are evaluated alongside the built-in patterns. Use high weights (0.8+) for patterns specific to your domain.