AI SHIELD

141 modules. 17 verticals. Zero trust. Full coverage.
Autonomous defence for AI agent fleets. 141 modules. Real-time threat detection.
139
Modules
17
Verticals
100%
ATLAS Coverage
Real-Time
Detection
Launch AI Shield GUI Read Documentation
AI AGENT HIJACKING · PROMPT INJECTION · MEMORY POISONING · TOOL ABUSE · CONTEXT MANIPULATION · GUARDRAIL BYPASS · SUPPLY CHAIN ATTACKS · ROGUE MCP SERVERS · MODEL INVERSION · DATA EXFILTRATION · AI AGENT HIJACKING · PROMPT INJECTION · MEMORY POISONING · TOOL ABUSE · CONTEXT MANIPULATION · GUARDRAIL BYPASS · SUPPLY CHAIN ATTACKS · ROGUE MCP SERVERS · MODEL INVERSION · DATA EXFILTRATION ·

AI Agents Are the New Attack Surface

AI agent fleets operate autonomously, chain tools, hold persistent memory, and trust external inputs by default. Every one of those properties is an attack vector. Traditional perimeter security has no concept of prompt injection, memory poisoning, or agent impersonation. AI Shield was built for this gap — 141 modules covering every layer from model inference to fleet orchestration.

LLM01 / ATLAS AML.T0051

Prompt Injection

Direct and indirect injection attacks that override agent instructions, hijack goal state, or exfiltrate context through manipulated tool outputs and poisoned data sources.

ATLAS AML.T0040

Memory & Context Poisoning

Adversarial writes to agent memory stores — vector databases, episodic buffers, working context windows — that persist across sessions and corrupt downstream reasoning at scale.

LLM07 / ATLAS AML.T0048

Tool Abuse & Excessive Agency

Agents granted access to file systems, APIs, and code execution can be weaponised through over-privilege. AI Shield enforces least-privilege tool policy at the agent runtime layer.

OWASP LLM02

Data Exfiltration via Output

Sensitive information leaked through model outputs, encoded in structured responses, or smuggled via tool call parameters. Detection patterns across 114 exfiltration signatures.

LLM03 / Supply Chain

Supply Chain Compromise

Malicious adapters, poisoned fine-tune checkpoints, and backdoored MCP servers masquerading as legitimate tool endpoints — all intercepted before they reach the agent runtime.

ATLAS AML.T0056

Rogue MCP Servers

Adversary-controlled Model Context Protocol servers that inject malicious instructions, exfiltrate agent state, or perform tool poisoning through the MCP stdio and SSE transport layers.

139
Defence Modules
17
Industry Verticals
100%
MITRE ATLAS Coverage
<50ms
Detection Latency

17 Industry Verticals

AI Shield deploys across 17 purpose-built verticals. Each vertical packages the modules most relevant to that industry's threat model, regulatory obligations, and AI deployment patterns — from financial services fraud prevention to space/NTN autonomous systems assurance.

V01
Core
ACTIVE
V02
Adversarial
ACTIVE
V03
Injection
ACTIVE
V04
Exfiltration
ACTIVE
V05
Model Security
ACTIVE
V06
Agent Runtime
ACTIVE
V07
Supply Chain
ACTIVE
V08
Compliance
ACTIVE
V09
Network
ACTIVE
V10
Cryptographic
ACTIVE
V11
Infrastructure
ACTIVE
V12
Social Engineering
ACTIVE
V13
Ransomware
ACTIVE
V14
API Security
ACTIVE
V15
Multimodal
ACTIVE
V16
Mobile
ACTIVE
V17
Space / NTN
ACTIVE

Purpose-Built Defence Modules

Every module in AI Shield is a standalone detection engine with its own signature library, behavioural heuristics, and MITRE ATLAS mapping. Modules compose into vertical deployments without interference — each one independently testable, independently deployable, independently auditable.

M99 — CORE
Prompt Injection Shield
Real-time interception of direct and indirect prompt injection across all agent input channels. Covers goal hijacking, instruction override, role manipulation, token smuggling, and context overflow patterns. OWASP LLM01 mapped. Sub-50ms detection on every inference call.
LLM01 ATLAS AML.T0051 Real-Time V01 Core
M104 — ADVERSARIAL
Adversarial Input Detector
Detects adversarial ML attacks against vision and text models — FGSM, PGD, CW, patch attacks, and semantic adversarial examples. Validates inputs before they reach model inference. Integrates with NIGHTFALL FORGE test findings to generate blocking rules automatically.
FGSM / PGD ATLAS AML.T0043 V02 Adversarial VLM Support
M108 — AGENT RUNTIME
Agent Runtime Monitor
Continuous behavioural monitoring of live AI agents. Detects anomalous tool call sequences, memory write patterns, inter-agent messaging abuse, and goal-state drift. Works across LangChain, AutoGen, CrewAI, and custom agent frameworks via the AI Shield instrumentation layer.
LLM06 Behavioural V06 Agent Runtime MITRE ATLAS
M300 — SPACE / NTN
NTN Shield
Purpose-built for Non-Terrestrial Network AI systems. Covers satellite-ground link injection, feed manipulation, orbital command spoofing, and firmware integrity verification. SPARTA framework mapped. Supports LEO, MEO, GEO, and HAPS deployments with latency-tolerant detection pipelines.
SPARTA NTN / 5G NR V17 Space 140 Tests
M115 — MEMORY LIFECYCLE
Memory Lifecycle Guard
Runtime enforcement at the agent memory layer. 28 detectors across 7 attack categories covering injection, retrieval hijack, dormant triggers, cross-session persistence, context window attacks, exfiltration, and provenance forgery. Works across 12 backends: Mem0, MemGPT, Zep, LangChain, LlamaIndex, ChromaDB, Pinecone, Weaviate, Qdrant, pgvector, Claude memory, and GPT memory. Ed25519-signed evidence receipts on every detection. SIEM export to Splunk, Sentinel, and QRadar.
OWASP LLM04 OWASP ASI06 MITRE ATLAS 612 Tests
M118 — MCP RUNTIME
SPECTER MCP SHIELD
Client-side MCP runtime guardian. 28 detectors across 7 attack categories: tool description injection, sampling hijack (Unit42 createMessage vector), STDIO command injection (CVE-2026-22252), SSE stream manipulation (CVE-2026-22688), JSON-RPC message forgery, protocol downgrade (CVE-2025-54136), schema drift, tenant isolation bleed, prompt injection via tool returns, and capability escalation. Session quarantine with TTL enforcement. SHA-256 hash-chained evidence receipts on every detection. Defensive pair to NIGHTFALL ROGUE (Tool 61).
OWASP LLM01 OWASP LLM07 MITRE ATLAS 243 Tests
M119 — ECONOMIC GUARD
Denial-of-Wallet Defence
Real-time token economics monitoring across OpenAI, Anthropic, Azure, Bedrock, and Vertex AI deployments. 8 detectors: token burn rate anomaly, context flood detection, parallel session surge, tool chain amplification, rate limit storm, billing threshold proximity, recursive loop identification, and cost anomaly baselining. Automatically throttles and quarantines runaway agent sessions before they trigger auto-reload billing cycles. Defensive pair to NIGHTFALL SPECTER BURN (Tool 76).
OWASP LLM04 ATLAS AML.T0040 149 Tests Denial-of-Wallet
M120 — REASONING INTEGRITY
Reasoning Integrity Guard
Detects and blocks attacks against extended thinking and chain-of-thought reasoning pipelines. 8 detectors: premise injection interception, conclusion drift monitoring, scratchpad exposure prevention, budget exhaustion detection, chain corruption fingerprinting, authority injection blocking, epistemic manipulation, and reasoning loop termination. Supports Claude Extended Thinking, o1/o3, Gemini Flash Thinking, DeepSeek R1, and QwQ-32B. Defensive pair to NIGHTFALL SPECTER REASONER (Tool 75).
OWASP LLM01 ATLAS AML.T0054 174 Tests CoT Defence
M121 — MODEL INTEGRITY
Model Integrity Monitor
Continuous model behavioural monitoring for sleeper-agent backdoor detection and integrity assurance. 8 components: trigger activation detection, covert exfil pattern analysis, behavioural baseline deviation scoring, output entropy anomaly, dormant trigger scanner, response volatility tracking, token distribution anomaly detection, and baseline profiler. Detects ROME rank-one weight edits, LoRA-poisoned adapters, and neuron-patch backdoors in production. Defensive pair to NIGHTFALL SPECTER NEURON (Tool 74).
OWASP LLM04 ATLAS AML.T0020 151 Tests Backdoor Detection
M122 — INFERENCE GATEWAY
Inference Gateway Guard
Real-time security enforcement layer for AI inference gateways and model routers. 8 detectors: SQL injection through LLM API parameters (CVE-2026-42208), SSRF via model endpoint routing (CVE-2026-33626), remote code execution via tool call injection (CVE-2026-41264), system prompt leakage, API route hijack, unauthorised model access, token overrun attacks, and credential harvest via malformed inference requests. Blocks malicious requests before they reach the model layer.
OWASP LLM02 OWASP LLM08 132 Tests Gateway Defence
M123 — HALO
Computer-Use Agent Guardian
Runtime protection for computer-use and browser-automation agents. 8 detectors: DOM divergence detection, visual prompt injection via screenshot content, clipboard poisoning intercept, URL fragment injection blocking, sensitive action gate (payments, auth changes, file deletion), fake dialog recognition, session token exposure prevention, and homoglyph/IDN domain spoofing. Human-in-the-loop gating for high-risk actions. Defensive pair to NIGHTFALL GHOST OPERATOR (Tool 73).
OWASP LLM01 ATLAS AML.T0054 124 Tests Computer-Use
M124 — RANSOMWARE SHIELD
AI-Accelerated Ransomware Defence
Detects AI-assisted ransomware operations against agent-connected file systems and infrastructure. 8 detectors: file entropy analysis (Shannon entropy spike detection across 37 ransomware families), shadow copy destruction, mass file modification, ransom note placement (50+ known filenames), C2 beacon via LLM API (base64/JSON/zero-width steganography), lateral movement patterns, data staging before exfiltration, and cryptographic key operation monitoring. Defensive pair to NIGHTFALL SPECTER CRYPT (Tool 82).
OWASP LLM06 MITRE T1486 154 Tests Ransomware Defence
M125 — NHI SENTINEL
Non-Human Identity Monitor
Security monitoring for non-human identities — service accounts, API keys, OAuth clients, JWTs, and machine credentials operating within AI agent fleets. 8 detectors: API key exposure (14 providers including OpenAI, Anthropic, AWS, Azure, GCP, GitHub), token lifetime violations, privilege escalation, cross-tenant identity bleed, OAuth flow abuse, JWT algorithm confusion and header manipulation, credential stuffing, and machine identity exfiltration. SHA-256 hash-chained evidence on every detection.
OWASP LLM08 ATLAS AML.T0012 125 Tests Identity Security
M126 — CAMPAIGN DETECTOR
Autonomous Campaign Detector
Detects autonomous AI adversary campaign execution in progress. 8 detectors: OODA loop pattern recognition (Observe-Orient-Decide-Act cycling at machine speed), multi-phase kill chain correlation across recon/intrusion/privilege/persistence/exfil/destroy, autonomous orchestration signal detection (fleet spawning depth, agent count), tool chain amplification, campaign persistence establishment, WARLORD-class campaign pattern matching, SPECTER EXTINCTION precursor signals (annihilation keywords, deadman triggers), and coordinated machine-precision timing anomaly. Closes the complete G11 blind spot. Defensive pair to NIGHTFALL NEMESIS · WARLORD · FIREBALL · OMEGA · SPECTER EXTINCTION.
ATLAS AML.T0043 MITRE T1059 203 Tests Campaign Detection
M127 — RECON GUARD
AI Recon & Enumeration Guard
First-phase attack detection — catches reconnaissance before exploitation begins. 8 detectors: AI-native surface enumeration (AI endpoint probing, OpenAPI discovery, MCP registry scanning), authenticated discovery probing (OIDC/OAuth well-known endpoints, multi-scheme credential testing), dark web enumeration signatures (onion address queries, Tor circuit rotation, WormGPT/FraudGPT/DarkGPT service targeting), systematic endpoint scanning (sequential/fuzzing patterns, IDOR enumeration), agent fingerprint probing (NIGHTFALL tool signature detection, version/stack disclosure), credential harvest recon (cloud IMDS access, .env/.aws/credentials targeting), infrastructure mapping (RFC 1918 subnet scanning, port sweeps), and passive recon baseline deviation. Closes the complete G01 blind spot. Defensive pair to NIGHTFALL ORION · SHADOWMAP · IDRIS · RAVEN · SHROUD · PHANTASM · SPECTER DAEMON.
ATLAS AML.T0007 MITRE T1595 194 Tests Recon Defence
M128 — SHELL GUARD
Shell Guard
Detects template-interpolation RCE attacks across AI framework deployments. 8 detectors: Jinja2 SSTI (class traversal, MRO enumeration, lipsum/cycler/joiner gadgets), YAML unsafe-load (!!python/object/apply, !!python/object/new, __reduce__), LangChain template RCE (PromptTemplate injection, f-string bypass, chain output recycling), multi-framework RCE pattern (Flowise eval/Function, Haystack YAML class-loading, AutoGen code_execution_config, CrewAI tool injection, DSPy settings poison), generic SSTI across Mako/Tornado/Chameleon, code execution via template (eval/exec/os.system/subprocess/base64 decode chains), template filter bypass (|attr() chains, unicode encoding, request|attr gadgets), and cross-framework poison propagation. Defensive pair to NIGHTFALL T79 SPECTER SHELL.
OWASP LLM02 MITRE T1059 187 Tests Template RCE
M129 — WORM GUARD
Worm Guard
Detects self-replicating adversarial prompt worm propagation across AI agent networks. 8 detectors: multi-hop propagation (hop count and agent spread thresholds), RAG corpus infection (poisoned document store/retrieve cycles, indirect prompt injection), MCP tool poison propagation (description override, zero-width/BiDi steganography, base64 hidden payloads), A2A message infection (broadcast amplification, recursive spawn, Morris II relay patterns), worm signature detection (Morris II verbatim-repeat, Nakash/Greshake, AutoGen code-gen worm, email/document worm), replication attempt pattern (11 critical patterns including CLAUDE.md/.mcp.json/.cursorrules modification), cross-agent payload correlation (hash-matching across agent sessions), and infection chain tracking (generation numbering, exponential branching detection). CVE-2026-52001. Defensive pair to NIGHTFALL T80 SPECTER WORM.
OWASP AGENTIC ATLAS AML.T0051 188 Tests Worm Detection
M130 — MEMORY GUARD
Memory Guard
Runtime detection of memory-layer attacks against AI agents — operationalises defence against the Memory-as-Control-Flow Attack (MCFA, arXiv:2603.15125). 8 detectors: memory injection (adversarial instructions in retrieved memory chunks), control flow hijack (MCFA pattern — memory redirecting agent execution), cross-session persistence (payloads persisting across sessions), memory override (replacement/resequencing triggers), RAG poisoning via memory (adversarial corpus injection), dormant trigger (sleeper payloads with conditional activation), memory exfiltration channel (covert data staging in memory fields), and memory provenance forgery (false origin claims, trust-level manipulation). Defensive pair to NIGHTFALL T77 SPECTER MEMETIC.
OWASP LLM04 ATLAS AML.T0051 240 Tests Memory Security
M131 — SLOPSHIELD
Slopshield
Detects slopsquatting and hallucinated package attacks targeting AI coding agents. When an AI agent hallucinates a package name, threat actors register that name and wait — SLOPSHIELD catches the attempt before install. 8 detectors: hallucinated package detection (40+ known-hallucinated names, generic-suffix pattern matching), typosquatting check (Levenshtein distance ≤ 2 from top-100 packages), phantom dependency injection (unverified packages in agent-generated code), malicious package substitution (25+ confirmed substitution pairs), package name confusion (Unicode homoglyphs, hyphen/underscore variants), supply chain validation (ecosystem naming conventions, import-name mismatch), AI-generated import anomaly (non-existent API functions), and slopsquatting signature (Lanyado/Imperva research corpus). Defensive pair to NIGHTFALL T59 PHANTOM SKILL.
OWASP LLM03 Supply Chain 259 Tests Slopsquatting
M132 — DECEPTION GUARD
Deception Guard
Runtime detection of deepfake, multimodal, and social engineering attacks against AI agents. Closes G10 of the NIGHTFALL taxonomy. 8 detectors: deepfake media detection (GAN artifacts, synthetic creation tool markers, TTS fingerprints), visual prompt injection (adversarial overlays, embedded instruction text, SPECTER PRISM LENS patterns), audio injection (ultrasonic commands ≥17kHz, WhisperInject-class 19kHz encoding, room acoustic manipulation), synthetic identity detection (AI-generated profiles, zero-EXIF headshots, uniform biography patterns), social engineering patterns (50+ authority/urgency/trust manipulation signatures), multimodal payload correlation (cross-modal fragment assembly, text+image+audio contradiction detection), steganographic content detection (EXIF/ID3/subtitle injection, zero-width Unicode, BiDi override), and adversarial typography (QR code payloads, adversarial signage, homoglyph substitution). Defensive pair: NIGHTFALL G10 — SPECTER SOCIAL · MIRAGE · VANTAGE · MIMIC · SPECTER PRISM.
OWASP LLM01 ATLAS AML.T0043 255 Tests Deepfake Detection
M133 — SUPPLY CHAIN RUNTIME GUARD
Supply Chain Runtime Guard
Runtime detection of supply chain and build pipeline attacks against AI agent deployments. Closes G07 of the NIGHTFALL taxonomy. 8 detectors: dependency confusion attack (namespace hijacking, version-override anti-patterns, unexpected registry sources), CI/CD pipeline poison (GitHub Actions with unverified actions, curl|bash patterns, self-hosted runner escalation), framework RCE pattern (LangChain/AutoGen/CrewAI/Haystack execution-capable components with untrusted input), malicious dependency injection (30+ confirmed malicious package names, version range widening), build artifact tampering (Docker digest mismatch, unexpected binary in pure-Python wheels, lock file hash mismatch), supply chain worm propagation (recursive dependency file modification, postinstall multi-repo spread), platform framework backdoor (trust_remote_code, HuggingFace executable model cards, SDK endpoint hijack), and code signing bypass (--no-verify flags, PYTHONPATH manipulation, unverified local installs). Defensive pair: NIGHTFALL G07 — HYDRA · PIPELINE · SPECTER SHELL · SPECTER WORM · SPECTER PLATFORM.
OWASP LLM03 MITRE T1195 235 Tests Supply Chain
M134 — ROBOTIC GUARD
Robotic System Guard
Real-time detection of attacks against robotic systems and embodied AI platforms. 8 detectors: URScript injection, ROS2 unauthorised access, dual-channel safety bypass (BadRobot arXiv:2407.20242v4 / Blindfold arXiv:2603.01414), ISO 10218-1/TS 15066 safety threshold violations, robotic credential abuse, unsigned artifact injection (CWE-345), robotic lateral movement, phantom control detection. 268 tests.
MITRE ICS T0855 ATLAS AML.T0043 ISO 10218-1 V16 Embodied AI
M135 — CUA GUARD
CUA Guard
Real-time detection of attacks against computer-use and browser agents. 8 detectors: visual prompt injection (STATIC/ADINJECT/hidden CSS), URL manipulation (CVE-2025-47241 userinfo bypass, IDNA homograph, dangerous schemes), branch steering (CaMeLs arXiv:2601.09923, indirect injection), chain action anomaly (payment/wipe/IAM/code-exec from web content), escape attempt (file protocol, path traversal, settings file write, shell metacharacters), OAuth consent spoof (scope inflation, fake provider domains, Meta blue clone), exfil channel (base64 URL params, DNS tunnelling, credential-in-body), session anomaly (rapid navigation, off-task domains, cross-origin data send). Defensive pair: T101 SPECTER WEB. 215 tests.
CVE-2025-47241 ATLAS AML.T0051 OWASP LLM01 CUA Security
M136 — INFERENCE GUARD
Inference Guard
Runtime defence for ML training and inference infrastructure. 8 detectors: Ray job anomaly (CVE-2023-48022 unauthenticated RCE, zero-CPU zombie jobs, detached C2 jobs), Slurm REST abuse (CVE-2023-41915 privesc, mass-node worm submission, self-resubmit persistence), MLflow artifact poisoning (CVE-2024-1483 path traversal, pickle upload, model registry poison), K8s ML workload attack (privileged DaemonSet, kube-system CronJob, cluster-admin RBAC), gradient poisoning (Byzantine norm spikes, sign flip fraction, backdoor trigger, checkpoint integrity), hardware sabotage (nvidia-smi power limit override, IPMI fan override, high-entropy disk write), model exfiltration (bulk checkpoint export, HuggingFace push, ONNX export), cluster worm (SSH key propagation, lateral movement, process spawn flood). Defensive pair: T102 SPECTER THUNDERBOLT. 232 tests.
CVE-2023-48022 CVE-2023-41915 CVE-2024-1483 ML Infrastructure
M137 — VOICE GUARD
Voice Guard
Runtime defence for AI voice agents and IVR infrastructure. 8 detectors: SIP protocol abuse (INVITE flood, caller ID spoofing, DTMF injection, SIP auth bypass), prompt injection in transcripts (role override, delimiter injection, jailbreak prefixes, zero-width/BIDI Unicode, homoglyph injection), adversarial audio detection (PhantomSound arXiv:2309.06960 burst detection, DolphinAttack ultrasonic carrier, psychoacoustic masking, RTP entropy spike, spectral flatness anomaly), voice clone detection (mel-cepstral distortion, ElevenLabs fricative fingerprint, XTTS v2 smoothing artifacts, speaker embedding drift, GAN periodic artifacts), session harvest attempt (system prompt probe, credential extraction, internal tool enumeration, PII fishing, lateral movement probe), IVR sabotage (noise injection, context exhaustion, webhook flood, silence DoS, DTMF storm), unauthorized barge-in (WebSocket origin validation, RTP SSRC hijack, relay certificate forgery, timestamp injection), voice agent recon (SIP OPTIONS sweep, STIR/SHAKEN harvest, provider enumeration, IVR tree mapping). Defensive pair: T107 SPECTER WIRE. 186 tests.
arXiv:2309.06960 DolphinAttack RFC 3261 Voice AI Security
M138 — SANDBOX GUARD
Sandbox Guard
Runtime detection of AI sandbox and container escape attacks. 8 detectors: indirect_prompt_injection (SILENTBRIDGE CSS hidden text font-size:0px/color:transparent, zero-width Unicode U+200B/200C/200D/FEFF clusters, HTML comment injection, markdown image beacons), mcp_tool_call_abuse (CLAWCHAIN CVE-2026-44115 heredoc $() shell expansion, CVE-2026-44118 X-MCP-Sender-Is-Owner:true bearer spoof, tool description poisoning, SSRF targets), toctou_symlink_race (CVE-2026-44112/113 TOCTOU races, CVE-2025-31133 runc /dev/null symlink → /proc/sys/kernel/core_pattern, privileged symlink targets, namespace escape), js_prototype_chain_escape (CVE-2026-5752 Cohere Terrarium document.__proto__.constructor.constructor, CVE-2026-22686 enclave-vm Error prototype chain, Function() constructor abuse, child_process execSync), python_sandbox_escape (CVE-2026-2275 CrewAI ctypes.CDLL + ctypes.util.find_library('c') + libc.system, importlib abuse, __subclasses__ traversal, pickle __reduce__ RCE), container_escape_attempt (CVE-2025-31133 core_pattern write, CVE-2025-9074 Docker Desktop 192.168.65.7:2375 Engine API, cgroup release_agent, privileged bind mount, Docker socket), sandbox_network_exfil (DNS tunneling base32 subdomain exfil, AWS/private key exfiltration, C2 beacon loops, IMDS SSRF 169.254.169.254, raw socket ICMP), multi_platform_chain_detection (SILENTBRIDGE→CLAWCHAIN chains, JS prototype→OS command, ctypes→network exfil, container escape→persistence, WMD-class destruction). Defensive pair: T108 SPECTER SANDBOX. 215 tests.
CVE-2025-31133 CVE-2025-9074 CVE-2026-5752 CVE-2026-22686 CVE-2026-2275 Container Escape
M139 — COPILOT GUARD
Copilot Guard
Runtime detection of Microsoft 365 Copilot and M365 platform attacks. 8 detectors: device_code_phishing (OAuth device code flow abuse, tenant-wide phishing, GetCredentialType timing), copilot_prompt_injection (Embrace/Ignore/Override techniques arXiv:2406.00137, Copilot-specific injection, CVE-2024-49035 Copilot Studio privesc), graph_api_harvest (Graph $batch endpoint abuse, bulk M365 data exfil, CA policy enumeration), teams_siege_detection (webhook abuse, CSS hidden channel injection, meeting summary hijack, guest pivot), admin_pipeline_abuse (admin email permutation, consent phishing, stealth UA rotation, password spray), ghost_hand_detection (GHOST-HAND zero-attribution via Microsoft.Copilot sole actor, calendar C2 persistence, DOCSTRIKE trigger), tenant_recon (Azure AD enumeration, Conditional Access mapping, service principal discovery), tenant_annihilation (mass deletion, CA policy wipe, backdoor OAuth app, PIM abuse, credential rotation lockout). Defensive pair: T111 SPECTER 360. 212 tests.
CVE-2024-49035 arXiv:2406.00137 GHOST-HAND Graph API Microsoft 365 Copilot
M140 — DAG GUARD
DAG Guard
Runtime integrity monitoring for knowledge graph and DAG-based reasoning systems. 5 subsystems: EDGE_INTEGRITY (false edge injection detection, confidence weight manipulation, low-trust→high-trust cluster alerts), VECTOR_MONITOR (anomalous evidence vector detection, cosine similarity attacks, batch injection volume anomaly, baseline drift), TRUST_PROPAGATION_GUARD (trust laundering detection, hub node monitoring, rapid trust score rise without evidence), CYCLE_DETECT (continuous cycle detection, amplification cycle identification, VAULT cycle injection signature matching), REPORT (WARLORD-compatible JSON, CVSS scoring, GraphViz attack subgraph, evidence chain, remediation). Defensive pair: T120 SPECTER VAULT (DAG-POISON/DAG-TRAVERSE/DAG-EXTRACT). 150 tests.
DAG Integrity Knowledge Graph Trust Propagation EU AI Act MITRE ATLAS
M141 — TRAPDOOR GUARD
Trapdoor Guard
AI agent persistence and rootkit detection. 10 subsystems: CONFIG_INTEGRITY (hooks.Stop/PostToolUse/PreToolUse in settings.json, external C2 endpoints, shell exec in config values), HOOK_INTEGRITY (SPECTER ZOMBIE T123 confirmed vector, LangChain/CrewAI/PraisonAI lifecycle hooks, Radware ZombieAgent pattern), RULES_FILE_GUARD (CLAUDE.md/cursorrules injection, system prompt override, HTML comment hiding, zero-width Unicode, tool-call directives), MEMORY_PERSISTENCE_DETECT (vector store poisoning, dormant trigger payloads, cross-session persistence across ChromaDB/Pinecone/Weaviate/Qdrant/Redis/Mem0), MCP_MANIFEST_GUARD (unauthorised tool additions, capability escalation, tool shadowing, rug pull patterns), WORKFLOW_INTEGRITY (n8n/Flowise/Langflow injection, C2 webhook, schedule node injection), SUPPLY_CHAIN_MONITOR (known-malicious PyPI/npm, postinstall exec, HuggingFace model card RCE, Docker base image abuse), NETWORK_BEACON_DETECT (cron+curl beacons, DNS C2 encoded subdomains, LLM API C2 relay, C2 framework signatures), PROPAGATION_DETECT (agent-to-agent infection, fleet propagation, shared memory contamination, Zombie agent persistence), PROCESS_PERSISTENCE_DETECT (crontab, systemd, shell profile, rc.local, launchd, at jobs). RSSA escalation on CRITICAL findings. Defensive pairs: T123 SPECTER ZOMBIE (primary), T116 VENOM, T88 SHADOW, T110 SPAWN, T122 GHOST, T121 FEDERATION. 296 tests.
Agent Persistence Rootkit Detection Hook Integrity Rules File Guard MCP Security MITRE ATLAS

Full Fleet Visibility. One Interface.

Real-time threat telemetry across all 141 modules — live in the AI Shield Command GUI:

[AI SHIELD COMMAND] v3.0.0 — 141 modules active — 17 verticals

[FLEET STATUS] 121/122 modules HEALTHY ORCHESTRATOR LIVE
[THREAT FEED] Real-time · MITRE ATLAS mapped · OWASP LLM Top 10

[M99 CORE] CLEAN 0 injections detected (last 60s)
[M104 ADVERSARIAL] CLEAN FGSM/PGD patterns: 0 anomalies
[M108 AGENT RUNTIME] ALERT Anomalous tool chain: AGENT-007
[M110 SUPPLY CHAIN] CLEAN Adapter checksums verified
[M300 NTN SHIELD] CLEAN Satellite feed integrity: NOMINAL
[M115 MEMORY GUARD] CLEAN 28 detectors active · 0 memory threats
[M118 MCP SHIELD] CLEAN 28 detectors active · 0 MCP threats
[M119 ECONOMIC] CLEAN DoW detectors active · token burn rate nominal
[M120 REASONING] CLEAN 8 detectors · 0 reasoning integrity violations
[M121 MODEL INT.] CLEAN Baseline profiler active · 0 anomalies

[ATLAS COVERAGE] 100% — all tactics mapped
[DETECTION LATENCY] avg 23ms · p99 48ms
[BLOCKED LAST 24H] 47 threats blocked · 0 false positives confirmed

Designed for Autonomous Operation

AI Shield runs as a containerised microservice fleet. Each module is an independent FastAPI service behind a central orchestrator. Zero shared state between modules — a compromised module cannot contaminate the fleet. Designed to operate inside air-gapped environments, Kubernetes clusters, and CI/CD pipelines.

Containerised Fleet

Every module ships as a standalone Docker container. UBI9-certified images for enterprise deployments. Zero external runtime dependencies. Fully air-gap capable.

Real-Time Detection Pipeline

Sub-50ms detection latency at p99. Modules run in parallel — no serial bottlenecks. Threat signals routed to SIEM within 100ms of detection via Splunk HEC, CEF, or LEEF.

NIGHTFALL Integration

AI Shield and NIGHTFALL share a bidirectional threat feed. NIGHTFALL offensive findings automatically generate AI Shield blocking rules — closing the loop between testing and production defence.

Zero Trust Module Isolation

No module trusts another. Each module validates its own inputs, maintains its own signature database, and communicates only through the orchestrator API. Compromise radius is bounded to a single module.

Behavioural Baselines

AI Shield builds statistical baselines for every agent it monitors. Anomaly detection uses Mahalanobis distance against the baseline — not static rules. Adapts to fleet changes automatically.

API & SDK Integration

REST API with OpenAPI spec. Python SDK published to PyPI. Hooks for LangChain, AutoGen, CrewAI, and custom agent orchestrators. Middleware injection supported for transparent deployment.

Every Threat Mapped to Every Standard

100% Coverage

MITRE ATLAS

  • AML.T0051 — LLM Prompt Injection
  • AML.T0056 — Rogue ML Services
  • AML.T0040 — ML Model Poisoning
  • AML.T0043 — Adversarial ML Attack
  • AML.T0048 — Exfiltration via ML Inference
  • AML.T0054 — Supply Chain Compromise
  • AML.T0058 — Backdoor ML Model
  • All remaining ATLAS tactics mapped
10 / 10

OWASP LLM Top 10 — 2025

  • LLM01 — Prompt Injection
  • LLM02 — Sensitive Information Disclosure
  • LLM03 — Supply Chain
  • LLM04 — Data and Model Poisoning
  • LLM05 — Improper Output Handling
  • LLM06 — Excessive Agency
  • LLM07 — System Prompt Leakage
  • LLM08 — Vector and Embedding Weaknesses
  • LLM09 — Misinformation
  • LLM10 — Unbounded Consumption
Regulatory

Compliance Frameworks

  • EU AI Act — High-Risk System Controls
  • NIST AI RMF — Govern, Map, Measure, Manage
  • ISO/IEC 42001 — AI Management Systems
  • GDPR — AI-driven personal data flows
  • NCSC AI Security Guidelines
  • SPARTA — Space/NTN threat taxonomy

Test Offensively. Defend in Production.

AI Shield is the defensive counterpart to NIGHTFALL. Every offensive finding from NIGHTFALL tools generates a real-time blocking rule in AI Shield. The feedback loop is bidirectional — running FORGE against a model produces AI Shield policy. Running ARSENAL against an agent produces runtime detection signatures.

Offensive Testing
NIGHTFALL
99-tool offensive framework
Findings Correlation
WARLORD
Autonomous campaign engine — aggregates findings
Runtime Defence
AI SHIELD
141-module autonomous defence platform
Enterprise SIEM
redspecter-siem
Splunk · Sentinel · QRadar
Governance
IDRIS
AI asset discovery and governance
Reporting
OVERWATCH
Consolidated posture reporting

Deploy Anywhere

AI Shield modules are available as Docker containers (UBI9 certified), Python packages via PyPI, and as Kubernetes Helm charts. All deployment paths produce the same module behaviour — consistent signatures, consistent latency, consistent API surface.

Docker
docker pull
PyPI
pip install
Kubernetes
Helm chart
Red Hat UBI9
Certified image
Azure
Container instances
AWS
ECS / EKS
GCP
Cloud Run / GKE
Air-Gap
Offline install
REST API
OpenAPI 3.1

Deployment Notice

Red Specter AI Shield is an authorised security product intended for deployment on systems you own or are contractually authorised to protect. AI Shield modules operate in monitoring and blocking modes — ensure that blocking mode deployment is authorised by your organisation's change management process before activation. Module behaviour must be validated against your specific AI agent deployment before production rollout. All modules operate under Apache License 2.0. Red Specter Security Research Ltd accepts no liability for incidents arising from misconfiguration or unauthorised deployment.

Deploy AI Shield
Start Defending Your AI Fleet

141 modules. 17 verticals. Real-time threat detection across your entire AI agent deployment. Launch the GUI to see your fleet status, or read the documentation to begin a self-hosted deployment.

Launch AI Shield GUI Read Documentation