Architecture Overview

Spring Prism follows a decoupled and strictly modular design to ensure portability, performance, and security.

Modules

The heartbeat of the system.

Languages: Java 21 (Virtual Threads for scanning).
Dependencies: Zero Spring or AI dependencies.
Componentry:
- PiiDetector: Interface for regional or data-type specific detection.
- PrismRulePack: Aggregate of detectors for a specific locale (e.g., EuropeRulePack).
- PrismVault: Interface for secure, TTL-managed token-to-data mapping.
- TokenGenerator: Deterministic HMAC-SHA256 signature generator for pseudonymization.
- Core detectors stay deterministic and checksum-driven. NLP models are intentionally excluded from this module.
- Legacy in-core rule packs remain available for 1.x backward compatibility while modular packs are introduced.

The default modular rule-pack baseline.

Dependencies: Depends on prism-core, but keeps all detector logic deterministic and checksum-focused.
Purpose: Provides the default UNIVERSAL detector set through standard Spring Boot auto-configuration.
Compatibility Rule: The starter prefers this modular pack when present, but falls back to legacy in-core UniversalRulePack behavior if the module is absent.

The Spring AI interception layer.

Frameworks: Spring AI 1.x, Reactor, Micrometer Observation.
Entry Point: PrismChatClientAdvisor wraps ChatClient flows for synchronous and streaming requests.
Streaming Safety: Uses StreamingBuffer so fragmented Prism tokens can still be restored across SSE chunks.

The LangChain4j chat integration layer.

Frameworks: LangChain4j chat APIs, Micrometer Observation.
Entry Point: PrismChatModel and PrismStreamingChatModel decorate LangChain4j ChatModel and StreamingChatModel.
Boundary Rule: Keeps Spring-specific concerns out of the integration so prism-core remains portable and zero-dependency.

The Spring Boot 3 autoconfiguration bridge.

Frameworks: Spring Boot 3.4+, Micrometer Observation.
Safety: spring.prism.failure-mode=FAIL_SAFE preserves the legacy fail-open posture with Micrometer error metrics. FAIL_CLOSED is opt-in for production blocking behavior.
Deployments: Supports explicit vault selection through spring.prism.vault.type with auto, in-memory, and redis. auto preserves the low-friction Redis auto-detection path; redis is the recommended mode for multi-node deployments.
Rule Packs: Uses modular auto-discovered PrismRulePack beans when available, with prism-rulepack-common as the default starter baseline and legacy in-core packs preserved as compatibility fallbacks.
Integrations: Publishes PrismChatClientAdvisor for Spring AI and primary LangChain4j wrappers when delegate chat beans are present.
Optional NLP Extensions: Person-name detection may be wired here or in prism-spring-ai through a lazily loaded backend such as Apache OpenNLP, keeping the core zero-dependency.

The MCP client protection layer.

Frameworks: Java 21 HttpClient, subprocess stdio, Micrometer Observation.
Entry Point: PrismMcpClient, with PrismStdioMcpClient and PrismHttpMcpClient as the concrete transport wrappers.
Scope: Protects the Java application in the MCP client role first by sanitizing outbound JSON-like request payloads and restoring Prism tokens in inbound MCP results.
Transport Coverage: Supports local subprocess stdio and hosted Streamable HTTP transports. Docker remains a deployment detail on top of the same transport contract.
Structured Payload Support: Recursively walks strings inside maps, lists, prompt fields, tool arguments, and textual results without introducing Spring dependencies into the module.
Protocol-Aware Support: Handles common MCP request/result shapes more explicitly, including tool argument payloads, message content blocks, and JSON-encoded structured argument values.
Streaming Foundation: Uses a dedicated event-stream parser so Streamable HTTP responses can tolerate multi-line data: events and terminal markers such as [DONE].
Server Groundwork: Includes a reusable server interceptor foundation so later MCP server-role support can share the same payload sanitization and restoration rules.

The embedded observability surface.

Packaging: Static assets are served from META-INF/resources/prism/ inside the dashboard jar.
Data Source: Reads the Prism runtime snapshot from /actuator/prism when Actuator is present and falls back to /prism/metrics otherwise.
Verification: Includes a bundled /prism/?demo=1 fixture mode for visual checks without a live runtime.
Current Scope: Dashboard shell, a live Privacy Score, top-redacted metrics, vault/runtime health, rule-pack activity bars, integration timing cards, threshold-aware alerts, vault posture cards, entity drill-downs, server-side retained history charts, rollup cards, operational filters, JSON/CSV/incident exports, a masked recent-activity audit feed with filters, and a visual refraction-flow explainer.
Operational Guidance: The dashboard payload stays masked, but the routes remain operational endpoints and should be protected by the host application's normal security boundary in production. Dashboard defaults and thresholds are configured centrally under spring.prism.dashboard.*.

Interception: A Spring AI advisor, LangChain4j wrapper, or MCP client transport captures the outbound payload.
Detection: PiiDetector implementations (e.g., EmailDetector) scan the text for PII candidates.
Tokenization: TokenGenerator creates deterministic HMAC-SHA256 tokens.
Vaulting: PrismVault stores the original ↔ token mapping.
Redaction: The original text is replaced with labels like <PRISM_EMAIL_h8a2...].
Execution: The LLM or MCP endpoint processes the sanitized request.
Detokenization: The response is scanned for <PRISM_...> patterns, and PrismVault restores the original PII based on a valid HMAC signature.

HMAC Signatures: Tokens include an HMAC-SHA256 signature calculated with spring.prism.app-secret. This ensures that even if an attacker gains access to the LLM interaction logs, they cannot reverse the tokens without the application's secret key.
TTL Lifecycle: Tokens in the vault automatically expire after a configurable period (default: 30 minutes).
Distributed Vault Option: Redis-backed deployments preserve the same signature validation model as the local vault while allowing multiple application nodes to restore the same Prism token set. Nodes must share both the same Redis-backed vault state and the same spring.prism.app-secret.
No PII Logging: Spring Prism emits Micrometer metrics (gen_ai.prism.redacted.count) rather than logging sensitive values.