No description

Python 99.9%
Dockerfile 0.1%

Find a file

nicksolarsoul 2ccfd6675b feat(eval): print BENCHMARKS.md reference and update hint at run end The e2e summary now ends by pointing at the tracked BENCHMARKS.md and restating the run's headline numbers in the same percent form, so an operator can copy an improvement or regression straight into the table. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>		2026-05-30 13:26:34 +07:00
.claude/commands	feat(eval): add inference, utility, partial-leak and adversarial benchmark checks	2026-05-29 11:46:53 +07:00
.github	chore: add issue/PR templates, contributing & security policy (#26 )	2026-05-30 10:31:30 +07:00
docs/assets	chore: remove superpowers docs and gitignore them	2026-05-21 22:37:00 +07:00
examples	feat: custom Presidio recognizers and known-entity dictionaries	2026-05-29 07:56:33 +07:00
prompts	fix(eval): forbid hedged figure restatement; size attacker token budget	2026-05-30 11:22:03 +07:00
src/hey_jude	perf(anonymizer): self-leak validator root-fix + quasi-identifier generalization	2026-05-30 10:25:06 +07:00
tests	feat(eval): print BENCHMARKS.md reference and update hint at run end	2026-05-30 13:26:34 +07:00
.env.example	feat: add tamper-evident request-level audit logging (#11 )	2026-05-29 08:21:26 +07:00
.gitignore	feat: add tamper-evident request-level audit logging (#11 )	2026-05-29 08:21:26 +07:00
AGENTS.md	chore: add no-edge-case-handling law (AGENTS.md), cover new paths, strip test salvage	2026-05-30 09:38:28 +07:00
BENCHMARKS.md	docs: percent-based scores and tighter BENCHMARKS.md	2026-05-30 13:23:44 +07:00
CLAUDE.md	chore: add no-edge-case-handling law (AGENTS.md), cover new paths, strip test salvage	2026-05-30 09:38:28 +07:00
CONTRIBUTING.md	chore: add issue/PR templates, contributing & security policy (#26 )	2026-05-30 10:31:30 +07:00
docker-compose.yml	Initial commit	2026-05-19 16:40:31 +07:00
Dockerfile	refactor: move substitutor prompt into prompts directory	2026-05-26 16:17:42 +07:00
LICENSE	Initial commit	2026-05-19 16:40:31 +07:00
pyproject.toml	feat: add tamper-evident request-level audit logging (#11 )	2026-05-29 08:21:26 +07:00
README.md	Merge pull request #22 from sure-scale/docs/model-tiering-guidance	2026-05-29 08:54:03 +07:00
SECURITY.md	chore: add issue/PR templates, contributing & security policy (#26 )	2026-05-30 10:31:30 +07:00
uv.lock	feat: add tamper-evident request-level audit logging (#11 )	2026-05-29 08:21:26 +07:00

README.md

Hey Jude

Privacy gateway for legal LLM workflows.

Hey Jude sits between your app and your LLM provider. It strips PII from prompts before they leave your environment, then restores the original details in the response. Your users see real names; the cloud LLM never does.

It uses a local LLM to understand context — so legal defined terms like "the Purchaser" stay intact while real names, emails, and addresses get replaced with semantic placeholders like INVESTMENT_BANK_01 or PERSON_02. A Presidio-based safety net catches anything the LLM misses.

This is a helper layer for data minimization, not a guarantee. Use it as part of a broader confidentiality strategy.

How It Works

Your app sends a chat completion request to Hey Jude (OpenAI-compatible API).
A local LLM analyzes the text and identifies real PII vs. legal/structural terms.
PII gets replaced with semantic placeholders. Legal defined terms are kept.
A Presidio safety net scans the result for anything the LLM missed.
The sanitized prompt is forwarded to your chosen LLM provider.
The response comes back, placeholders are swapped for originals, and your app gets a normal-looking reply.

Why It Exists

Context-aware anonymization: A local LLM understands that "Goldman Sachs" is PII but "the Purchaser" is a legal term — something regex and NER can't do reliably.
Semantic placeholders: INVESTMENT_BANK_01, not ORGANIZATION_01. The downstream LLM keeps enough context to reason well.
Safety net: Presidio runs after the LLM as a second pass. Configurable as warn (auto-fix), strict (reject), or off.
Drop-in API: Exposes OpenAI, Anthropic, and Gemini-compatible endpoints. Point existing SDKs at the gateway.
Fully local by default: Runs without cloud keys using Ollama for both anonymization and demo responses.
Cloud routing when ready: Route anonymized prompts to OpenAI, Anthropic, Gemini, Azure, or any LiteLLM-compatible provider.

Quick Start

1. Install Ollama and Pull the Default Model

ollama pull qwen3.5:4b

2. Run the Gateway

git clone https://github.com/nickwatson/hey-jude.git
cd hey-jude
cp .env.example .env
docker compose up --build

The gateway will run at http://localhost:4005.

3. Test It

In another terminal:

python3 tests/e2e/test_gateway.py

The default setup is fully local: Redis runs in Docker, and Ollama runs on your host machine.

Default Configuration

The defaults work out of the box for most users.

Variable	Default	Purpose
`REDIS_URL`	`redis://localhost:6379/0`	Temporary mapping storage
`API_KEY`	`sk-heyjude-dev`	Gateway authentication
`LOCAL_LLM_URL`	`http://localhost:11434/v1`	Local anonymization endpoint
`LOCAL_LLM_MODEL`	`qwen3.5:4b`	Local anonymization model
`LOCAL_LLM_API_KEY`	(empty)	API key for cloud-hosted anonymization models
`EXTERNAL_LLM_MODEL`	`ollama_chat/qwen3.5:4b`	Destination model via LiteLLM
`EXTERNAL_LLM_API_BASE`	`http://localhost:11434`	LiteLLM API base for Ollama
`ANONYMIZATION_MODE`	`llm`	`llm` (context-aware) or `mechanical` (NER-only)
`SAFETY_NET_STRICTNESS`	`warn`	`warn` (auto-fix), `strict` (reject), or `off`
`DOCUMENT_UNREADABLE_ACTION`	`reject`	What to do when an uploaded file has no readable text layer: `reject`, `warn`, or `skip`
`CUSTOM_RECOGNIZERS_PATH`	(unset)	Path to a YAML/JSON file of custom Presidio regex recognizers
`KNOWN_ENTITIES_PATH`	(unset)	Path to a YAML/JSON known-entity dictionary
`AUDIT_ENABLED`	`false`	Enable request-level audit logging
`AUDIT_DESTINATION`	`stdout`	`stdout` or a file path
`AUDIT_CONTENT_LEVEL`	`metadata`	`metadata` (digests only), `anonymized` (PII-free payload), `full` (raw content)
`AUDIT_ROTATION`	`monthly`	Segment files by period: `none`, `daily`, `monthly`
`AUDIT_FAILURE_MODE`	`ignore`	`ignore` (logging never blocks a request) or `fail` (fail-closed)

When running through Docker Compose, the service automatically uses host.docker.internal so the container can reach Ollama on your Mac.

Choosing Models

Hey Jude uses two separate models for two different jobs, and you size them independently:

Role	Setting	Job	Pick for
Anonymizer	`LOCAL_LLM_MODEL` (+ `LOCAL_LLM_URL`)	Classify entities and emit placeholder JSON	Speed and cost — a small, fast model is enough
Destination	`EXTERNAL_LLM_MODEL`	Do the actual legal work on the already-anonymized prompt	Capability — your strongest model

The anonymizer's task is narrow and structured (find PII, output a fixed JSON schema), so it does not need a frontier model. Putting the cheapest model that holds classification quality here cuts cost and latency on every request, because the anonymizer runs once per message before anything reaches the destination. Reserve the expensive, capable model for EXTERNAL_LLM_MODEL, which never sees raw PII anyway.

The two run on independent endpoints and providers — e.g. a small Azure or Ollama model for anonymization and Gemini, Anthropic, or OpenAI for the destination — so you tune the cost/quality trade-off on each without touching the other.

Domain-Specific Detection

Default NER misses the abbreviated, inconsistent names common in legal text ("Call w/ J. Smith re: Acme merger"). Two opt-in mechanisms close the gap. Templates live in examples/.

Custom recognizers (CUSTOM_RECOGNIZERS_PATH) add regex-based entity types — matter numbers, client codes, opposing-counsel formats. They run in the Presidio safety net and as a mechanical-mode detection strategy.

Known-entity dictionary (KNOWN_ENTITIES_PATH) is a firm-maintained list of the names you must never leak — clients, personnel, matter names. Listed entities are matched case-insensitively and guaranteed replaced before the prompt reaches the LLM, so a critical name never depends on the model noticing it. All spelling variants (term + aliases) collapse to one placeholder.

By default an auto-numbered placeholder (e.g. CLIENT_NAME_01) is assigned per request. Set replace_with on an entry to fix its placeholder so it stays identical across every request.

Hey Jude extracts text from common legal document formats before anonymization, including text PDFs, DOCX, HTML, EML, TXT, Markdown, and RTF. Scanned PDFs, flattened PDFs, and images are not OCRed yet; by default they are rejected so unreadable content is not forwarded without anonymization.

Audit Logging

Set AUDIT_ENABLED=true to record one envelope per request: timestamps, latency, which external model it was routed to, entity count, sensitivity, safety-net result, the per-entity anonymization decisions, and SHA-256 digests of the input and the anonymized output. This is the artifact that proves anonymization happened and that only PII-free content left the network.

Per-entity decisions. In LLM mode each record carries a decisions list — what the anonymizer found and what it did to it (action is replace, keep, or generalize) with the reason. At the default metadata level this stores entity_type, action, and reason only, never the raw entity text; full additionally records the original text and its replacement. This is the per-matter "what did we send, what did we withhold, why" trail for discovery and malpractice defense.

Tamper-evident. The log is hash-chained JSONL: each record carries the hash of the previous one, so editing or deleting any historical record breaks the chain from that point on — detectable even by someone with write access. Verify a segment at any time:

hey-jude audit verify audit/audit-2026-05.jsonl

Set AUDIT_HMAC_KEY to bind the chain to a secret so an attacker who cannot read the key cannot recompute valid hashes. Walk a segment for conflict checks, client audits, or discovery production:

hey-jude audit query audit/audit-2026-05.jsonl --matter M-123456 --since 2026-05-01

Content level. The default metadata stores no raw client PII — only digests — so the audit trail itself does not become a confidential-data store. Choose anonymized to retain the PII-free payload (useful as a malpractice-defensible record of what the AI was actually asked and answered, without storing client identities). full additionally persists the raw pre-anonymization content and is a deliberate PII honeypot; it logs a startup warning and should be reserved for environments where that risk is understood. Tag requests with the X-Heyjude-Matter-Id header so records are queryable by matter; enable AUDIT_ACTOR_HEADER only if your firm policy permits attorney attribution.

Immutability vs. retention. A hash chain makes history immutable, but legal duties (matter-close destruction, data-subject erasure, retention schedules) require eventual deletion. Hey Jude resolves this with period segments (AUDIT_ROTATION): each month/day is an independent chain in its own file, so an expired segment can be destroyed wholesale without invalidating the active chain. Suspend rotation and deletion while a matter is under legal hold. For cryptographic-grade WORM, point AUDIT_DESTINATION at a write-once volume (chattr +a on Linux) or ship sealed segments to object storage with an immutability lock (e.g. S3 Object Lock). Keep the log on encrypted disk; Hey Jude does not encrypt records itself.

Native Python

If you prefer not to use Docker, run Redis yourself and start the app directly:

pip install -e ".[dev]"
python3 -m spacy download en_core_web_lg
uvicorn hey_jude.main:app --host 0.0.0.0 --port 4005

Advanced Models

The default qwen3.5:4b target is chosen for low-friction local setup. If you want a different local model, pull it with Ollama and update LOCAL_LLM_MODEL plus EXTERNAL_LLM_MODEL.

Use case	Model
Fastest tiny local test	`qwen3.5:0.8b`
Default local setup	`qwen3.5:4b`
Stronger local setup	`qwen3.5:9b`
High-end local setup	`qwen3.6:35b-a3b`

Example:

ollama pull qwen3.5:9b
LOCAL_LLM_MODEL=qwen3.5:9b
EXTERNAL_LLM_MODEL=ollama_chat/qwen3.5:9b

To use Apple MLX instead of Ollama, serve an OpenAI-compatible endpoint and point LOCAL_LLM_URL or EXTERNAL_LLM_API_BASE at that server.

To route the final prompt to a cloud provider, set EXTERNAL_LLM_MODEL to any LiteLLM model identifier and provide that provider's API key, for example OPENAI_API_KEY, ANTHROPIC_API_KEY, or GEMINI_API_KEY.

Azure AI as Anonymization Backend

Instead of running Ollama locally, you can use an Azure AI-hosted model for the anonymization layer. This is useful on machines where local inference is slow (e.g., laptops without dedicated GPU).

Azure AI Foundry exposes OpenAI-compatible endpoints for models deployed from the model catalog. Set these in your .env:

LOCAL_LLM_URL=https://<your-resource>.openai.azure.com/openai/v1
LOCAL_LLM_MODEL=DeepSeek-V4-Pro
LOCAL_LLM_API_KEY=<your-azure-api-key>

Available models (same endpoint, swap LOCAL_LLM_MODEL):

Model	`LOCAL_LLM_MODEL` value	Notes
DeepSeek V4 Pro	`DeepSeek-V4-Pro`	Recommended — fast, strong at structured JSON output
Kimi K2.6	`Kimi-K2.6`	Reasoning model, needs high `max_tokens` (uses thinking tokens)

Any model deployed to your Azure AI project that serves an OpenAI-compatible chat completions endpoint will work. The gateway sends requests to {LOCAL_LLM_URL}/chat/completions with both api-key and Authorization: Bearer headers.

Prompt caching. The anonymization prompt (prompts/anonymize.txt) keeps its large static block — task, classification instructions, and output schema — first, and the per-request variables (the existing placeholder mapping and the message text) last. That fixed prefix is identical on every request, so providers with automatic prompt caching (Azure OpenAI, Anthropic, Gemini) reuse it instead of re-billing the instructions each call, cutting input-token cost and latency on the hot path. If you edit the template, keep the variables at the end or the cached prefix is lost.

E2E Testing

The end-to-end test uses three models:

Role	Model	Purpose
Anonymizer	Configured via `LOCAL_LLM_*`	PII detection and replacement
Destination	Gemini Flash	Receives anonymized prompts
Evaluator	Gemini Pro	Judges anonymization quality (PII leaks, coherence, completeness)

GEMINI_API_KEY=your-key python3 tests/e2e/test_gemini_anonymization.py

The test auto-downloads public-domain legal documents from SEC EDGAR on first run (NDAs, employment agreements, settlement agreements, etc.) and uses them alongside inline test cases. Downloaded documents are cached locally and gitignored.

SDK Integration

Since Hey Jude behaves like standard LLM endpoints, existing clients can point at the gateway.

Mike OSS

Mike can route OpenAI, Claude, and Gemini calls through Hey Jude. After starting this gateway, set in Mike's backend/.env:

HEY_JUDE_ENABLED=true
HEY_JUDE_BASE_URL=http://localhost:4005
HEY_JUDE_API_KEY=sk-heyjude-dev

Python OpenAI SDK

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:4005/v1",
    api_key="sk-heyjude-dev",
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "user", "content": "I am John Doe and I work at Google."}
    ],
)
print(response.choices[0].message.content)

response = client.responses.create(
    model="gpt-4o",
    input="I am John Doe and I work at Google.",
)
print(response.output_text)

Python Anthropic SDK

from anthropic import Anthropic

client = Anthropic(
    base_url="http://localhost:4005",
    api_key="sk-heyjude-dev",
)

response = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    messages=[
        {"role": "user", "content": "I am John Doe and I work at Google."}
    ],
)
print(response.content[0].text)

Node.js OpenAI SDK

import OpenAI from 'openai';

const client = new OpenAI({
  baseURL: 'http://localhost:4005/v1',
  apiKey: 'sk-heyjude-dev',
});

const response = await client.chat.completions.create({
  model: 'gpt-4o',
  messages: [{ role: 'user', content: 'I am John Doe and I work at Google.' }],
});
console.log(response.choices[0].message.content);

const responsesResponse = await client.responses.create({
  model: 'gpt-4o',
  input: 'I am John Doe and I work at Google.',
});
console.log(responsesResponse.output_text);

Node.js Anthropic SDK

import Anthropic from '@anthropic-ai/sdk';

const client = new Anthropic({
  baseURL: 'http://localhost:4005',
  apiKey: 'sk-heyjude-dev',
});

const response = await client.messages.create({
  model: 'claude-3-5-sonnet-20241022',
  max_tokens: 1024,
  messages: [{ role: 'user', content: 'I am John Doe and I work at Google.' }],
});
console.log(response.content[0].text);

License

GNU Affero General Public License v3.0. See LICENSE.