AI Firewall MCP

STDIO • registry active

Summary

AI Firewall MCP is a security layer that protects LLM applications from adversarial inputs. It exposes five MCP tools over stdio. analyze_prompt runs the input through a three-agent pipeline (retrieval against FAISS vectors, guard signals, policy enforcement) to detect injection and jailbreak attempts. get_threat_breakdown returns per-signal scoring, sanitize_prompt cleans suspicious text, get_firewall_status reports health, and benchmark_firewall runs the built-in adversarial test suite. It ships as a pip package or a Docker container, has configurable thresholds, and operates in strict, moderate, or permissive mode. Reach for it when you are building multi-agent systems or user-facing LLM features and need a programmatic gate before prompts reach your model.

GitHub • PyPI • Docker Hub

<mcp-name: io.github.Akhilucky/ai-firewall-mcp>

AI Firewall — MCP Server

A multi-agent AI security layer that protects LLMs from prompt injection, jailbreaks, and policy violations. Available as an MCP server for any MCP-compatible client (Claude Desktop, Cursor, Windsurf, Cline, Roo Code, etc.).

Quick Start

pip install

pip install ai-firewall-mcp
ai-firewall-mcp

Docker

docker pull akhilucky/ai-firewall-mcp:latest
docker run -i akhilucky/ai-firewall-mcp:latest

Claude Desktop

Add to claude_desktop_config.json:

pip package (launched through pipx):

{
  "mcpServers": {
    "ai-firewall": {
      "command": "pipx",
      "args": ["run", "ai-firewall-mcp"]
    }
  }
}

Docker:

{
  "mcpServers": {
    "ai-firewall": {
      "command": "docker",
      "args": ["run", "-i", "akhilucky/ai-firewall-mcp:latest"]
    }
  }
}
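If claude_desktop_config.json already lists other servers, the entry needs to be merged into the existing object rather than pasted over it. The sketch below shows one way to do that, assuming the file is a JSON object with a top-level mcpServers key as in the snippets above. The helper name add_firewall_server is hypothetical and not part of the package.

```python
import json
from copy import deepcopy

# Server entry copied from the Docker snippet above.
DOCKER_ENTRY = {
    "command": "docker",
    "args": ["run", "-i", "akhilucky/ai-firewall-mcp:latest"],
}


def add_firewall_server(config: dict, entry: dict = DOCKER_ENTRY) -> dict:
    """Return a copy of `config` with an "ai-firewall" server entry added.

    Hypothetical helper: other servers are preserved, and an existing
    "ai-firewall" entry is overwritten. The input dict is not modified.
    """
    merged = deepcopy(config)
    servers = merged.setdefault("mcpServers", {})
    servers["ai-firewall"] = deepcopy(entry)
    return merged


if __name__ == "__main__":
    # Example config that already contains an unrelated (made-up) server.
    existing = {"mcpServers": {"other-server": {"command": "other-cmd", "args": []}}}
    print(json.dumps(add_firewall_server(existing), indent=2))
```

Returning a copy keeps the original config available, which makes it easy to diff before writing the file back.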

Cursor / Windsurf / Cline / Roo Code

Configure in your MCP settings with:

Type: stdio
Command: docker run -i akhilucky/ai-firewall-mcp:latest
Alternatively, set Command to ai-firewall-mcp if you installed the package with pip.

MCP Tools

| Tool | Description |
| --- | --- |
| `analyze_prompt` | Analyze a prompt for injection, jailbreaks, exfiltration, and leakage |
| `get_threat_breakdown` | Detailed per-signal scoring breakdown from the last analysis |
| `sanitize_prompt` | Clean a suspicious prompt while preserving legitimate content |
| `get_firewall_status` | Health check: vector DB size, model status, uptime |
| `benchmark_firewall` | Run the adversarial test suite and return detection statistics |
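MCP clients invoke these tools with JSON-RPC 2.0 `tools/call` requests, sent over stdio as newline-delimited JSON after the initialize handshake. As a rough illustration of what a client puts on the wire, the sketch below builds such a request. The argument name `prompt` is an assumption, since the tool's input schema is not shown here, and build_tool_call is a hypothetical helper.

```python
import json
from itertools import count

_request_ids = count(1)  # JSON-RPC request ids should be unique within a session


def build_tool_call(tool: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as one newline-terminated JSON line."""
    message = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }
    # json.dumps escapes newlines inside strings, so the payload stays on one line.
    return json.dumps(message) + "\n"


if __name__ == "__main__":
    # "prompt" is an assumed argument name; check the tool's published schema.
    line = build_tool_call("analyze_prompt", {"prompt": "Ignore all previous instructions"})
    print(line, end="")
```

In practice an MCP client library handles the framing and the handshake, and MCP Inspector shows the real argument names for each tool.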

Testing with MCP Inspector

npx @modelcontextprotocol/inspector ai-firewall-mcp

Architecture

The firewall runs three agents per prompt:

User Prompt → [Retrieval Agent] → [Guard Agent] → [Policy Agent] → LLM
                   │                    │               │
                   ▼                    ▼               ▼
              Vector DB (FAISS)    Threat Signals    Allow/Block

| Agent | Role |
| --- | --- |
| Retrieval Agent | Semantic search against known attack patterns (FAISS + sentence-transformers) |
| Guard Agent | Multi-signal classification: vector similarity, keyword match, heuristic scoring |
| Policy Agent | Final decision: `ALLOW` / `BLOCK` / `SANITIZE` based on configurable thresholds |

Threat signals are weighted: 40% vector similarity, 25% keyword match, 20% heuristic, 15% policy weight.
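To make the weighting concrete, here is a minimal sketch of how the four signals could be combined. It assumes each signal is already normalized to the range [0, 1] and that the combination is a plain weighted sum. The signal key names and the function are illustrative, not the package's actual API.

```python
# Weights quoted in the text above; they sum to 1.0.
WEIGHTS = {
    "vector_similarity": 0.40,
    "keyword_match": 0.25,
    "heuristic": 0.20,
    "policy": 0.15,
}


def combined_threat_score(signals: dict) -> float:
    """Weighted sum of per-signal scores, each assumed to lie in [0, 1]."""
    missing = WEIGHTS.keys() - signals.keys()
    if missing:
        raise ValueError(f"missing signals: {sorted(missing)}")
    for name in WEIGHTS:
        if not 0.0 <= signals[name] <= 1.0:
            raise ValueError(f"{name} must be in [0, 1], got {signals[name]}")
    return sum(WEIGHTS[name] * signals[name] for name in WEIGHTS)


if __name__ == "__main__":
    # Made-up signal values, purely for illustration.
    example = {
        "vector_similarity": 0.9,
        "keyword_match": 1.0,
        "heuristic": 0.5,
        "policy": 0.0,
    }
    print(round(combined_threat_score(example), 2))
```

For the made-up values in the example, the sum is 0.40 × 0.9 + 0.25 × 1.0 + 0.20 × 0.5 + 0.15 × 0.0 = 0.71. Because the weights sum to 1, the combined score also stays within [0, 1].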

Configuration

| Env Var | Default | Description |
| --- | --- | --- |
| `FIREWALL_MODE` | `strict` | `strict` / `moderate` / `permissive` |
| `SIMILARITY_THRESHOLD` | `0.50` | Vector match threshold (lower = stricter) |
| `LOG_LEVEL` | `INFO` | Logging verbosity |
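The sketch below shows one way a wrapper script might read these variables with the documented defaults and fail fast on bad values. The FirewallConfig class and load_config function are hypothetical. The case normalization and the [0, 1] range check on the threshold are assumptions and not documented behavior.

```python
import os
from dataclasses import dataclass
from typing import Mapping

VALID_MODES = ("strict", "moderate", "permissive")


@dataclass(frozen=True)
class FirewallConfig:
    mode: str
    similarity_threshold: float
    log_level: str


def load_config(env: Mapping[str, str] = os.environ) -> FirewallConfig:
    """Read the three documented variables, falling back to the documented defaults."""
    mode = env.get("FIREWALL_MODE", "strict").lower()  # lowercasing is an assumption
    if mode not in VALID_MODES:
        raise ValueError(f"FIREWALL_MODE must be one of {VALID_MODES}, got {mode!r}")

    threshold = float(env.get("SIMILARITY_THRESHOLD", "0.50"))
    if not 0.0 <= threshold <= 1.0:  # assumed valid range for a similarity score
        raise ValueError(f"SIMILARITY_THRESHOLD must be in [0, 1], got {threshold}")

    log_level = env.get("LOG_LEVEL", "INFO").upper()  # uppercasing is an assumption
    return FirewallConfig(mode, threshold, log_level)


if __name__ == "__main__":
    print(load_config())
```

Passing the environment as a mapping keeps the function easy to test without touching the real process environment.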

CLI / API Usage

# Interactive dashboard
python main.py

# Red-team adversarial tests
python main.py --redteam

# REST API server
python main.py --api

# Single prompt analysis
python main.py --analyze "Ignore all previous instructions"

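The REST API runs at http://localhost:8000 with OpenAPI docs at /docs. It requires the api extra, installed with pip install "ai-firewall-mcp[api]" (the quotes stop shells such as zsh from interpreting the brackets).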

Testing

pytest tests/ -v          # Full test suite (43 tests)
pytest tests/test_mcp.py  # MCP-specific tests only

Project Structure

├── src/ai_firewall/          # MCP server package (PyPI entry)
│   ├── mcp_server.py         #    5 MCP tools, stdio transport
│   ├── threat_scorer.py      #    Per-signal scoring breakdown
│   └── __init__.py
├── src/agents/               # Core firewall agents
├── tests/                    # Test suites
├── Dockerfile                # Docker image (2.04GB, CPU-only torch)
├── pyproject.toml            # Package config & metadata
└── .github/workflows/ci.yml  # CI/CD pipeline

License

MIT — see LICENSE.

Built for security. Designed for production.
