AlphaFold Sovereign MCP

Summary

A research-grade biomedical integration that wraps AlphaFold DB and eight other public data sources (MONDO, HPO, Open Targets, ClinVar, gnomAD, DisGeNET, ChEMBL, Ensembl) behind 30 MCP tools. You get variant clinical reports, disease-target landscapes, heuristic druggability scoring, and cross-species structural comparisons via topological data analysis. A local SQLite knowledge graph provides query and export capabilities. Ships with 730 tests and 100% coverage, but this is an unfunded independent project with no scientific validation yet and no certification for clinical or regulated use. Reach for it when you need programmatic access to protein structures and biomedical ontologies in a research context, not production healthcare.

AlphaFold Sovereign MCP

Answering a structural-biology or variant question usually means querying many public databases by hand — AlphaFold DB, Open Targets, ClinVar, gnomAD, and more — and reconciling their formats. This server wraps those sources behind one set of MCP tool calls that run as a local process on your own machine, with no hosted service of ours in the path, no telemetry, and a local SQLite knowledge graph that never leaves your disk. In the default online mode the tools query those public upstreams directly, so the identifiers you look up are sent to them (and one, DisGeNET, needs its own free API key); set ALPHAFOLD_OFFLINE=1 to refuse outbound requests before any socket opens, so no identifier leaves the machine (the knowledge-graph tools still answer from local data; the upstream tools report their source as unavailable). "Sovereign" here means local-first — your compute and stored results stay on your machine — not that the server runs without a network.

In short, it is a Model Context Protocol (MCP) server that wraps AlphaFold DB and 8 other public biomedical data sources behind a set of MCP tool calls, backed by a local SQLite knowledge graph with query and export tools. Results can be persisted through the knowledge-graph API; automatic per-invocation persistence is not yet wired.

This is an unfunded, independent open-source project. It is not a service, not certified for any regulated use, and its outputs are research aids that should be reviewed by qualified humans before any clinical or regulatory use.

This project is not affiliated with, endorsed by, or sponsored by Google DeepMind or EMBL-EBI. "AlphaFold" is a trademark of its respective owner and is used here only to describe the public data (the AlphaFold DB API) that this software consumes.

Status: Beta. Engineering-validated (100% line and branch coverage). Not yet scientifically validated by independent domain experts; not yet deployed in production. See STATUS.md and LIMITATIONS.md.

What this is

A Python MCP server that:

Wraps AlphaFold DB, MONDO, HPO, Open Targets, ClinVar, gnomAD, DisGeNET, ChEMBL, and Ensembl behind MCP tool calls. Each call is a thin orchestration over those upstreams; the server does not add scientific judgement.
Composes upstreams into multi-source workflows: variant cross-reference reports, disease–target landscape summaries, heuristic target-druggability scoring, drug-repurposing candidate ranking, and cross-species structural-distance computation.
Ships a local SQLite knowledge graph (storage/knowledge_graph.py) with query, export, and traversal tools. It loads a curated boot seed automatically when empty (storage/seed.py, 16 entities and 15 relationships; disable with AFSMCP_DISABLE_KG_SEED=1) and can be extended by writing through the knowledge-graph API. There is no automatic per-invocation persistence: the analysis tools do not write to the store on their own.
Includes a topological-data-analysis (TDA) module that computes persistent-homology fingerprints (Betti numbers β₀, β₁, β₂) over Vietoris-Rips filtrations of Cα coordinates, and an L2-distance comparator between those fingerprint vectors. The full persistent-homology features require the optional [tda] extra (gudhi).
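
As a rough illustration of the β₀ part of such a fingerprint, the sketch below counts connected components of a Vietoris-Rips complex at several distance thresholds using union-find. It is not the project's implementation (the text above says the full persistent-homology features rely on gudhi); the function name `betti0_curve`, the edge convention (two points are joined when their distance is at most the threshold), and the toy coordinates are assumptions made for this example.

```python
from itertools import combinations
from math import dist


def betti0_curve(points, thresholds):
    """Return beta_0 (connected-component count) at each distance threshold.

    Hypothetical helper for illustration; not part of the package API.
    Thresholds are processed in ascending order.
    """
    # All pairwise edges, sorted by length so they enter in filtration order.
    edges = sorted(
        (dist(points[i], points[j]), i, j)
        for i, j in combinations(range(len(points)), 2)
    )
    parent = list(range(len(points)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    curve = []
    components = len(points)
    edge_idx = 0
    for eps in sorted(thresholds):
        # Add every edge whose length is within the current threshold.
        while edge_idx < len(edges) and edges[edge_idx][0] <= eps:
            _, i, j = edges[edge_idx]
            root_i, root_j = find(i), find(j)
            if root_i != root_j:
                parent[root_i] = root_j
                components -= 1
            edge_idx += 1
        curve.append(components)
    return curve


# Toy coordinates standing in for C-alpha atoms: two tight pairs far apart.
coords = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (10.0, 0.0, 0.0), (11.0, 0.0, 0.0)]
print(betti0_curve(coords, [0.5, 1.5, 9.5]))  # [4, 2, 1]
```

Higher Betti numbers (β₁ loops, β₂ voids) need a simplicial-complex library and are not covered by this sketch.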

It targets mcp-spec 2025-06-18 and runs on Python 3.10–3.13.

What this is not

It is not a hosted service or a SaaS.
It is not certified for any regulated use (HIPAA, GxP, 21 CFR Part 11, FedRAMP, FIPS, SOC 2). The code structures audit logging in a way that could later support such a certification, but no such audit has been performed.
It does not train, fine-tune, or publish AlphaFold models — it consumes AlphaFold DB's public REST API.
The "ACMG/AMP criteria" that generate_variant_clinical_report produces are a draft surface of the upstream evidence the server can fetch automatically. They are not a substitute for clinical-laboratory variant review.
The "druggability tier" that assess_target_druggability returns is a heuristic built from drug-precedent counts, Open Targets tractability labels, pLDDT, and gnomAD constraint. It is not a validated prediction.
"Structural distance" between proteins is an L2 distance on length-normalised TDA fingerprint vectors. It measures topological similarity of the Cα point cloud. It is not a sequence similarity, RMSD, optimal-transport Wasserstein distance, or functional-equivalence measure.
The AlphaFold structures consumed here are predicted models with per-residue pLDDT confidence, not experimental structures. Low-pLDDT regions are unreliable; some proteins (BRCA1 among them) are largely low-confidence, and structural inference over those regions should be treated with caution.

For a complete, itemised list of known limitations (with module references, impact, and planned resolution), see LIMITATIONS.md. For the high-level posture — what is engineering-validated vs. what is not yet scientifically validated — see STATUS.md.

Install

From PyPI (recommended)

pip install alphafold-sovereign-mcp

Or run it without installing using uvx:

uvx alphafold-sovereign-mcp

Every release on PyPI is built by the release.yml workflow under OIDC Trusted Publishing and attached to a signed GitHub Release with Sigstore (cosign) signature bundles, a CycloneDX SBOM, and a Zenodo DOI mirror. SLSA L3 build provenance is generated in CI by slsa-github-generator; attaching the attestation to each release is a roadmap item. scripts/replicate.sh checks the published PyPI wheel hash and the presence of the release SBOM and provenance; verifying the cosign signature bundles with cosign verify-blob is not yet wired into the script (roadmap).

From source

git clone https://github.com/smaniches/alphafold-sovereign-mcp
cd alphafold-sovereign-mcp
uv pip install -e .
# With persistent-homology TDA (requires gudhi):
# uv pip install -e ".[tda]"

Verify the install

$ alphafold-sovereign --version
1.4.2
$ alphafold-sovereign --self-test
SELF-TEST PASS - ACMG helpers behave as expected on the BRCA1 c.5266dupC fixture.

If you ran it via uvx without installing, use uvx alphafold-sovereign-mcp --self-test instead (the bare alphafold-sovereign script is only on PATH after a pip/uv install).

--self-test runs fully offline: it checks the deterministic ACMG-evidence helpers (VEP, gnomAD, and AlphaMissense mapped to ACMG criteria) against a built-in BRCA1:c.5266dupC fixture. Returns exit code 0 on PASS, non-zero on FAIL. No network calls, no credentials required.
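
Because the self-test reports its result through the exit code, it can be gated from a script. The following is a minimal sketch of that idea; the helper name `self_test_passes` is hypothetical, and the command shown in the comment assumes the console script is on PATH after a pip/uv install.

```python
import subprocess
import sys


def self_test_passes(command):
    """Run a command and report whether it exited with code 0.

    Returns False when the executable cannot be found at all.
    """
    try:
        result = subprocess.run(
            command, capture_output=True, text=True, check=False
        )
    except FileNotFoundError:
        return False
    return result.returncode == 0


# Intended use (not executed here because it needs the package installed):
#   ok = self_test_passes(["alphafold-sovereign", "--self-test"])
#   sys.exit(0 if ok else 1)
```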

Configure Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "alphafold-sovereign": {
      "command": "alphafold-sovereign-mcp",
      "args": []
    }
  }
}

Restart Claude Desktop and the tools become available in conversations. Try asking, for example: "Triage BRCA1 c.5266dupC" or "Assess EGFR as a drug target". See the examples/ directory for three end-to-end illustrations of what a session looks like.
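
If the config file already lists other servers, the entry has to be merged rather than pasted over the whole file. The sketch below shows one way to do that with the standard library; `register_server` is a hypothetical helper, and the path to claude_desktop_config.json is left as a parameter because its location depends on your platform.

```python
import json
from pathlib import Path


def register_server(config_path, name="alphafold-sovereign",
                    command="alphafold-sovereign-mcp"):
    """Add (or overwrite) one entry under "mcpServers", keeping the others.

    Hypothetical helper for illustration. Assumes the file, if present,
    contains valid JSON.
    """
    path = Path(config_path)
    if path.exists():
        config = json.loads(path.read_text(encoding="utf-8"))
    else:
        config = {}
    servers = config.setdefault("mcpServers", {})
    servers[name] = {"command": command, "args": []}
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

Back up the original file before running anything like this against a real configuration.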

Offline mode

ALPHAFOLD_OFFLINE=1 alphafold-sovereign-mcp

With this variable set, the server refuses outbound HTTP before a socket is opened (raising AirGapError), except to hosts you explicitly allowlist via ALPHAFOLD_ALLOW_HOSTS. The knowledge-graph query and export tools still answer from the local SQLite store. The upstream-querying tools have no local cache and report their source as unavailable; note that the structure tools currently surface this as a "no AlphaFold model" result rather than an explicit offline error.
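
The guard can be pictured as a check that runs before any connection is attempted. This is a sketch of the idea, not the code in clients/_base.py: the `AirGapError` class here is a stand-in, `ensure_outbound_allowed` is a hypothetical name, and the comma-separated format for ALPHAFOLD_ALLOW_HOSTS is an assumption.

```python
import os
from urllib.parse import urlsplit


class AirGapError(RuntimeError):
    """Stand-in for the AirGapError named in the text; the real class may differ."""


def ensure_outbound_allowed(url, environ=None):
    """Raise AirGapError if offline mode forbids contacting the URL's host."""
    env = os.environ if environ is None else environ
    if env.get("ALPHAFOLD_OFFLINE") != "1":
        return  # Online mode: nothing to enforce.
    host = (urlsplit(url).hostname or "").lower()
    # Assumption: the allowlist is a comma-separated list of hostnames.
    allowed = {
        h.strip().lower()
        for h in env.get("ALPHAFOLD_ALLOW_HOSTS", "").split(",")
        if h.strip()
    }
    if host not in allowed:
        raise AirGapError(f"offline mode: refusing request to {host!r}")
```

Calling the check before the HTTP client is invoked is what keeps the refusal ahead of any socket activity.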

Tool inventory

The server exposes 30 MCP tools across four modules. Each tool's input schema is a Pydantic model; results are JSON.

Disease & ontology (`tools/disease.py`)

Tool	What it does
`lookup_disease`	MONDO record + hierarchy + ICD cross-references
`search_diseases`	Full-text MONDO ontology search
`lookup_phenotype`	HPO term + associated diseases
`get_gene_phenotype_profile`	HPO phenotypes + gnomAD constraint for a gene
`get_disease_targets`	Top drug targets for a MONDO disease (Open Targets)
`get_target_diseases`	Top diseases for a UniProt target (Open Targets)
`get_common_disease_targets`	Parallel profiling across curated MONDO diseases
`triage_variant_3d`	HGVS → ClinVar + gnomAD constraint (disease/structure context: pointer notes)
`phenotype_to_structures`	HPO → diseases → OT targets → UniProt IDs
`get_orphan_disease_atlas`	Orphanet → MONDO → HPO + OT targets
`compare_disease_target_overlap`	Jaccard similarity of target sets for two diseases
`resolve_icd10_to_mondo`	ICD-10 code → MONDO disease record
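
For reference, the Jaccard similarity that compare_disease_target_overlap reports is the size of the intersection of two sets divided by the size of their union. A minimal sketch follows; the function name and the example identifiers are illustrative placeholders, not output from Open Targets, and returning 0.0 for two empty sets is a choice made here.

```python
def jaccard_similarity(targets_a, targets_b):
    """Return |A ∩ B| / |A ∪ B| for two collections of target identifiers."""
    a, b = set(targets_a), set(targets_b)
    union = a | b
    if not union:
        return 0.0  # Convention chosen for this sketch when both sets are empty.
    return len(a & b) / len(union)


# Placeholder target sets for two diseases (2 shared out of 4 distinct).
disease_1 = {"TARGET_A", "TARGET_B", "TARGET_C"}
disease_2 = {"TARGET_B", "TARGET_C", "TARGET_D"}
print(jaccard_similarity(disease_1, disease_2))  # 0.5
```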

Precision medicine (`tools/precision_medicine.py`)

Tool	What it does
`generate_variant_clinical_report`	HGVS → multi-source report + draft ACMG/AMP criteria
`assess_target_druggability`	UniProt → HOT/WARM/COLD/NOT_DRUGGABLE tier
`synthesize_protein_dossier`	UniProt → multi-source briefing
`map_disease_drug_landscape`	MONDO → approved drugs + pipeline + ChEMBL phase counts
`classify_variant_acmg`	HGVS → ACMG/AMP criteria checklist (PVS1, PM2, PP3, BP4, BP7, BS1, PP5)
`find_drug_repurposing_candidates`	MONDO → candidates ranked by OT evidence × ChEMBL phase

The ACMG/AMP criteria produced are a draft: they reflect the upstream evidence the server can fetch automatically, and they are not a substitute for clinical-laboratory review.

Structure intelligence (`tools/structure_intelligence.py`)

Tool	What it does
`analyze_structural_confidence`	mean pLDDT + confidence tier + PAE-derived domain boundaries
`compute_topology_fingerprint`	64-dim TDA fingerprint (Betti numbers β₀ β₁ β₂)
`compare_proteins_topologically`	Pairwise L2 fingerprint-distance matrix for 2–10 proteins
`find_evolutionary_structural_shifts`	Cross-species structural divergence (TDA + Ensembl orthologs)
`score_binding_pocket_geometry`	Geometric pocket detection + heuristic druggability index
`detect_intrinsically_disordered`	IDR map (linkers, tails, long IDRs)
`get_protein_structure`	Retrieve an AlphaFold model: metadata, download URLs, optional coordinates
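
To make the distance matrix returned by compare_proteins_topologically concrete, here is a sketch of a pairwise L2 (Euclidean) distance over fingerprint vectors. The function names and the short toy vectors are assumptions; real fingerprints are described above as 64-dimensional, and any length normalisation the server applies is not reproduced here.

```python
from math import sqrt


def l2_distance(u, v):
    """Euclidean distance between two equal-length fingerprint vectors."""
    if len(u) != len(v):
        raise ValueError("fingerprints must have the same dimension")
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def pairwise_distance_matrix(fingerprints):
    """Map each pair of names to the L2 distance between their fingerprints."""
    names = list(fingerprints)
    return {
        a: {b: l2_distance(fingerprints[a], fingerprints[b]) for b in names}
        for a in names
    }


# Toy 2-dimensional vectors keyed by placeholder protein labels.
toy = {"PROT_A": [0.0, 0.0], "PROT_B": [3.0, 4.0]}
print(pairwise_distance_matrix(toy)["PROT_A"]["PROT_B"])  # 5.0
```

A small distance here only says the two vectors are close; as noted under "What this is not", it is not a sequence similarity or RMSD.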

Knowledge graph (`tools/knowledge_graph_tools.py`)

Tool	What it does
`query_variant_database`	Search locally stored variant triage results
`query_protein_database`	Search locally stored protein assessments
`get_knowledge_graph_stats`	Database size, entity counts, last activity
`export_research_dataset`	Export tables to JSON for pandas/ML pipelines
`find_drug_gene_network`	Traverse the local drug–gene–disease graph
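
A traversal such as find_drug_gene_network can be thought of as a join across relationship tables. The sketch below builds a tiny in-memory SQLite database and walks from a drug to diseases through shared proteins. The table names protein_drug and protein_disease come from the architecture notes in this README; the column names, the helper functions, and all rows are hypothetical.

```python
import sqlite3


def build_demo_graph():
    """Create an in-memory store with two toy relationship tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE protein_drug (protein_id TEXT, drug_id TEXT);
        CREATE TABLE protein_disease (protein_id TEXT, disease_id TEXT);
        """
    )
    conn.executemany(
        "INSERT INTO protein_drug VALUES (?, ?)",
        [("PROT_A", "DRUG_X"), ("PROT_B", "DRUG_Y")],
    )
    conn.executemany(
        "INSERT INTO protein_disease VALUES (?, ?)",
        [("PROT_A", "DISEASE_1"), ("PROT_A", "DISEASE_2"), ("PROT_B", "DISEASE_3")],
    )
    return conn


def diseases_reachable_from_drug(conn, drug_id):
    """Return (protein_id, disease_id) pairs linked to a drug via its targets."""
    return conn.execute(
        """
        SELECT pd.protein_id, pdis.disease_id
        FROM protein_drug AS pd
        JOIN protein_disease AS pdis ON pdis.protein_id = pd.protein_id
        WHERE pd.drug_id = ?
        ORDER BY pd.protein_id, pdis.disease_id
        """,
        (drug_id,),
    ).fetchall()


demo = build_demo_graph()
print(diseases_reachable_from_drug(demo, "DRUG_X"))
```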

Example usage

For three documented end-to-end illustrations of a Claude Desktop session against this server — variant triage on BRCA1 c.5266dupC, target characterisation on EGFR, and a drug-discovery walk-through on Imatinib → BCR-ABL → CML — see the examples/ directory. Each example includes the user prompt, the tool calls the model issues, the server's response shape, and the model's paraphrased reply.

Clinical variant report

generate_variant_clinical_report(hgvs="BRCA1:c.181T>G")

The server resolves the HGVS, fetches ClinVar, gnomAD, AlphaMissense (via AlphaFold DB), Open Targets disease evidence, ChEMBL drug data, and Ensembl VEP consequence annotations, and returns a single JSON record with the cross-referenced fields plus the ACMG/AMP criteria that the available evidence supports.

Drug repurposing

find_drug_repurposing_candidates(disease_mondo_id="MONDO:0007739")

Returns drugs whose Open Targets evidence connects them to the disease, ranked by a composite of OT evidence score × the maximum ChEMBL clinical phase reached against the target.
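
The ranking rule described above can be sketched as a sort on the product of the two quantities. The `Candidate` type, its field names, and the example values are hypothetical; how the server breaks ties or handles missing phases is not stated in this README and is not modelled here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    drug: str
    ot_evidence_score: float  # Open Targets evidence score for the disease
    max_chembl_phase: float   # Highest ChEMBL clinical phase reached


def composite_score(candidate):
    """Evidence score multiplied by the maximum clinical phase."""
    return candidate.ot_evidence_score * candidate.max_chembl_phase


def rank_candidates(candidates):
    """Return candidates ordered from highest to lowest composite score."""
    return sorted(candidates, key=composite_score, reverse=True)


# Made-up values purely to show the ordering.
pool = [
    Candidate("DRUG_A", 0.9, 2),
    Candidate("DRUG_B", 0.5, 4),
    Candidate("DRUG_C", 0.8, 1),
]
print([c.drug for c in rank_candidates(pool)])  # ['DRUG_B', 'DRUG_A', 'DRUG_C']
```

Note how a late-phase drug with moderate evidence (DRUG_B) outranks an early-phase drug with strong evidence (DRUG_A) under a multiplicative rule.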

Cross-species structural divergence

find_evolutionary_structural_shifts(
    gene_symbol="ACE2",
    target_species=["mus_musculus", "rhinolophus_ferrumequinum"]
)

For each species, the tool fetches the ortholog (Ensembl) and its AlphaFold structure, computes the TDA fingerprint, and returns the L2 fingerprint distance from the human structure along with sequence identity.

Data sources

Source	What we use	License
AlphaFold DB v6 (EBI/DeepMind)	Structures, pLDDT, PAE, AlphaMissense	CC BY 4.0
MONDO (OLS4)	Disease ontology, ICD cross-refs	CC BY 4.0
HPO (JAX)	Phenotype terms, gene-disease links	HPO license (free for all use)
Open Targets	Disease–target evidence	CC0 1.0 (data)
ClinVar (NCBI)	Variant pathogenicity	Public domain
gnomAD v4	Population allele frequencies	CC0 1.0
DisGeNET	Gene–disease association scores	Free academic tier / commercial (MedBioinformatics)
ChEMBL v37 (EMBL-EBI)	Drug bioactivity, MoA, ADMET	CC BY-SA 3.0
Ensembl (EMBL-EBI)	VEP, orthologs, gene lookup	No restrictions (data); Apache 2.0 (code)

UniProt accessions are used throughout as protein identifiers — they key AlphaFold structures and Open Targets cross-references — but the UniProt API itself is not queried as a data source. Domain (InterPro), Gene Ontology, experimental-structure (RCSB PDB), and tissue-expression (Human Protein Atlas) lookups are not integrated in this release.

See NOTICE for full attributions.

Architecture

clients/_base.py
  ├── Air-gap enforcement (refuses sockets when ALPHAFOLD_OFFLINE=1)
  ├── Token-bucket rate limiting (aiolimiter)
  ├── Exponential backoff with jitter (tenacity)
  ├── Circuit breaker (CLOSED / OPEN / HALF_OPEN)
  └── HTTP/2 transport with connection pooling and keep-alive (httpx)

storage/knowledge_graph.py
  ├── SQLite WAL mode (embedded, ACID)
  ├── 6 entity tables: proteins, variants, diseases, drugs, genes, phenotypes
  ├── 4 relationship tables: protein_disease, protein_drug, variant_disease, gene_phenotype
  ├── tool_invocations audit table (SHA-256 of input + output, timestamps)
  └── Analytical views: variant_summary, drug_landscape

domain/disease.py
  └── Pure Python frozen dataclasses (PathogenicityClass, VariantReport, ...)

domain/druggability.py
  └── Pure, I/O-free target-druggability heuristic (named/cited weights and
      cut-offs, plus confidence / data-completeness / borderline reporting)
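
The three circuit-breaker states listed under clients/_base.py follow a common pattern, sketched below as a small state machine. This is a generic illustration rather than the project's code: the class shape, the default threshold and timeout, and the injectable clock are all assumptions.

```python
import time


class CircuitBreaker:
    """Generic CLOSED / OPEN / HALF_OPEN breaker (illustrative only)."""

    def __init__(self, failure_threshold=3, reset_after=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_after = reset_after
        self._clock = clock
        self._failures = 0
        self._opened_at = None
        self.state = "CLOSED"

    def allow_request(self):
        # After the cool-down, move to HALF_OPEN so probe requests can pass.
        if self.state == "OPEN" and self._clock() - self._opened_at >= self.reset_after:
            self.state = "HALF_OPEN"
        return self.state != "OPEN"

    def record_success(self):
        # Any success closes the breaker and clears the failure count.
        self._failures = 0
        self.state = "CLOSED"

    def record_failure(self):
        self._failures += 1
        # A failed probe, or too many consecutive failures, opens the breaker.
        if self.state == "HALF_OPEN" or self._failures >= self.failure_threshold:
            self.state = "OPEN"
            self._opened_at = self._clock()
```

Passing the clock in as a parameter keeps the timing logic testable without sleeping.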

See ARCHITECTURE.md for the full module map.
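
To show what an audit row with SHA-256 digests of input and output might look like, here is a sketch using only the standard library. The table name tool_invocations comes from the outline above; the column names, the canonical-JSON hashing, and both helper functions are assumptions. The README states that the analysis tools do not write to the store automatically, so this illustrates a manual write, not built-in behaviour.

```python
import hashlib
import json
import sqlite3
from datetime import datetime, timezone


def sha256_of(payload):
    """Hash a JSON-serialisable payload in a key-order-independent way."""
    canonical = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def open_store(path):
    """Open a SQLite file, request WAL journaling, and ensure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA journal_mode=WAL")
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS tool_invocations (
            tool TEXT NOT NULL,
            input_sha256 TEXT NOT NULL,
            output_sha256 TEXT NOT NULL,
            created_at TEXT NOT NULL
        )
        """
    )
    return conn


def record_invocation(conn, tool, tool_input, tool_output):
    """Insert one audit row with digests and a UTC timestamp."""
    conn.execute(
        "INSERT INTO tool_invocations VALUES (?, ?, ?, ?)",
        (
            tool,
            sha256_of(tool_input),
            sha256_of(tool_output),
            datetime.now(timezone.utc).isoformat(),
        ),
    )
    conn.commit()
```

Storing digests rather than full payloads keeps the audit table small while still letting a later run check that the same input produced the same output.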

Testing & quality

Tests: an offline unit-test suite with respx-mocked upstreams (including parametrised expansions); the full suite runs hermetically in under a minute on a laptop.
Coverage on the shipped surface (src/alphafold_sovereign/clients, domain, storage, server, tools): 100% line + branch, every shipped module at 100%.
Lint: ruff (full ruleset). Type checking: mypy --strict on the full source tree.
Security: bandit plus CodeQL security-extended.
Supply chain: CycloneDX SBOM generated in CI from the installed package; scripts/replicate.sh checks the PyPI wheel hash and the presence of the release SBOM and provenance (cosign verify-blob signature-bundle verification is a roadmap item, not yet in the script).

The full CI matrix (Python 3.10, 3.11, 3.12, 3.13 × Ubuntu, macOS) runs on every push. The coverage percentage above is the number a git clone && uv run nox -s cov produces on the current HEAD; if you find a divergence, please open an issue.

Contributing

DCO sign-off required (git commit -s). No copyright assignment. Coverage gate: CI enforces 100% line and branch coverage on the shipped surface (nox -s cov). Full guide: CONTRIBUTING.md.

Related MCP servers by the same author

uniprot-mcp — Model Context Protocol server for UniProt Swiss-Prot and TrEMBL (pip install uniprot-mcp-server).
semantic-scholar-mcp — Semantic Scholar MCP server, 200M+ academic papers (pip install s2-mcp-server).

Citation

Machine-readable metadata: CITATION.cff (GitHub renders a "Cite this repository" button in the sidebar that consumes this file).

@software{maniches_alphafold_sovereign_mcp,
  author    = {Maniches, Santiago},
  title     = {AlphaFold Sovereign MCP},
  year      = {2026},
  version   = {1.4.2},
  url       = {https://github.com/smaniches/alphafold-sovereign-mcp},
  license   = {Apache-2.0},
  orcid     = {0009-0005-6480-1987},
  doi       = {10.5281/zenodo.20134773}
}

When citing results derived from this software, please also cite the upstream data sources (AlphaFold DB, Open Targets, ChEMBL, Ensembl, ClinVar, gnomAD, MONDO, HPO, DisGeNET) according to their own citation requirements.

License

Licensed under the Apache License, Version 2.0. See LICENSE.

Patent reservation: see PATENTS.md. Trademark policy: see TRADEMARKS.md.
