ToolUniverse provides an MCP server that enables AI agents and language models to access a curated ecosystem of scientific tools and capabilities for conducting research tasks. The server exposes tools through multiple interfaces—an MCP protocol for AI agents, a Python SDK for developers, and a CLI for terminal access—allowing users to discover, inspect, and execute scientific tools across domains. It solves the problem of democratizing access to specialized scientific capabilities by centralizing tool discovery and execution for AI-driven research workflows.
AI agent (recommended) — open your AI agent and run:
Read https://aiscientist.tools/setup.md and set up ToolUniverse for me.
The agent will walk you through MCP configuration, API keys, skill installation, and validation.
Add to your MCP config file:
{
"mcpServers": {
"tooluniverse": {
"command": "uvx",
"args": ["--refresh", "tooluniverse"],
"env": {"PYTHONIOENCODING": "utf-8"}
}
}
}
Install agent skills:
npx skills add mims-harvard/ToolUniverse
Python developers — install the SDK:
uv pip install tooluniverse
tu CLI — discover, inspect, run, and test tools from the terminal.
Python SDK — programmatic access for building AI scientist systems.
Click to watch the demo (YouTube) (Bilibili)
ToolUniverse is an ecosystem for creating AI scientist systems from any large language model. Powered by the AI-Tool Interaction Protocol, it standardizes how LLMs identify and call tools, integrating more than 1000 machine learning models, datasets, APIs, and scientific packages for data analysis, knowledge retrieval, and experimental design.
Key features:
tu): Discover, inspect, run, and test tools directly from the terminal — 9 subcommands for interactive and scripted workflows
Building your project with ToolUniverse? Submit via GitHub Pull Request or contact us.
TxAgent: AI Agent for Therapeutic Reasoning [Project] [Paper] [PyPI] [GitHub] [HuggingFace]
TxAgent leverages ToolUniverse's scientific tool ecosystem to solve complex therapeutic reasoning tasks.
Medea: An Omics AI Agent for Therapeutic Discovery [Project] [Paper] [GitHub]
Medea integrates ToolUniverse tools for multi-omics analysis to identify therapeutic targets and predict drug responses across cancer, autoimmune, and other diseases.
Full documentation: zitniklab.hms.harvard.edu/ToolUniverse
tu)Shanghua Gao, the lead creator of this project, is currently on the job market.
Slack · GitHub Issues · Shanghua Gao · Marinka Zitnik
Leaders: Shanghua Gao · Marinka Zitnik
Contributors: Shanghua Gao · Richard Zhu · Pengwei Sui · Zhenglun Kong · Sufian Aldogom · Yepeng Huang · Ayush Noori · Reza Shamji · Krishna Parvataneni · Theodoros Tsiligkaridis · Marinka Zitnik
@article{gao2025democratizingaiscientistsusing,
title={Democratizing AI scientists using ToolUniverse},
author={Shanghua Gao and Richard Zhu and Pengwei Sui and Zhenglun Kong and Sufian Aldogom and Yepeng Huang and Ayush Noori and Reza Shamji and Krishna Parvataneni and Theodoros Tsiligkaridis and Marinka Zitnik},
year={2025},
eprint={2509.23426},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2509.23426},
}
OPENAI_API_KEYsecretOpenAI API key for LLM-based tool finding and embedding features
ANTHROPIC_API_KEYsecretAnthropic API key for Claude-based features
GEMINI_API_KEYsecretGoogle Gemini API key for Gemini-based features
OPENROUTER_API_KEYsecretOpenRouter API key for accessing 100+ LLM models
AZURE_OPENAI_API_KEYsecretAzure OpenAI API key (alternative to OpenAI)
HF_TOKENsecretHuggingFace token for accessing models and datasets
NCBI_API_KEYsecretNCBI/PubMed API key for higher rate limits (10 req/sec vs 3 req/sec). Register at https://www.ncbi.nlm.nih.gov/account/
SEMANTIC_SCHOLAR_API_KEYsecretSemantic Scholar API key for higher rate limits (100 req/sec vs 1 req/sec). Register at https://www.semanticscholar.org/product/api
BIOGRID_API_KEYsecretBioGRID API key for protein-protein interaction queries. Register free at https://webservice.thebiogrid.org/
DISGENET_API_KEYsecretDisGeNET API key for gene-disease association data. Register free at https://www.disgenet.org/
OMIM_API_KEYsecretOMIM API key for Mendelian disease data. Register at https://omim.org/api
ONCOKB_API_TOKENsecretOncoKB API token for precision oncology annotations. Register at https://www.oncokb.org/apiAccess
NVIDIA_API_KEYsecretNVIDIA NIM API key for AlphaFold2 structure prediction, molecular docking, and genomics tools. Get key at https://build.nvidia.com
USPTO_API_KEYsecretUSPTO API key for patent data access. Register at https://developer.uspto.gov/
UMLS_API_KEYsecretUMLS API key for medical terminology and concept mapping. Register at https://uts.nlm.nih.gov/uts/
BRENDA_EMAILBRENDA enzyme database login email. Register at https://www.brenda-enzymes.org/
BRENDA_PASSWORDsecretBRENDA enzyme database login password