CLAUDE CODE MARKETPLACES
SkillsMarketplacesMCPDigestLearnAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Web & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web CrawlingAutomation & Workflows
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Claude Code Marketplaces

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Learn
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic
  1. Skills
  2. /
  3. vercel-labs
  4. /
  5. vercel-plugin
  6. /
  7. Benchmark Sandbox

Benchmark Sandbox

Editor's Note

This runs your vercel-plugin benchmark scenarios inside ephemeral Firecracker microVMs instead of local terminals. Each sandbox gets a fresh Claude Code install, runs a three-phase pipeline (build, verify with agent-browser, deploy to Vercel), and scores each phase with structured JSON via a separate Haiku pass. The architecture is battle-tested: snapshots preserve both files and npm globals, port 3000 maps to a public URL at creation time, and extendTimeout keeps sandboxes alive overnight if you need it. You get concurrency control, per-phase timeouts, and the ability to load scenarios from JSON instead of hardcoding them. The real win is isolating each eval run so one broken scenario doesn't poison your local environment, plus you can parallelize up to 10 at once.

Install

npx skills add https://github.com/vercel-labs/vercel-plugin --skill benchmark-sandbox
Votes
0
Installs181
GitHub Stars183
Categories
DevOps & CI/CDCloud & Infrastructure
First SeenJun 3, 2026
View on GitHub

Comments

Login to comment

Related DevOps & CI/CD Skills

View all →
monitoring-observability

ahmedasmar/devops-claude-skills

0
391
165
monitoring observability
monitoring-observability

supercent-io/skills-template

1
11k
88
Comprehensive monitoring setup with metrics collection, log aggregation, alerting, and health checks.
ci-cd-pipeline-builder

alirezarezvani/claude-skills

0
544
16.9k
ci cd pipeline builder
infrastructure-monitoring

aj-geddes/useful-ai-prompts

0
362
245
infrastructure monitoring
observability-monitoring

andrelandgraf/fullstackrecipes

0
312
17
observability monitoring
observability-monitoring-monitor-setup

sickn33/antigravity-awesome-skills

0
262
39.4k
observability monitoring monitor setup