CLAUDE CODE MARKETPLACES
Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic
Advanced Evaluation

Editor's Note

This skill collects patterns for using LLMs to evaluate other LLM outputs, which is trickier than it sounds. It covers direct scoring versus pairwise comparison (pairwise is more reliable for subjective judgments) and the biases that skew results: position bias, length bias, and self-enhancement bias when models grade their own outputs. The decision framework is simple: if there's ground truth, use direct scoring; if the judgment is subjective, use pairwise comparison with position swapping. It's most useful when you're building eval pipelines or trying to figure out why your automated scoring keeps giving weird results. The bias mitigation table alone will save you debugging time.
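To make the decision framework concrete, here is a minimal Python sketch of pairwise comparison with position swapping. It is not taken from the skill itself: the judge signature, the function names, and the toy judges are all hypothetical stand-ins, and a real pipeline would replace the toy judges with an actual LLM call. The idea is to ask the judge twice with the responses in opposite order and accept a winner only when both orderings agree.

```python
from typing import Callable

# Hypothetical judge signature (an assumption, not the skill's API):
# takes (prompt, first_response, second_response) and returns
# "first", "second", or "tie" for whichever position it prefers.
Judge = Callable[[str, str, str], str]


def choose_method(has_ground_truth: bool) -> str:
    """Decision rule from the note: ground truth -> direct scoring,
    subjective -> pairwise comparison with position swapping."""
    return "direct_scoring" if has_ground_truth else "pairwise_with_position_swap"


def pairwise_with_swap(judge: Judge, prompt: str, response_a: str, response_b: str) -> str:
    """Run the judge in both orders and return "A", "B", or "tie".

    A winner is reported only if the same response wins in both
    orderings; disagreement is treated as a tie, since it suggests
    the verdict depended on position rather than content.
    """
    forward = judge(prompt, response_a, response_b)   # A shown first
    backward = judge(prompt, response_b, response_a)  # B shown first

    # Translate positional verdicts back to response labels.
    forward_label = {"first": "A", "second": "B", "tie": "tie"}[forward]
    backward_label = {"first": "B", "second": "A", "tie": "tie"}[backward]

    return forward_label if forward_label == backward_label else "tie"


# Toy judges for illustration only (stand-ins for a real LLM call).
def always_first(prompt: str, first: str, second: str) -> str:
    """A fully position-biased judge: always prefers the first slot."""
    return "first"


def prefers_explanation(prompt: str, first: str, second: str) -> str:
    """A content-based judge: prefers the response containing 'because'."""
    has_first, has_second = "because" in first, "because" in second
    if has_first == has_second:
        return "tie"
    return "first" if has_first else "second"


if __name__ == "__main__":
    a, b = "Yes.", "Yes, because the cache is stale."
    print(pairwise_with_swap(always_first, "Should we retry?", a, b))
    print(pairwise_with_swap(prefers_explanation, "Should we retry?", a, b))
```

With the position-biased toy judge the two orderings disagree, so the result collapses to a tie; with the content-based toy judge the same response wins in both orders and is reported as the winner. Treating disagreement as a tie is one possible policy; averaging scores across both orders is another.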

Install

npx skills add https://github.com/shipshitdev/library --skill advanced-evaluation
Votes: 0
Installs: 114
GitHub Stars: 24
Categories: Release Management
First Seen: Jun 3, 2026
