CLAUDE CODE MARKETPLACES

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic
SimPO Training

Editor's Note

SimPO is a reference-free preference optimization method: it scores a response by its length-normalized (average per-token) log-probability, so no reference model is needed during training. Its authors report gains of up to 6.4 points over DPO on AlpacaEval 2.0. You use it when you have preference pairs (chosen/rejected responses) and want simpler, more memory-efficient alignment than DPO or PPO. The reference implementation builds on Hugging Face's alignment-handbook and requires careful tuning of the learning rate (3e-7 to 1e-6) and beta (2.0 to 10.0). It works on single-node setups with DeepSpeed ZeRO-3, making it accessible if you don't have massive distributed infrastructure. The honest take: if DPO is your baseline for preference alignment, SimPO can give you better results with less complexity, though you still need to babysit hyperparameters to avoid divergence.
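
To make the reference-free idea concrete, here is a minimal sketch of the per-pair SimPO objective as formulated in the SimPO paper: the implicit reward is beta times the average per-token log-probability of a response, and the loss pushes the chosen reward above the rejected one by a target margin gamma. The function name, argument names, and default values are illustrative assumptions, not the skill's or alignment-handbook's API, and gamma is a hyperparameter from the paper that the note above does not cover. Pure Python is used so the arithmetic is easy to check; a real training loop would compute this on batched tensors.

```python
import math


def simpo_loss(chosen_token_logps, rejected_token_logps, beta=2.0, gamma=1.0):
    """Hypothetical helper: SimPO loss for a single preference pair.

    Each argument is a list of per-token log-probabilities that the policy
    assigns to the chosen / rejected response. No reference model is involved.
    beta and gamma defaults are illustrative, not recommended settings.
    """
    # Length-normalized implicit reward: beta * average token log-probability.
    reward_chosen = beta * sum(chosen_token_logps) / len(chosen_token_logps)
    reward_rejected = beta * sum(rejected_token_logps) / len(rejected_token_logps)

    # The chosen reward should exceed the rejected reward by at least gamma.
    margin = reward_chosen - reward_rejected - gamma

    # -log(sigmoid(margin)), written as softplus(-margin) for numerical stability.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Example pair: the policy already prefers the chosen response.
good = simpo_loss([-0.5, -0.5], [-2.0, -2.0, -2.0])
# Same pair with the labels swapped: the loss should be much larger.
bad = simpo_loss([-2.0, -2.0, -2.0], [-0.5, -0.5])
print(good, bad)
```

Because the rewards are averages rather than sums, a longer response does not get a lower (or higher) reward just for its length, which is the design choice that lets SimPO drop the reference model.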

Install

npx skills add https://github.com/orchestra-research/ai-research-skills --skill simpo-training

Votes: 0
Installs: 248
GitHub Stars: 9.2k
Categories: AI & Agent Building, Data Science & ML
First Seen: Jun 3, 2026

Related AI & Agent Building Skills

  • agentica-prompts (parcadei/continuous-claude-v3): 0 votes, 398 installs, 3.8k stars
  • llm-application-dev-langchain-agent (sickn33/antigravity-awesome-skills): 0 votes, 306 installs, 39.4k stars
  • agentic-eval (github/awesome-copilot): 0 votes, 9.4k installs, 34.3k stars. Iterative evaluation and refinement patterns for improving AI agent outputs through self-critique loops.
  • ai-prompt-engineering-safety-review (github/awesome-copilot): 0 votes, 9.4k installs, 34.3k stars. Comprehensive safety analysis and improvement framework for AI prompts with detailed assessment methodologies.
  • emblem-ai-prompt-examples (emblemcompany/agent-skills): 0 votes, 8.7k installs, 10 stars
  • finalize-agent-prompt (github/awesome-copilot): 0 votes, 8.6k installs, 34.3k stars. Polish and refine agent prompt files against proven best practices.