Finetuning

1.1k starsApache-2.0

Summary

Reach for this when you're choosing between SFT, LoRA, DPO, GRPO, or any other post-training method, or when debugging why your training run isn't moving the needle. It starts with a decision tree keyed to reward shape: verifiable benchmarks get RL, preference pairs get DPO, demonstrations get SFT. The diagnostics are practical: stuck at zero on a verifiable task means you picked the wrong technique class, not that you need to tune hyperparameters. It enforces smoke runs on ten examples before you burn GPU budget, builds mid-training eval and early stopping directly into scripts, and caps retries at one for training-scale experiments. The literature brief pattern via evo:ideator is smart, most teams skip that step and waste time rediscovering what already works for their model family.

Install to Claude Code

npx -y skills add evo-hq/evo --skill finetuning --agent claude-code

Installs into .claude/skills of the current project.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

AI notepad for back-to-back meetings

Notes, actions and memory. Without a meeting bot. First month 100% off.

Download for free →

Advertise on claudemarketplaces.com

Show your product to 350K+ AI developers monthly. (Empty days caused by temporary data issue)

Try for a month →

Give your AI the whole web as clean markdown

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

belt - the only tool your agent needs

belt cli automatically finds the best tools and skills for your agent. image, video, music, tts...

one prompt install →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

AI notepad for back-to-back meetings

Notes, actions and memory. Without a meeting bot. First month 100% off.

Download for free →

Advertise on claudemarketplaces.com

Show your product to 350K+ AI developers monthly. (Empty days caused by temporary data issue)

Try for a month →

Give your AI the whole web as clean markdown

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

belt - the only tool your agent needs

belt cli automatically finds the best tools and skills for your agent. image, video, music, tts...

one prompt install →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Files

SKILL.md

Select a file.

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

AI notepad for back-to-back meetings

Notes, actions and memory. Without a meeting bot. First month 100% off.

Download for free →

Advertise on claudemarketplaces.com

Show your product to 350K+ AI developers monthly. (Empty days caused by temporary data issue)

Try for a month →

Give your AI the whole web as clean markdown

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

belt - the only tool your agent needs

belt cli automatically finds the best tools and skills for your agent. image, video, music, tts...

one prompt install →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Finetuning

Install to Claude Code

Finetuning

Install to Claude Code

Recommended

Recommended