CLAUDE CODE MARKETPLACES
SkillsMarketplacesMCPDigestLearnAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Web & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web CrawlingAutomation & Workflows
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Claude Code Marketplaces

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Learn
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic
  1. Skills
  2. /
  3. orchestra-research
  4. /
  5. ai-research-skills
  6. /
  7. Speculative Decoding

Speculative Decoding

Editor's Note

This covers three main approaches to making LLMs generate text faster without quality loss: classic speculative decoding with a small draft model that guesses tokens for a large target model to verify in parallel (1.5-2× speedup), Medusa which adds extra prediction heads to guess multiple future tokens at once using tree-based attention (2.3-3.6× speedup), and Lookahead Decoding which reformulates generation as parallel Jacobi iteration over n-grams. The implementations are practical, with working code for transformers' assisted generation, the Medusa library, and lookahead parameters. Best when you need lower latency for chatbots or code completion and have the memory to load draft models or train lightweight heads. The math checks out and these actually work in production.

Install

npx skills add https://github.com/orchestra-research/ai-research-skills --skill speculative-decoding
Votes
0
Installs250
GitHub Stars9.2k
Categories
Testing & QAAI & Agent BuildingData Science & ML
First SeenJun 3, 2026
View on GitHub

Comments

Login to comment

Related Testing & QA Skills

View all →
playwright-e2e-testing

bobmatnyc/claude-mpm-skills

0
2.7k
49
playwright e2e testing
qa-testing-playwright

vasilyu1983/ai-agents-public

0
423
60
qa testing playwright
playwright-e2e-testing

fugazi/test-automation-skills-agents

0
306
156
playwright e2e testing
e2e-testing-patterns

wshobson/agents

0
17.1k
36.2k
Comprehensive guide to building reliable, maintainable end-to-end test suites with Playwright and Cypress.
e2e-testing

affaan-m/everything-claude-code

0
5.1k
202.7k
e2e testing
typescript-e2e-testing

bmad-labs/skills

0
1.9k
9
typescript e2e testing