CLAUDE CODE MARKETPLACES
SkillsMarketplacesMCPDigestLearnAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Web & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web CrawlingAutomation & Workflows
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Claude Code Marketplaces

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Learn
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic
  1. Skills
  2. /
  3. huggingface
  4. /
  5. kernels
  6. /
  7. Cuda Kernels

Cuda Kernels

Editor's Note

This walks you through writing optimized CUDA kernels for HuggingFace diffusers and transformers on H100, A100, and T4 GPUs. You get kernel templates for RMSNorm, attention, RoPE, and activation functions, plus benchmarking scripts that compare against PyTorch baselines. The included RMSNorm kernel hits 2.67x speedup in microbenchmarks but only 6% end-to-end improvement in LTX-Video generation because normalization is a small fraction of total compute. Worth using if you're already profiling and know which kernels are your bottleneck. The skill includes working examples for both diffusers and transformers integration, architecture-specific optimization guides, and support for loading pre-compiled kernels from HuggingFace Hub to skip local compilation entirely.

Install

npx skills add https://github.com/huggingface/kernels --skill cuda-kernels
Votes
0
Installs130
GitHub Stars674
First SeenJun 3, 2026
View on GitHub

Comments

Login to comment