
Flashkda Delta Attention

Editor's Note

This is a CUDA kernel implementation of Kimi Delta Attention (KDA), built on CUTLASS and targeting H100-class GPUs. It is designed as a drop-in backend for flash-linear-attention's chunk_kda operation and handles the recurrent state updates at the core of delta attention. It requires SM90+ hardware and CUDA 12.9. The implementation is opinionated: it supports only 128-dimensional keys and values, and only bfloat16 precision. It also supports variable-length (varlen) batching, which matters for production inference where multiple sequences are packed into a single batch. If you already use flash-linear-attention for KDA models and have the required hardware, it should plug in as a faster backend.
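To make "recurrent state updates" and "varlen batching" concrete, here is a minimal pure-Python sketch. It is not the kernel's API or code. It assumes the commonly cited gated delta-rule form S_t = (I - beta_t k_t k_t^T) Diag(alpha_t) S_{t-1} + beta_t k_t v_t^T with output o_t = S_t^T q_t, and the usual cumulative-offset convention (called `cu_seqlens` here) for packed sequences. The function names `pack_varlen`, `delta_step`, and `run_varlen` are hypothetical, and a real kernel would process chunks of tokens in fused bfloat16 operations rather than one token at a time.

```python
from itertools import accumulate


def pack_varlen(seqs):
    """Concatenate variable-length sequences into one flat list.

    Returns (packed, cu_seqlens), where cu_seqlens[i]:cu_seqlens[i + 1]
    is the slice of `packed` that belongs to sequence i.
    """
    cu_seqlens = [0, *accumulate(len(s) for s in seqs)]
    packed = [tok for s in seqs for tok in s]
    return packed, cu_seqlens


def delta_step(S, q, k, v, beta, alpha):
    """One token of the assumed gated delta rule (illustrative only).

    S is a d_k x d_v state matrix stored as nested lists.
    """
    dk, dv = len(k), len(v)
    # Per-channel decay: row i of the state is scaled by alpha[i].
    S = [[alpha[i] * S[i][j] for j in range(dv)] for i in range(dk)]
    # What the decayed state currently predicts for this key: k^T S.
    pred = [sum(k[i] * S[i][j] for i in range(dk)) for j in range(dv)]
    # Delta-rule correction: move the prediction toward v by a factor beta.
    S = [[S[i][j] + beta * k[i] * (v[j] - pred[j]) for j in range(dv)]
         for i in range(dk)]
    # Read out with the query: o = S^T q.
    out = [sum(q[i] * S[i][j] for i in range(dk)) for j in range(dv)]
    return S, out


def run_varlen(packed, cu_seqlens, dk, dv):
    """Run the recurrence over a packed batch, resetting state per sequence."""
    outs = []
    for start, end in zip(cu_seqlens, cu_seqlens[1:]):
        S = [[0.0] * dv for _ in range(dk)]  # fresh state at each boundary
        for t in range(start, end):
            q, k, v, beta, alpha = packed[t]
            S, o = delta_step(S, q, k, v, beta, alpha)
            outs.append(o)
    return outs


# Toy usage with d_k = d_v = 1 (the real kernel supports only 128).
tok_a = ([1.0], [1.0], [2.0], 1.0, [1.0])  # (q, k, v, beta, alpha)
tok_b = ([1.0], [1.0], [5.0], 1.0, [1.0])
packed, cu_seqlens = pack_varlen([[tok_a, tok_b], [tok_a]])
outputs = run_varlen(packed, cu_seqlens, dk=1, dv=1)
```

The key point of the varlen layout is that state never leaks across sequence boundaries: the third token above starts from a zero state even though it sits directly after the second one in the packed buffer.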

Install

npx skills add https://github.com/aradotso/trending-skills --skill flashkda-delta-attention
Votes: 0
Installs: 261
GitHub Stars: 26
First Seen: Jun 3, 2026