CLAUDE CODE MARKETPLACES

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

Turboquant Pytorch

Editor's Note

This is a from-scratch PyTorch implementation of Google's TurboQuant algorithm, which compresses LLM key-value caches to 2-4 bits per coordinate. It uses two-stage vector quantization: the first stage applies a random rotation followed by Lloyd-Max scalar quantization, and the second stage applies a QJL correction to the residual so that inner-product estimates are unbiased. The key point is that it preserves attention scores rather than the fidelity of individual vectors. At 3 bits you get 5x compression with 99.5% attention accuracy on Qwen2.5-3B, which is the practical sweet spot. Use it when you're running long-context inference and need to fit more tokens in VRAM. The repo includes both synthetic tests and real-model validation, plus production compressors that compute attention scores directly from compressed keys without decompressing them.
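
To make the two-stage idea concrete, here is a minimal sketch of it. It is written in NumPy rather than PyTorch to stay dependency-light, and it is not the repo's API: every function name below (random_rotation, lloyd_max_levels, assign, compress_key, estimate_score) is hypothetical. It assumes unit-norm keys and queries, a codebook fitted to Gaussian samples with variance 1/d (an approximation of what rotated unit-vector coordinates look like), and a dense Gaussian matrix for the QJL (Quantized Johnson-Lindenstrauss) sign sketch. The demo averages the estimate over many sketch draws to illustrate that the corrected score is unbiased.

```python
import numpy as np

def random_rotation(d, rng):
    """Random orthogonal matrix from the QR decomposition of a Gaussian matrix."""
    q, r = np.linalg.qr(rng.standard_normal((d, d)))
    return q * np.sign(np.diag(r))  # sign fix makes the draw uniform over rotations

def assign(x, levels):
    """Index of the nearest level for each entry (levels must be sorted)."""
    edges = (levels[:-1] + levels[1:]) / 2
    return np.searchsorted(edges, x)

def lloyd_max_levels(samples, bits, iters=30):
    """1-D Lloyd-Max: alternate nearest-level assignment and centroid update."""
    n_levels = 2 ** bits
    levels = np.quantile(samples, (np.arange(n_levels) + 0.5) / n_levels)
    for _ in range(iters):
        idx = assign(samples, levels)
        for j in range(n_levels):
            if np.any(idx == j):
                levels[j] = samples[idx == j].mean()
    return levels

def compress_key(k, rot, levels, sketch):
    """Stage 1: rotate and scalar-quantize. Stage 2: 1-bit sign sketch of the residual."""
    idx = assign(rot @ k, levels)
    k_hat = rot.T @ levels[idx]          # dequantized key, back in the original basis
    resid = k - k_hat
    return idx, np.sign(sketch @ resid), np.linalg.norm(resid)

def estimate_score(q, idx, signs, resid_norm, rot, levels, sketch):
    """Estimate <q, k> from the compressed key without reconstructing the residual."""
    stage1 = (rot @ q) @ levels[idx]     # <q, k_hat>, computed in the rotated basis
    m = sketch.shape[0]
    # For Gaussian rows s: E[sign(s.r) * (s.q)] = sqrt(2/pi) * <q, r> / ||r||,
    # so rescaling by sqrt(pi/2) * ||r|| gives an unbiased estimate of <q, r>.
    correction = np.sqrt(np.pi / 2) * resid_norm / m * ((sketch @ q) @ signs)
    return stage1 + correction

rng = np.random.default_rng(0)
d, bits = 64, 2                          # illustrative sizes, not the repo's defaults
rot = random_rotation(d, rng)
levels = lloyd_max_levels(rng.standard_normal(20000) / np.sqrt(d), bits)

k = rng.standard_normal(d)
k /= np.linalg.norm(k)
q = rng.standard_normal(d)
q /= np.linalg.norm(q)

estimates = []
for _ in range(2000):                    # fresh sketch each trial to average out its noise
    sketch = rng.standard_normal((d, d))
    code = compress_key(k, rot, levels, sketch)
    estimates.append(estimate_score(q, *code, rot, levels, sketch))

true_score = float(q @ k)
mean_estimate = float(np.mean(estimates))
print(f"true score {true_score:.4f}, mean corrected estimate {mean_estimate:.4f}")
```

In this sketch each key costs `bits` bits per coordinate for the level indices, one more bit per coordinate for the residual signs, and one scalar for the residual norm. A single draw of the sketch gives a noisy score; only the average over draws converges to the true inner product, which is the sense in which the estimate is unbiased.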

Install

npx skills add https://github.com/aradotso/trending-skills --skill turboquant-pytorch
Votes: 0
Installs: 642
GitHub Stars: 7
Categories: Frontend Development, Testing & QA, AI & Agent Building, Data Science & ML, Release Management, Productivity & Planning, Finance & Trading
First Seen: May 16, 2026

Related Frontend Development Skills

frontend-design

anthropics/skills

10
418.1k
135.1k
Distinctive, production-grade frontend interfaces that reject generic AI aesthetics.
vercel-react-best-practices

vercel-labs/agent-skills

5
402.7k
26.6k
3
React and Next.js performance optimization guide with 64 prioritized rules across 8 categories.
remotion-best-practices

remotion-dev/skills

0
312.3k
3.2k
Domain-specific knowledge base for building videos with Remotion and React.
vercel-composition-patterns

vercel-labs/agent-skills

0
175.4k
26.6k
React composition patterns for scaling components and avoiding boolean prop proliferation.
ui-ux-pro-max

nextlevelbuilder/ui-ux-pro-max-skill

4
167k
79k
Comprehensive design intelligence for web and mobile UI/UX across 10 technology stacks.
shadcn

shadcn/ui

0
143.8k
114.5k
Complete shadcn/ui component management for adding, searching, fixing, styling, and composing UI.