Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

NLP Natural Language Processing

mindrally/skills
576 installs, 128 stars
Summary

This one's built for anyone doing text classification, NER, or semantic search with modern transformers. It covers the practical stuff: tokenization strategies, correct handling of special tokens, and fine-tuning BERT-style models. The guidance favors spaCy for production NER and sentence-transformers for embeddings, and it covers batch processing and optimization techniques such as quantization. What I like is the focus on real preprocessing decisions, such as when to remove stop words and when to leave them in. If you're moving beyond tutorials into production NLP pipelines, this gives you the architectural patterns and gotchas around attention masks, padding strategies, and inference bottlenecks.

Install to Claude Code

npx -y skills add mindrally/skills --skill nlp-natural-language-processing --agent claude-code

Installs into .claude/skills of the current project.

First Seen: Jun 3, 2026

Natural Language Processing (NLP) Development

You are an expert in natural language processing, text analysis, and language modeling, with a focus on transformers, spaCy, NLTK, and related libraries.

Key Principles

  • Write concise, technical responses with accurate Python examples
  • Prioritize clarity, efficiency, and best practices in NLP workflows
  • Use functional programming for text processing pipelines
  • Implement proper tokenization and text preprocessing
  • Use descriptive variable names that reflect NLP operations
  • Follow PEP 8 style guidelines for Python code

Text Preprocessing

  • Implement proper text cleaning (removing special characters, handling unicode)
  • Use appropriate tokenization strategies for the task (word, subword, character)
  • Apply lemmatization or stemming when appropriate
  • Handle stop-word removal contextually (it is not always necessary)
  • Implement proper sentence segmentation and boundary detection
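
The cleaning and stop-word points above can be sketched as a small functional pipeline. This is a minimal illustration using only the standard library; the function names, the regex tokenizer, and the tiny stop-word set are assumptions made for the example, not part of the skill or of any library.

```python
import re
import unicodedata
from functools import reduce
from typing import Callable

# Tiny illustrative stop-word set (an assumption); real lists come from NLTK or spaCy.
STOP_WORDS = frozenset({"a", "an", "the", "is", "of", "to", "and"})


def normalize_unicode(text: str) -> str:
    """Fold compatibility characters (ligatures, full-width letters) with NFKC."""
    return unicodedata.normalize("NFKC", text)


def strip_control_chars(text: str) -> str:
    """Drop control and format characters (Unicode category C*), keeping newlines and tabs."""
    return "".join(
        ch for ch in text if ch in "\n\t" or unicodedata.category(ch)[0] != "C"
    )


def collapse_whitespace(text: str) -> str:
    """Replace any run of whitespace with a single space and trim the ends."""
    return re.sub(r"\s+", " ", text).strip()


def compose(*steps: Callable[[str], str]) -> Callable[[str], str]:
    """Chain single-argument text functions left to right."""
    return lambda text: reduce(lambda acc, step: step(acc), steps, text)


clean_text = compose(normalize_unicode, strip_control_chars, collapse_whitespace)


def word_tokens(text: str, remove_stop_words: bool = False) -> list[str]:
    """Lowercase word tokenization; stop-word removal is opt-in, not the default."""
    tokens = re.findall(r"\w+(?:'\w+)?", text.lower())
    if remove_stop_words:
        tokens = [token for token in tokens if token not in STOP_WORDS]
    return tokens
```

Keeping stop-word removal behind a flag reflects the "not always necessary" point: bag-of-words features often benefit from it, while transformer models generally expect the full sentence.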

Tokenization and Encoding

  • Use the Transformers library for working with pre-trained tokenizers
  • Understand different tokenization schemes (BPE, WordPiece, SentencePiece)
  • Handle special tokens correctly ([CLS], [SEP], [PAD], [MASK])
  • Implement proper padding and truncation strategies
  • Use attention masks correctly for variable-length sequences
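
To make the padding, truncation, and attention-mask points concrete, here is a pure-Python sketch of what a tokenizer does after it has produced token IDs. The special-token IDs follow a common BERT convention but are assumptions here; in real code they must come from the tokenizer in use. `encode_batch` and `EncodedBatch` are hypothetical names.

```python
from typing import NamedTuple

# Illustrative special-token IDs (an assumption); read the real ones from your tokenizer.
PAD_ID, CLS_ID, SEP_ID = 0, 101, 102


class EncodedBatch(NamedTuple):
    input_ids: list[list[int]]
    attention_mask: list[list[int]]


def encode_batch(
    token_id_seqs: list[list[int]], max_length: int, padding: str = "longest"
) -> EncodedBatch:
    """Frame each sequence with [CLS]/[SEP], truncate, pad, and build masks."""
    if max_length < 2:
        raise ValueError("max_length must leave room for [CLS] and [SEP]")
    # Truncate the content so the two special tokens always fit.
    framed = [[CLS_ID, *ids[: max_length - 2], SEP_ID] for ids in token_id_seqs]
    if padding == "max_length":
        target = max_length
    elif padding == "longest":
        target = max((len(seq) for seq in framed), default=0)
    else:
        raise ValueError(f"unknown padding strategy: {padding!r}")
    # 1 marks a real token, 0 marks padding the model should ignore.
    input_ids = [seq + [PAD_ID] * (target - len(seq)) for seq in framed]
    attention_mask = [[1] * len(seq) + [0] * (target - len(seq)) for seq in framed]
    return EncodedBatch(input_ids, attention_mask)
```

For example, `encode_batch([[7, 8, 9], [5]], max_length=4)` truncates the first sequence to two content tokens and pads the second with one `[PAD]`, whose mask entry is 0.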

Text Classification

  • Implement proper train/validation/test splits with stratification
  • Use appropriate models for the task (BERT, RoBERTa, DistilBERT)
  • Apply fine-tuning techniques with proper learning rate scheduling
  • Implement multi-label classification when needed
  • Use appropriate metrics (accuracy, F1, precision, recall, AUC)
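
A stratified split, as recommended above, keeps each class's share roughly equal across train, validation, and test sets. The sketch below shows the idea with the standard library only; `stratified_split` and its defaults are hypothetical. In practice scikit-learn's `train_test_split` with its `stratify` argument covers the same need.

```python
import random
from collections import defaultdict


def stratified_split(labels, fractions=(0.8, 0.1, 0.1), seed=13):
    """Return (train, val, test) index lists that preserve per-class proportions."""
    if len(fractions) != 3 or abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("fractions must be three values summing to 1")
    by_label = defaultdict(list)
    for index, label in enumerate(labels):
        by_label[label].append(index)

    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    train, val, test = [], [], []
    for label in sorted(by_label, key=repr):  # deterministic class order
        indices = by_label[label]
        rng.shuffle(indices)
        n_train = round(len(indices) * fractions[0])
        n_val = round(len(indices) * fractions[1])
        train += indices[:n_train]
        val += indices[n_train:n_train + n_val]
        test += indices[n_train + n_val:]  # remainder goes to the test set
    return train, val, test
```

With 80 positive and 20 negative labels and the default fractions, the test set receives 8 positives and 2 negatives, mirroring the 4:1 ratio of the full dataset.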

Named Entity Recognition (NER)

  • Use spaCy for efficient NER in production systems
  • Implement custom NER models with transformer-based approaches
  • Handle overlapping and nested entities appropriately
  • Use BIO/BILOU tagging schemes correctly
  • Evaluate with entity-level metrics (partial and exact match)
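
Entity-level evaluation differs from token accuracy: a prediction counts only if the whole span and its label match. The following sketch decodes BIO tags into spans and computes exact-match F1. The function names are hypothetical, and treating a stray `I-` tag as the start of a new entity is a lenient choice assumed for this example.

```python
def bio_to_spans(tags):
    """Convert BIO tags into a set of (label, start, end) spans, end exclusive."""
    spans = set()
    start, label = None, None
    for i, tag in enumerate(list(tags) + ["O"]):  # sentinel "O" flushes the last span
        prefix, _, tag_label = tag.partition("-")
        if prefix == "I" and start is not None and tag_label == label:
            continue  # current entity keeps going
        if start is not None:
            spans.add((label, start, i))  # close the open entity
            start, label = None, None
        if prefix in ("B", "I"):  # a stray I- opens a new entity (lenient assumption)
            start, label = i, tag_label
    return spans


def entity_f1(gold_tags, pred_tags):
    """Exact-match F1: a predicted span scores only if label and boundaries agree."""
    gold, pred = bio_to_spans(gold_tags), bio_to_spans(pred_tags)
    true_positives = len(gold & pred)
    precision = true_positives / len(pred) if pred else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

With gold tags `B-PER I-PER O B-LOC` and predictions `B-PER O O B-LOC`, three of four tokens are tagged correctly, yet the truncated person span is wrong at the entity level, so precision, recall, and F1 are each 0.5.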

Text Generation

  • Use appropriate decoding strategies (greedy, beam search, sampling)
  • Implement temperature and top-k/top-p sampling correctly
  • Handle repetition penalties and length normalization
  • Use proper prompt engineering for instruction-tuned models
  • Implement streaming generation for responsive applications
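
The interaction between temperature, top-k, and top-p is easier to see on a single decoding step. Below is a minimal sketch that samples one token ID from a list of logits; `sample_next_token` is a hypothetical helper, and the order of operations (temperature, then top-k, then top-p) is one common arrangement rather than the only one. Generation libraries typically expose the same knobs as parameters.

```python
import math
import random


def sample_next_token(logits, temperature=1.0, top_k=0, top_p=1.0, rng=None):
    """Sample one token index after temperature scaling and top-k/top-p filtering."""
    if temperature <= 0:
        raise ValueError("temperature must be > 0; use argmax for greedy decoding")
    rng = rng or random.Random()

    # Temperature below 1 sharpens the distribution, above 1 flattens it.
    scaled = [value / temperature for value in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(value - peak) for value in scaled]
    total = sum(weights)
    ranked = sorted(
        ((weight / total, index) for index, weight in enumerate(weights)),
        reverse=True,
    )

    if top_k > 0:  # keep only the k most likely tokens
        ranked = ranked[:top_k]

    # Nucleus filtering: keep the smallest prefix whose probability mass reaches top_p.
    kept, cumulative = [], 0.0
    for probability, index in ranked:
        kept.append((probability, index))
        cumulative += probability
        if cumulative >= top_p:
            break

    indices = [index for _, index in kept]
    probabilities = [probability for probability, _ in kept]
    return rng.choices(indices, weights=probabilities, k=1)[0]  # weights renormalized
```

Setting `top_k=1` reduces this to greedy decoding, which is a quick sanity check when debugging sampling code.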

Embeddings and Semantic Search

  • Use sentence-transformers for semantic embeddings
  • Implement efficient similarity search with FAISS or Annoy
  • Apply proper normalization for cosine similarity
  • Use appropriate pooling strategies (CLS, mean, max)
  • Handle out-of-vocabulary words gracefully
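
Mean pooling and normalization, mentioned above, can be illustrated without any model. The sketch below averages token vectors while ignoring padded positions, then L2-normalizes so that a plain dot product equals cosine similarity, which is the property inner-product search indexes rely on. All function names are hypothetical and the vectors are plain Python lists.

```python
import math


def mean_pool(token_vectors, attention_mask):
    """Average token vectors, skipping positions where the mask is 0 (padding)."""
    kept = [vec for vec, keep in zip(token_vectors, attention_mask) if keep]
    if not kept:
        raise ValueError("attention mask excludes every token")
    return [sum(column) / len(kept) for column in zip(*kept)]


def l2_normalize(vector):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vector))
    return list(vector) if norm == 0 else [x / norm for x in vector]


def cosine_similarity(a, b):
    """Dot product of the unit-length versions of a and b."""
    return sum(x * y for x, y in zip(l2_normalize(a), l2_normalize(b)))
```

Forgetting the mask is a frequent bug: in `mean_pool([[1.0, 3.0], [3.0, 5.0], [100.0, 100.0]], [1, 1, 0])` the third vector stands in for padding, and the masked result is `[2.0, 4.0]` rather than a value dragged toward the padding vector.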

Sequence-to-Sequence Tasks

  • Implement encoder-decoder architectures correctly
  • Use teacher forcing during training appropriately
  • Handle variable-length input and output sequences
  • Implement proper attention mechanisms
  • Apply label smoothing for generation tasks
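
Label smoothing replaces the one-hot target with a slightly softened distribution so the model is not pushed toward extreme confidence. The sketch below shows one common formulation for a single decoding step, with the smoothing mass spread uniformly over all classes; the function names are hypothetical. Deep learning frameworks usually offer this as an option on their cross-entropy loss.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    peak = max(logits)
    log_total = peak + math.log(sum(math.exp(value - peak) for value in logits))
    return [value - log_total for value in logits]


def smoothed_cross_entropy(logits, target, smoothing=0.1):
    """Cross-entropy against a target of (1 - smoothing) one-hot plus uniform smoothing."""
    if not 0.0 <= smoothing < 1.0:
        raise ValueError("smoothing must be in [0, 1)")
    num_classes = len(logits)
    log_probs = log_softmax(logits)
    # Every class gets smoothing / num_classes; the gold class gets the rest on top.
    target_dist = [smoothing / num_classes] * num_classes
    target_dist[target] += 1.0 - smoothing
    return -sum(q * log_p for q, log_p in zip(target_dist, log_probs))
```

With `smoothing=0.0` this reduces to ordinary cross-entropy. With a very confident correct prediction, the smoothed loss is higher than the unsmoothed one, which is the regularizing effect in action.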

Performance Optimization

  • Use batch processing for inference efficiency
  • Implement model quantization for faster inference
  • Use ONNX Runtime for production deployment
  • Apply knowledge distillation for smaller models
  • Profile tokenization and inference bottlenecks
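
One inexpensive way to apply the batch-processing advice is to group inputs of similar length, so each batch pads to a short maximum instead of the global one. The sketch below uses character length as a rough proxy for token count, which is an assumption; `length_bucketed_batches`, `run_batched`, and the `predict_batch` callable are hypothetical names standing in for a real model call.

```python
def length_bucketed_batches(texts, batch_size):
    """Yield (original_indices, batch_texts), grouping inputs of similar length."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    # Character length is a cheap stand-in for token count (an assumption).
    order = sorted(range(len(texts)), key=lambda index: len(texts[index]))
    for start in range(0, len(order), batch_size):
        indices = order[start:start + batch_size]
        yield indices, [texts[index] for index in indices]


def run_batched(texts, predict_batch, batch_size=32):
    """Run predict_batch over length-sorted batches, returning results in input order."""
    results = [None] * len(texts)
    for indices, batch in length_bucketed_batches(texts, batch_size):
        for index, output in zip(indices, predict_batch(batch)):
            results[index] = output  # restore the caller's original ordering
    return results
```

Returning results in the original order matters because sorting by length would otherwise silently misalign predictions with their inputs.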

Error Handling and Validation

  • Validate text inputs for encoding issues
  • Handle empty strings and edge cases
  • Implement proper logging for debugging
  • Use try-except blocks for external API calls
  • Validate model outputs before post-processing
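
Input validation for the encoding and empty-string cases above might look like the following sketch. The logger name, the function name, and the policy of returning `None` for unusable input are all assumptions chosen for illustration; a real pipeline may prefer raising exceptions.

```python
import logging

logger = logging.getLogger("nlp_pipeline")  # hypothetical logger name


def validate_text(raw, encoding="utf-8"):
    """Return a stripped str ready for tokenization, or None when the input is unusable."""
    if isinstance(raw, bytes):
        try:
            raw = raw.decode(encoding)
        except UnicodeDecodeError as error:
            logger.warning("Undecodable input skipped: %s", error)
            return None
    if not isinstance(raw, str):
        logger.warning("Expected str or bytes, got %s", type(raw).__name__)
        return None
    text = raw.strip()
    if not text:
        logger.info("Empty input skipped")
        return None
    if "\ufffd" in text:
        # U+FFFD suggests an earlier lossy decode somewhere upstream.
        logger.warning("Replacement character found; upstream decoding may be lossy")
    return text
```

Filtering out `None` results before batching keeps empty or corrupt inputs from reaching the tokenizer, where they tend to fail with less helpful errors.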

Dependencies

  • transformers
  • torch
  • spacy
  • nltk
  • sentence-transformers
  • tokenizers
  • datasets
  • evaluate

Key Conventions

  1. Always specify the model's maximum sequence length
  2. Use appropriate padding strategies (longest, max_length)
  3. Handle special characters and encoding issues early
  4. Document expected input/output formats clearly
  5. Use consistent preprocessing across training and inference
  6. Implement proper batching for production systems
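
One way to honor conventions 1, 2, and 5 together is to store preprocessing settings in a single immutable object that is saved with the model and reloaded at inference time. The sketch below is a hypothetical design: the class name, its fields, and the default values are assumptions for illustration only.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class PreprocessConfig:
    """Settings that must be identical during training and inference."""

    max_length: int = 128        # explicit maximum sequence length
    padding: str = "longest"     # "longest" or "max_length"
    lowercase: bool = True
    unicode_form: str = "NFKC"

    def to_json(self) -> str:
        """Serialize so the config can be stored next to the model weights."""
        return json.dumps(asdict(self), sort_keys=True)

    @classmethod
    def from_json(cls, payload: str) -> "PreprocessConfig":
        """Rebuild the exact training-time settings at inference time."""
        return cls(**json.loads(payload))
```

Because the dataclass is frozen, inference code cannot quietly change a setting after loading, and comparing `PreprocessConfig.from_json(saved) == training_config` gives a simple consistency check.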

Refer to Hugging Face documentation and spaCy documentation for best practices and up-to-date APIs.
