This is a reference for production LLM integration covering function calling, streaming, local inference with Ollama, and fine-tuning. It provides 20 rules across 7 categories, each with concrete examples: strict mode tool schemas, SSE streaming endpoints, QLoRA training configs, and context optimization patterns. The documentation takes clear positions: it recommends deepseek-r1:70b for local reasoning, a maximum of 5-15 tools per request, and trying prompting before fine-tuning. It is most useful when you're moving beyond basic API calls and need established patterns for tool use, real-time responses, or cost optimization through local models. The quick reference tables and code snippets can be reused directly, so the skill works as practical material rather than only theoretical guidance.
npx skills add https://github.com/yonatangross/orchestkit --skill llm-integration
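
To make the "strict mode tool schemas" and "5-15 tools max per request" points in the description more concrete, here is a minimal sketch in Python. It assumes the OpenAI-style function-tool shape, where `strict` is set to true, every property is listed in `required`, and `additionalProperties` is false. The skill's own examples may differ. The helper names `make_strict_tool` and `check_tool_count` and the `get_weather` tool are hypothetical and used only for illustration.

```python
from typing import Any

# Upper bound taken from the description's "5-15 tools max per request".
MAX_TOOLS = 15


def make_strict_tool(name: str, description: str,
                     properties: dict[str, Any]) -> dict[str, Any]:
    """Build one function-tool definition in the assumed OpenAI-style strict shape."""
    return {
        "type": "function",
        "function": {
            "name": name,
            "description": description,
            "strict": True,
            "parameters": {
                "type": "object",
                "properties": properties,
                # Strict mode expects every property to be listed as required.
                "required": list(properties),
                # Strict mode expects unknown keys to be rejected.
                "additionalProperties": False,
            },
        },
    }


def check_tool_count(tools: list[dict[str, Any]],
                     limit: int = MAX_TOOLS) -> list[dict[str, Any]]:
    """Return the tools unchanged, or raise if the list exceeds the limit."""
    if len(tools) > limit:
        raise ValueError(f"{len(tools)} tools supplied; limit is {limit}")
    return tools


# Hypothetical example tool.
weather_tool = make_strict_tool(
    "get_weather",
    "Look up the current weather for a city.",
    {
        "city": {"type": "string"},
        "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
    },
)
tools = check_tool_count([weather_tool])
```

The resulting `tools` list is the kind of value typically passed as the `tools` argument of a chat completion request. Checking the count before the request keeps an oversized tool list from reaching the model.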