Engineering Data Pipelines

224 starsMIT

Summary

This is a reference guide for building production data pipelines, covering both batch orchestration (Airflow, Dagster, Prefect) and stream processing (Kafka Streams, Flink, Spark Streaming). It gives you checklists for the things that actually matter: idempotency patterns, incremental loads, watermark tuning for out-of-order events, state management in streaming jobs, and layered data quality checks with tools like Great Expectations and dbt tests. The bilingual formatting is a bit quirky, but the technical checklists are solid. Use this when you're setting up ETL workflows or real-time streaming and need a quick reference for best practices around exactly-once semantics, backpressure monitoring, or cross-DAG dependencies without digging through framework docs.

Install to Claude Code

npx -y skills add telagod/code-abyss --skill engineering-data-pipelines --agent claude-code

Installs into .claude/skills of the current project.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

AI notepad for back-to-back meetings

Notes, actions and memory. Without a meeting bot. First month 100% off.

Download for free →

Advertise on claudemarketplaces.com

Show your product to 350K+ AI developers monthly. (Empty days caused by temporary data issue)

Try for a month →

Give your AI the whole web as clean markdown

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

belt - the only tool your agent needs

belt cli automatically finds the best tools and skills for your agent. image, video, music, tts...

one prompt install →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

AI notepad for back-to-back meetings

Notes, actions and memory. Without a meeting bot. First month 100% off.

Download for free →

Advertise on claudemarketplaces.com

Show your product to 350K+ AI developers monthly. (Empty days caused by temporary data issue)

Try for a month →

Give your AI the whole web as clean markdown

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

belt - the only tool your agent needs

belt cli automatically finds the best tools and skills for your agent. image, video, music, tts...

one prompt install →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Files

SKILL.md

Select a file.

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

AI notepad for back-to-back meetings

Notes, actions and memory. Without a meeting bot. First month 100% off.

Download for free →

Advertise on claudemarketplaces.com

Show your product to 350K+ AI developers monthly. (Empty days caused by temporary data issue)

Try for a month →

Give your AI the whole web as clean markdown

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

belt - the only tool your agent needs

belt cli automatically finds the best tools and skills for your agent. image, video, music, tts...

one prompt install →

Email for Agents: Free tier available

Give your AI agent a complete email layer—sending, inbound inboxes, and sandbox testing.

Get 4K emails/month free →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Engineering Data Pipelines

Install to Claude Code

Engineering Data Pipelines

Install to Claude Code

Recommended

Recommended