How to Implement Content-Based Splitting
Transform bundled messages into individual events for precise routing and processing.
The Problem
Data pipelines receive bundled messages preventing granular operations:
- Single HTTP request carries 100 sensor readings
- Batch files contain thousands of transactions
- Log streams deliver multiline stack traces
- No per-event routing, transformation, or metrics
The Solution
Learn 5 content splitting techniques:
- JSON Array Splitting - Split sensor arrays with
unarchiveprocessor and metadata preservation - CSV Batch Splitting - Parse CSV files line-by-line with structured field mapping
- Nested Structure Splitting - Split complex nested JSON while preserving hierarchical context
- Advanced Patterns - Combine splitting with routing, filtering, and intelligent re-batching
- Production Optimization - Size validation, atomic operations, and edge-first processing
Get Started
Choose your path:
Interactive Explorer
See each splitting technique with side-by-side before/after views
Step-by-Step Tutorial
Build the pipeline incrementally:
- Split JSON Arrays
- Split CSV Batches
- Split Nested Structures
- Advanced Patterns
- Production Considerations
Complete Pipeline
Download the production-ready solution