Data Lineage for StreamSets
StreamSets Data Collector is an open-source execution engine for fast data ingestion and light transformations. The engine is designed to execute smart data pipelines for streaming and batch data without hand coding.
MANTA understands pipelines from StreamSets Data Collector and is able to analyze and visualize them. This includes extracting pipelines, resolving individual stages, working with provided runtime values, and processing expression language and database connections.
MANTA currently scans
- Pipelines and their stages
- Database connections