Data Lineage for IBM Infosphere Datastage
IBM InfoSphere DataStage is an ETL platform that integrates data across multiple enterprise systems. The scalable platform provides extended metadata management and enterprise connectivity. It integrates heterogeneous data, including big data at rest (Hadoop-based) and big data in motion (stream-based), on both distributed and mainframe platforms. IBM DataStage applies workload and business rules, and it integrates real-time data in an easy-to-deploy platform.
MANTA analyzes DataStage parallel jobs provided by the user. Using this information, MANTA creates a detailed visualization of the data lineage that can be pushed into any third-party metadata management solution or viewed in MANTA’s native visualization.
MANTA currently scans
- Parallel Jobs
- Parameter Sets
- Operational Metadata