Supported technologies
Data Lineage for Apache Kafka
Apache Kafka is an open-source distributed event streaming platform used for many different use cases such as messaging, website activity tracking, and stream processing.
MANTA can either connect to Confluent Platform Schema Registry and extract the schemas contained in Kafka topics in an automated way
or allow the user to describe the Kafka environment on their own to visualize it and benefit from integrations with other scanners. The Kafka visualization includes objects such as a cluster, topics, schemas, and columns.
Main scanner features
- Metadata extraction from Confluent Platform Schema Registry
- Option to define the elements in Kafka manually by providing a simple JSON file
- Schema definitions in JSON schema and Avro format
- Integrations with DataStage and StreamSets scanners
- Schema definitions using “raw” JSON files or payloads
What you can look forward to
- Exports to third-party tools
- HTTPs support for extraction from Confluent Schema Registry
- Support for the different naming strategies used in Confluent Schema Registry