Bistro: General-purpose data processing engine for both batch and stream analytics. It is based on a novel data model that represents data via functions and processes data via column operations, in contrast to conventional approaches such as MapReduce or SQL, which rely only on set operations.
IBM Streams: Platform for distributed processing and real-time analytics. Integrates with many popular technologies in the Big Data ecosystem (Kafka, HDFS, Spark, etc.).
Apache Hadoop: Framework for distributed processing. Integrates MapReduce (parallel processing), YARN (job scheduling) and HDFS (distributed file system).
Tigon: High-throughput, real-time stream processing framework.
Pachyderm: Data storage platform built on Docker and Kubernetes that provides reproducible data processing and analysis.
Polyaxon: Platform for reproducible and scalable machine learning and deep learning.