Category Archives: Data Science

Order By
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Clickhouse

Clickhouse is a column-oriented database management system used for the online analytical processing of queries (...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Apache Flink

Apache Flink is a stateful computation framework. It serves as a distributed processing engine for both...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Apache Spark

Apache Spark is an open-source cluster computing framework. It comes with programming interfaces for entire clusters....
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Presto

Presto is an open-source distributed SQL query engine. It enables the users to run interactive analytic...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Apache Zeppelin

Apache Zeppelin is a multi-purpose notebook that supports Data Ingestion, Data Discovery, Data Analytics, Data Visualization,...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

CMAK

CMAK stands for Cluster Manager for Apache Kafka, previously known as Kafka Manager, is a tool...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Cython

Cython is a static optimizer for the Python programming language. It also works well for the...