Category Archives: Data Science

Order By
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Apache Airflow

Apache Airflow is a programming-based framework for automating authoring, scheduling, and monitoring Beam data pipelines. These...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Trino

Trino is a distributed SQL query engine. It has the potential to query large datasets from...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Delta Lake

Delta Lake is an open-source project that allows you to create a Lakehouse design based on...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Vespa

Vespa is a low-latency computing engine for massive data sets. It indexes and stores your data...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Apache Calcite

Apache Calcite is a full-stack category tool used for managing dynamic data. It’s an open-source database...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Koalas

The Koalas project implements the pandas DataFrame API on top of Apache Spark, making data scientists...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

PalmerPenguins

PalmerPenguins is an open-sourced dataset. This dataset was built and developed to replace the very well-known...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

Caffe

Caffe is a deep learning framework that was designed and built with speed, modularity, and expression...
Tweet about this on TwitterPin on PinterestShare on LinkedInShare on Google+Email this to someoneShare on FacebookShare on VkontakteShare on Odnoklassniki

NeoML

NeoML is an end-to-end machine learning framework that allows you to build, train, and deploy ML...