Apache Airflow: Apache Airflow is a framework for programmatically authoring, scheduling, and monitoring data pipelines. These... (Jul 31, by Thomas)
Delta Lake: Delta Lake is an open-source project that lets you build a Lakehouse architecture based on... (Jul 31, by Thomas)
Trino: Trino is a distributed SQL query engine. It can query large datasets from... (Jul 31, by Thomas)
Apache Cassandra: Apache Cassandra is a scalable, high-performance database that can run on commodity hardware or cloud... (Jul 31, by Thomas)
Vespa: Vespa is an engine for low-latency computation over massive data sets. It indexes and stores your data... (Jul 31, by Thomas)
Apache Calcite: Apache Calcite is a framework for managing dynamic data. It’s an open-source database... (Jul 31, by Thomas)
Koalas: The Koalas project implements the pandas DataFrame API on top of Apache Spark, making data scientists... (Jul 31, by Thomas)
PalmerPenguins: PalmerPenguins is an open-source dataset. It was built to replace the very well-known... (Jul 31, by Thomas)
Caffe: Caffe is a deep learning framework designed with speed, modularity, and expression... (Jul 31, by Thomas)
NeoML: NeoML is an end-to-end machine learning framework that allows you to build, train, and deploy ML... (Jul 31, by Thomas)