Apache Airflow: Apache Airflow is a code-based framework for authoring, scheduling, and monitoring data pipelines. These... (Jul 31, by Thomas)
Trino: Trino is a distributed SQL query engine. It can query large datasets from... (Jul 31, by Thomas)
Delta Lake: Delta Lake is an open-source project that lets you build a Lakehouse architecture based on... (Jul 31, by Thomas)
Apache Cassandra: Apache Cassandra is a scalable and high-performance database that can run on commodity hardware or cloud... (Jul 31, by Thomas)
Vespa: Vespa is an engine for low-latency computation over massive data sets. It indexes and stores your data... (Jul 31, by Thomas)
Apache Calcite: Apache Calcite is a dynamic data management framework. It’s an open-source database... (Jul 31, by Thomas)
Koalas: The Koalas project implements the pandas DataFrame API on top of Apache Spark, making data scientists... (Jul 31, by Thomas)
PalmerPenguins: PalmerPenguins is an open-source dataset. It was built to replace the very well-known... (Jul 31, by Thomas)
Caffe: Caffe is a deep learning framework that was designed and built with speed, modularity, and expression... (Jul 31, by Thomas)
NeoML: NeoML is an end-to-end machine learning framework that allows you to build, train, and deploy ML... (Jul 31, by Thomas)