Presto is an open-source distributed SQL query engine. It lets users run interactive analytic queries against data sources ranging in size from gigabytes to petabytes. It was built from the ground up for interactive analytics and scales to the size of organizations like Facebook while approaching the speed of commercial data warehouses. Presto can query data stored in Hive, Cassandra, relational databases, and even proprietary data stores. A single query can combine data from multiple sources, which allows analytics across an entire organization. This removes the false choice between adopting an expensive commercial solution for fast analytics and settling for a slow “free” alternative that requires a lot of hardware.
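
As a rough illustration of querying several sources at once, the sketch below assembles one SQL statement that joins a Hive table with a MySQL table. Presto addresses tables as catalog.schema.table; the catalog names (hive, mysql), schemas, tables, and columns used here are hypothetical, and the helper functions are illustrative only. The code builds the query text and does not connect to a cluster.

```python
def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a fully qualified Presto table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def build_federated_query() -> str:
    """Build one SQL statement that joins tables from two different catalogs."""
    # Hypothetical catalogs, schemas, and tables, named only for illustration.
    orders = qualified("hive", "web", "orders")
    customers = qualified("mysql", "crm", "customers")
    return (
        "SELECT c.region, count(*) AS order_count\n"
        f"FROM {orders} AS o\n"
        f"JOIN {customers} AS c ON o.customer_id = c.id\n"
        "GROUP BY c.region"
    )


if __name__ == "__main__":
    print(build_federated_query())
```

The resulting text could be submitted through any Presto client. Because each table is qualified with its catalog, the engine can resolve both sources within the same statement.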

Project Aria, Project Presto Unlimited, User Defined Functions, Apache Pinot and Druid Connectors, RaptorX, Presto-on-Spark, and the Disaggregated Coordinator (a.k.a. Fireball) are some of the latest innovations in Presto. Drawbacks that users may encounter include its unsuitability for joins between large fact tables and, before the User Defined Functions work listed above, the absence of UDF (User-Defined Function) support.

To learn more about recent updates and to contribute, visit https://github.com/prestodb/presto
