DuckDB is an embedded database, similar to SQLite, but designed for OLAP-style analytics. It is crazy fast and allows you to read and write data stored in CSV, JSON, and Parquet files directly, ...
The ultimate duo I didn’t know I needed ...
An open-source Python library for simplifying local testing of Databricks workflows using PySpark and Delta tables. This library enables seamless testing of PySpark processing logic outside Databricks ...