Category: article

Comparing PySpark with Map Reduce programming

PySpark is the Python library for Spark programming. It allows developers to interface with RDDs (Resilient Distributed Datasets) and perform…

Continue Reading Comparing PySpark with Map Reduce programming
getDbt

How does DBT handle dependencies and data lineage?

DBT handles dependencies and data lineage by providing a set of features that allow users to manage and organize data…

Continue Reading How does DBT handle dependencies and data lineage?
Snowflake

How does Snowflake differ from other data warehousing solutions

Snowflake is a cloud-based data warehousing solution that differs from traditional on-premises and other cloud-based data warehousing solutions in several…

Continue Reading How does Snowflake differ from other data warehousing solutions