Author: user

Comparing PySpark with MapReduce programming

PySpark is the Python library for Spark programming. It allows developers to interface with RDDs (Resilient Distributed Datasets) and perform…

getDbt

How does DBT handle dependencies and data lineage?

DBT handles dependencies and data lineage by providing a set of features that allow users to manage and organize data…
