Author: user

Engineering : Overview of Bayes network

A Bayes network, also known as a Bayesian network, is a type of probabilistic graphical model: a graphical representation of probabilistic…

getDbt

How do you use DBT to manage your data lineage?

Data lineage refers to the history of data as it moves from its source to its destination, including transformations and…

PySpark @ Freshers.in

PySpark : Formatting numbers to a specific number of decimal places.

pyspark.sql.functions.format_number: One of the useful functions in PySpark is the format_number function, which is used to format numbers to a…

PySpark @ Freshers.in

PySpark : Creating multiple rows for each element in the array[explode]

pyspark.sql.functions.explode: One of the important operations in PySpark is the explode function, which is used to convert a column of…

PySpark @ Freshers.in

PySpark : How decode works in PySpark?

One of the important concepts in PySpark is data encoding and decoding, which refers to the process of converting data…

PySpark @ Freshers.in

PySpark : Extracting dayofmonth, dayofweek, and dayofyear in PySpark

pyspark.sql.functions.dayofmonth, pyspark.sql.functions.dayofweek, pyspark.sql.functions.dayofyear: One of the most common data manipulations in PySpark is working with date and time columns. PySpark…
