Use case : If you have multiple files for example chapter wise question papers etc.…
Category: article
Multiple ways that we can remove letters from a String using Python.
In this article you can see multiple ways that we can remove letters from a String using Python. my_string =…
PySpark how to find the date difference between two date and how to round it just days without decimal (datediff,floor)
pyspark.sql.functions.datediff and pyspark.sql.functions.floor In this article we will learn two function , mainly datediff and floor. pyspark.sql.functions.datediff : To get…
PySpark – How to convert string date to Date datatype
pyspark.sql.functions.to_date In this article will give you brief on how can you convert string date to Date datatype . With…
Does Snowflake support indexes which is supported by other databases ?
The answer is NO. Indexes are not utilized or supported by Snowflake. By keeping extensive metadata in each of its…
DBT command not found after intalling DBT-How to resolve.
DBT command not found Steps to check and get this resolved Once you installed dbt after going through the https://docs.getdbt.com/dbt-cli/install/overview…
DBT – Best practices that developers show follow
DBT – Best practices The article will give you the cumulative knowledge of dbt’s seasoned users on how to use…
PySpark-How to returns the first column that is not null
pyspark.sql.functions.coalesce If you want to return the first non zero from list of column you can use coalesce function in…
How can you convert PySpark Dataframe to JSON ?
pyspark.sql.DataFrame.toJSON There may be some situation that you need to send your dataframe to a file to a server or…
How can I see the full column values in a Spark Dataframe ?
When we do a dataframe.show () , we can see that some of the column values got truncated. Here we…
What is the difference between repartition() and coalesce() ?
The repartition algorithm will perform a full shuffle and creates new partitions with data that’s distributed evenly. The repartition algorithm makes…