Tag: Spark_Interview

Hive @ Freshers.in

Hive : Different types of Hive execution engines

Hive is an open-source data warehouse tool built on top of Hadoop. It allows users to write SQL-like queries, called…

Continue Reading Hive : Different types of Hive execution engines
PySpark @ Freshers.in

PySpark : LongType and ShortType data types in PySpark

pyspark.sql.types.LongType pyspark.sql.types.ShortType In this article, we will explore PySpark’s LongType and ShortType data types, their properties, and how to work…

Continue Reading PySpark : LongType and ShortType data types in PySpark
PySpark @ Freshers.in

PySpark : HiveContext in PySpark – A brief explanation

One of the key components of PySpark is the HiveContext, which provides a SQL-like interface to work with data stored…

Continue Reading PySpark : HiveContext in PySpark – A brief explanation
PySpark @ Freshers.in

PySpark: Explanation of PySpark Full Outer Join with example.

One of the most commonly used operations in PySpark is joining two dataframes together. Full outer join is one of…

Continue Reading PySpark: Explanation of PySpark Full Outer Join with example.
PySpark @ Freshers.in

PySpark : Extracting minutes of a given date as integer in PySpark [minute]

pyspark.sql.functions.minute The minute function in PySpark is part of the pyspark.sql.functions module, and is used to extract the minute from…

Continue Reading PySpark : Extracting minutes of a given date as integer in PySpark [minute]