Tag: big_data_interview

PySpark @ Freshers.in

PySpark : Truncate date and timestamp in PySpark [date_trunc and trunc]

pyspark.sql.functions.date_trunc(format, timestamp) Truncation function offered by Spark Dateframe SQL functions is date_trunc(), which returns Date in the format “yyyy-MM-dd HH:mm:ss.SSSS”…

Continue Reading PySpark : Truncate date and timestamp in PySpark [date_trunc and trunc]
PySpark @ Freshers.in

PySpark : Inserting row in Apache Spark Dataframe.

In PySpark, you can insert a row into a DataFrame by first converting the DataFrame to a RDD (Resilient Distributed…

Continue Reading PySpark : Inserting row in Apache Spark Dataframe.
PySpark @ Freshers.in

PySpark : How to write Scala code in spark shell ?

To write Scala code in the Spark shell, you can simply start the Spark shell by running the command “spark-shell”…

Continue Reading PySpark : How to write Scala code in spark shell ?
PySpark @ Freshers.in

PySpark : What happens once you do a spark submit command ?

When you submit a Spark application using the spark-submit command, a series of steps occur to start and execute the…

Continue Reading PySpark : What happens once you do a spark submit command ?