PySpark: Calculating the exponential of a given column [exp]

user May 23, 2023

PySpark offers the exp function in its pyspark.sql.functions module. It calculates the exponential of each value in a given column, that is, e (Euler's number, approximately 2.71828) raised to the power of that value.

In this article, we look at the function's signature and walk through an example of its usage.

Function Signature

The exp function signature in PySpark is as follows:

pyspark.sql.functions.exp(col)

The function takes a single argument:

col: A column expression representing a column in a DataFrame. The column should contain numeric data for which you want to compute the exponential.

Example Usage

Let’s examine a practical example to better understand the exp function. Suppose we have a DataFrame named df containing a single column, col1, with five numeric values.

from pyspark.sql import SparkSession
# Reuse the active SparkSession if one exists; otherwise create one
spark = SparkSession.builder.getOrCreate()
data = [(1.0,), (2.0,), (3.0,), (4.0,), (5.0,)]
df = spark.createDataFrame(data, ["col1"])
df.show()

Result:

+----+
|col1|
+----+
| 1.0|
| 2.0|
| 3.0|
| 4.0|
| 5.0|
+----+

Now, we wish to compute the exponential of each value in the col1 column. We can achieve this using the exp function:

from pyspark.sql.functions import exp
df_exp = df.withColumn("col1_exp", exp(df["col1"]))
df_exp.show()

In this code, the withColumn method adds a new column to the DataFrame. This new column, col1_exp, contains e raised to the power of each value in the col1 column. The output will resemble the following:

+----+------------------+
|col1|          col1_exp|
+----+------------------+
| 1.0|2.7182818284590455|
| 2.0|  7.38905609893065|
| 3.0|20.085536923187668|
| 4.0|54.598150033144236|
| 5.0| 148.4131591025766|
+----+------------------+

As you can see, the col1_exp column now holds the exponential of the values in the col1 column.
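As a quick sanity check, the values in the table above can be compared with Python's standard math.exp. This is only a sketch: the dictionary name spark_output is hypothetical, the numbers are copied from the output shown above, and the comparison uses a small relative tolerance on the assumption that the last digit may differ between Spark (which runs on the JVM) and Python.

```python
import math

# col1 values mapped to the col1_exp values copied from the output table.
spark_output = {
    1.0: 2.7182818284590455,
    2.0: 7.38905609893065,
    3.0: 20.085536923187668,
    4.0: 54.598150033144236,
    5.0: 148.4131591025766,
}

# Compare each reported value with e**x computed in plain Python.
checks = {
    x: math.isclose(reported, math.exp(x), rel_tol=1e-12)
    for x, reported in spark_output.items()
}

for x, ok in checks.items():
    print(f"exp({x}) matches math.exp within tolerance: {ok}")
```

A check like this is only practical for small samples; for a full DataFrame the comparison would need to run inside Spark itself.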

PySpark’s exp function computes the exponential of numeric data one column at a time. Because it is a built-in column function, it can be combined with withColumn or select like any other column expression, which makes it convenient for transformations on large datasets.
