Big Data - Freshers.in

How to replace a value with another value in a column in a PySpark DataFrame
In PySpark, we can replace a value in one column or multiple columns or multiple…
In Snowflake, how to encrypt a BINARY value using a BINARY key
ENCRYPT_RAW encrypts a BINARY value using a BINARY key. Syntax: ENCRYPT_RAW( <value_to_encrypt>…
PySpark : Find the maximum value in an array column of a DataFrame
pyspark.sql.functions.array_max: The array_max function is a built-in function in PySpark that finds the maximum value…
PySpark : Find the minimum value in an array column of a DataFrame
pyspark.sql.functions.array_min: The array_min function is a built-in function in PySpark that finds the minimum value…
PySpark : Finding the position of a given value in an array column.
pyspark.sql.functions.array_position: The array_position function is used to find the position of a given value in…
In Spark, how to replace null values for all columns or for each column separately - PySpark (na.fill)
Spark API: pyspark.sql.DataFrameNaFunctions.fill Syntax: fill(value, subset=None) value: "value" can only be int,…
PySpark : How to convert a sequence of key-value pairs into a dictionary in PySpark
pyspark.sql.functions.create_map: create_map is a function in PySpark that is used to convert a sequence of…
Retrieving value of a specific element in an array or map column of a DataFrame.
pyspark.sql.functions.element_at: In PySpark, the element_at function is used to retrieve the value of a specific…
How to find whether an array contains a given value or values using PySpark (PySpark search in array)
array_contains: You can find a specific value or values in an array using the Spark SQL function array_contains. array_contains(array,…
Explain how you can implement dynamic partitioning in Hive (automatically creating partitions based on column value)
Dynamic partition in Hive: Dynamic partitioning is a tactical method…
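
As a rough illustration of the array helpers listed above (array_max, array_min, array_position), here is a plain-Python sketch of how they behave on a single array value. It does not call PySpark. The function bodies are assumptions that model the documented behaviour: null elements are skipped by array_max and array_min, and array_position returns a 1-based index, or 0 when the value is absent.

```python
# Plain-Python stand-ins for three pyspark.sql.functions array helpers.
# These are illustrative models only, not the PySpark implementations.

def array_max(values):
    """Return the largest non-null element, or None if there is none."""
    present = [v for v in values if v is not None]
    return max(present) if present else None


def array_min(values):
    """Return the smallest non-null element, or None if there is none."""
    present = [v for v in values if v is not None]
    return min(present) if present else None


def array_position(values, target):
    """Return the 1-based index of the first match, or 0 when absent."""
    for index, value in enumerate(values, start=1):
        if value == target:
            return index
    return 0


scores = [3, None, 9, 4]          # hypothetical array column value
highest = array_max(scores)       # nulls are ignored
lowest = array_min(scores)
where_nine = array_position(scores, 9)
where_seven = array_position(scores, 7)
```

In PySpark itself, the equivalent calls would be applied to a DataFrame column rather than to a Python list.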

Tag: Big Data

What is the swappiness value? What role does it play during cluster setup?

What is the Snowflake MERGE command? How to use it?

What are the Data Processing Operators in Snowflake ?

What are the Query Operators supported by Snowflake?

PySpark: how to get rows having nulls for a column, columns without nulls, or the count of non-null values

PySpark – groupby with aggregation (count, sum, mean, min, max)

PySpark filter : How to filter data in PySpark – Multiple options explained.

Amazon Aurora quick reference and cheat sheet.

Amazon Athena quick reference and cheat sheet

How to drop multiple partitions in Hive by giving a condition.
