How to replace a value with another value in a column in Pyspark Dataframe ?
In PySpark we can replace a value in one column or multiple column or multiple values in a column to…
How to drop nulls in a dataframe : PySpark
For most of the data cleansing the first thing that you may need to do drop the nulls in the…
Why sqitch init snowflake cannot determine Snowflake account name ?
Currently supported databases by Sqitch’s database change management tool include Snowflake’s Cloud Data Warehouse as well as PostgreSQL 8.4+, SQLite…
In Spark how to replace null value for all columns or for each column separately-PySpark (na.fill)
Spark api : pyspark.sql.DataFrameNaFunctions.fill Syntax : fill(value, subset=None) value : “value” can only be int, long, float, string, bool or…
How to create an array containing a column repeated count times – PySpark
For repeating array elements k times in PySpark we can use the below library. Library : pyspark.sql.functions.array_repeat array_repeat is a…
OOPS interview questions for freshers and experienced
1. What is OOPS? OOPS is abbreviated as Object Oriented Programming system in which programs are considered as a collection…
AI for Solving Quantitative Reasoning Problems – Minerva
Google AI Introduces Minerva: A Natural Language Processing (NLP) Model for solvingĀ Mathematical Questions Solving mathematical and scientific questions was…
Airflow dags not getting refreshed/updating. How to do it manually?
Once we created DAG in Airflow, we expect it to get refreshed the same time. Once we refresh the Airflow…
Docker Interview Questions and Answers for Experienced and Freshers
1. Can you explain how Docker is advantageous over Hypervisors? Docker is advantageous in the below ways 1. It is…
pyplot : Sample code to draw a graph using python like MATLAB.
matplotlib.pyplot is a collection of functions that make matplotlib work like MATLAB. pyplot function can creates plotting area in a…