PySpark @ Freshers.in

How to replace a value with another value in a column in Pyspark Dataframe ?

In PySpark we can replace a value in one column or multiple column or multiple values in a column to…

PySpark @ Freshers.in

How to drop nulls in a dataframe : PySpark

For most of the data cleansing the first thing that you may need to do drop the nulls in the…

Snowflake

Why sqitch init snowflake cannot determine Snowflake account name ?

Currently supported databases by Sqitch’s database change management tool include Snowflake’s Cloud Data Warehouse as well as PostgreSQL 8.4+, SQLite…

PySpark @ Freshers.in

In Spark how to replace null value for all columns or for each column separately-PySpark (na.fill)

Spark api : pyspark.sql.DataFrameNaFunctions.fill Syntax : fill(value, subset=None) value : “value” can only be int, long, float, string, bool or…

PySpark @ Freshers.in

How to create an array containing a column repeated count times – PySpark

For repeating array elements k times in PySpark we can use the below library. Library : pyspark.sql.functions.array_repeat array_repeat is a…

Oops @ Freshers.in

OOPS interview questions for freshers and experienced

1. What is OOPS? OOPS is abbreviated as Object Oriented Programming system in which programs are considered as a collection…

AI for Solving Quantitative Reasoning Problems – Minerva

Google AI Introduces Minerva: A Natural Language Processing (NLP) Model for solvingĀ  Mathematical Questions Solving mathematical and scientific questions was…

Apache Airflow

Airflow dags not getting refreshed/updating. How to do it manually?

Once we created DAG in Airflow, we expect it to get refreshed the same time. Once we refresh the Airflow…

Docker Interview Questions and Answers for Experienced and Freshers

1. Can you explain how Docker is advantageous over Hypervisors? Docker is advantageous in the below ways 1. It is…

pyplot : Sample code to draw a graph using python like MATLAB.

matplotlib.pyplot is a collection of functions that make matplotlib work like MATLAB. pyplot function can creates plotting area in a…