Author: user

PySpark @ Freshers.in

How to get json object from a json string based on json path specified – get_json_object – PySpark

get_json_object get_json_object will extracts json object from a json string based on json path mentioned and this will and returns…

Google sheet @freshers.in

Google Sheet Quick Reference Guide and Keyboard Shortcuts

Google Sheet Quick Reference Guide and Keyboard Shortcuts Create a Spreadsheet from Google Drive: In Google Drive, click the New…

PySpark @ Freshers.in

How to round the given value to scale decimal places using HALF_EVEN rounding in Spark – PySpark

bround function bround function returns the rounded expr using HALF_EVEN rounding mode. That means bround will round the given value…

Descriptive vs Diagnostic vs Predictive vs Prescriptive: 4 type of Analytics

a. Descriptive: This tells you what happened in the past. You will get the data from the past and report…

MS Excel @ Freshers.in

Excel shortcuts for daily use tasks

Shortcuts that you can use for your daily tasks 1 CTRL*A Select All 2 CTRL+C Copy all Cells in Highlighted…

Hive @ Freshers.in

What are the Optimization Techniques that you can apply on Apache Hive ?

1. Partitioning : Partitioning works by dividing the data into smaller segments, These are created using logical grouping based on…

PySpark @ Freshers.in

How to replace a value with another value in a column in Pyspark Dataframe ?

In PySpark we can replace a value in one column or multiple column or multiple values in a column to…

PySpark @ Freshers.in

How to drop nulls in a dataframe : PySpark

For most of the data cleansing the first thing that you may need to do drop the nulls in the…

Snowflake

Why sqitch init snowflake cannot determine Snowflake account name ?

Currently supported databases by Sqitch’s database change management tool include Snowflake’s Cloud Data Warehouse as well as PostgreSQL 8.4+, SQLite…

PySpark @ Freshers.in

In Spark how to replace null value for all columns or for each column separately-PySpark (na.fill)

Spark api : pyspark.sql.DataFrameNaFunctions.fill Syntax : fill(value, subset=None) value : “value” can only be int, long, float, string, bool or…