pyspark.sql.functions.array_sort
The array_sort function is a PySpark function that allows you to sort an array…
Author: user
How to convert xls to csv? The first header column got shifted to the next column (Solved)
The requirement is to convert an xls file to csv using Python. Initially we used pandas (pandas.read_excel) to read an…
How can I get all the Hive tables and their external locations, partitions, etc.?
There may be situations where you need to provide all the Hive tables created, with their location and…
Hive – Where can I get the Hive metastore details (credentials, host, server, etc.)?
In general we don’t deal directly with the Hive metastore. But there are some situations where we may need to…
Hive – What are the metastore tables in Hive?
Metastore is the central repository of Apache Hive metadata. It stores metadata for Hive tables. AUX_TABLE, BUCKETING_COLS, CDS, COLUMNS_V2, COMPACTION_QUEUE…
Python – How to extract multiple words between two strings (extracting words between {})
Here we will see how to extract a string between two specific characters or strings. This is a use case when you want…
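As a minimal sketch of the idea in this entry, assuming the goal is to collect every piece of text enclosed between a pair of delimiters such as `{` and `}`, Python's standard `re` module can do this with a non-greedy pattern. The helper name `extract_between` and the sample string are hypothetical and are not taken from the original post.

```python
import re

def extract_between(text, start="{", end="}"):
    """Return every substring found between `start` and `end` (hypothetical helper)."""
    # re.escape treats the delimiters as literal text; (.*?) is non-greedy,
    # so each match stops at the nearest closing delimiter.
    pattern = re.escape(start) + r"(.*?)" + re.escape(end)
    return re.findall(pattern, text)

sample = "Hello {name}, your order {order_id} is ready"
print(extract_between(sample))
```

Because the delimiters are parameters, the same helper also works for multi-character markers, for example `extract_between(text, "START", "END")`. Note that `.` does not match newlines by default, so text spanning several lines would need `re.DOTALL`.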
How to transfer a file from an SFTP server to local using Python
There are situations where you may need to programmatically transfer a file from an SFTP server to your local environment. Here we will…
How to remove csv header using Spark (PySpark)
A common use case when dealing with CSV files is to remove the header from the source to do data…
How to access Hive using Python (source code)
Use case: if you want to do some scheduling or automation, you may need to access Hive…
Chemical Reactions and Equations – Revision Notes (Class 10)
Chemical change: new substances are formed, with a change in chemical properties. Examples are as below: (a) Curdling of…
Matplotlib Cheat Sheet for Quick reference
Matplotlib is a Python 2D plotting library which produces publication-quality figures in a…