Use case : For scheduling or automation tasks, you may need to access Hive from Python.
Libraries used : impyla (https://github.com/cloudera/impyla)
Python version used : 3.6
Here our environment is AWS EMR with preconfigured
#Source Code - Python
#Code by www.Freshers.in
#For Learning purpose
#Python Hive
from impala.dbapi import connect
conn = connect(host='10.18.179.231', port=10000, auth_mechanism='PLAIN')
cur = conn.cursor()
cur.execute("show databases")
dbs = cur.fetchall()
for db in dbs:
    print(db)
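For a scheduled job it can help to wrap the same query in a small function that always closes the cursor and connection. The following is only a sketch under a few assumptions: the helper names list_databases and main are made up for illustration, the host and port are the ones used above, and the connect function is passed in as an argument so the helper can be imported and tried out even on a machine where impyla is not installed.

```python
from contextlib import closing


def list_databases(connect_fn, host, port=10000, auth_mechanism="PLAIN"):
    """Return the Hive database names as a plain list of strings.

    connect_fn is assumed to behave like impala.dbapi.connect
    (hypothetical parameter, introduced only for this sketch).
    """
    # closing() calls .close() on exit, even if the query raises.
    with closing(connect_fn(host=host, port=port,
                            auth_mechanism=auth_mechanism)) as conn:
        with closing(conn.cursor()) as cur:
            cur.execute("show databases")
            # fetchall() returns one tuple per row; the name is column 0.
            return [row[0] for row in cur.fetchall()]


def main():
    # Imported here so the module loads without impyla present.
    from impala.dbapi import connect  # requires: pip install impyla

    for name in list_databases(connect, host="10.18.179.231"):
        print(name)
```

Call main() from your scheduler entry point to run it against the cluster. Passing connect_fn in is a design choice for testability, not something impyla requires.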
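If you hit an error like “bitarray-1.6.1-cp36-cp36m-win_amd64.whl is not a supported wheel on this platform.”
you may need to install one of the wheels below. Pick the one that matches your Python installation on Windows: pip checks the bitness of the Python interpreter, so a 32-bit Python on 64-bit Windows still needs the 32-bit wheel.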
For 64 bit
pip install https://download.lfd.uci.edu/pythonlibs/w4tscw6k/cp36/bitarray-1.6.1-cp36-cp36m-win_amd64.wh
For 32 bit
pip install https://download.lfd.uci.edu/pythonlibs/w4tscw6k/cp36/bitarray-1.6.1-cp36-cp36m-win32.whl
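If you are not sure which of the two wheels applies, a quick check of the running interpreter can tell you. This is a rough sketch, assuming CPython on Windows; the function names interpreter_bits and expected_wheel_suffix are made up for illustration, and the suffix it builds only mirrors the naming pattern of the files above.

```python
import struct
import sys


def interpreter_bits():
    """Return 32 or 64 depending on the running Python build."""
    # Size of a C pointer in bytes, times 8, gives the interpreter bitness.
    return struct.calcsize("P") * 8


def expected_wheel_suffix(bits, major, minor):
    """Build a Windows wheel suffix such as cp36-cp36m-win_amd64.whl."""
    platform_tag = "win_amd64" if bits == 64 else "win32"
    python_tag = "cp{}{}".format(major, minor)
    # Assumption: CPython builds up to 3.7 carry an 'm' in the ABI tag,
    # later versions do not.
    abi_tag = python_tag + ("m" if (major, minor) <= (3, 7) else "")
    return "{}-{}-{}.whl".format(python_tag, abi_tag, platform_tag)


bits = interpreter_bits()
print("Python is", bits, "bit")
print("Look for a wheel ending in",
      expected_wheel_suffix(bits, sys.version_info[0], sys.version_info[1]))
```

If the suffix printed does not match the file you tried to install, that mismatch is the likely reason pip rejected the wheel.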