Python : How to extract images from PDF files

user October 13, 2022 Leave a Comment on Python : How to extract images from PDF files

In this article you can see how to extract images from pdf files and save it in your local. For that here we are using PyPDF2 library.

PyPDF2 is a pure-python PDF library that can split, merge, crop, and otherwise alter the pages of PDF files. It is free and open-source.

Install PyPDF2

!pip install PyPDF2

Sample code to extract images from PDF

from PyPDF2 import PdfReader
pdfreader = PdfReader("freshers_ny.pdf")
first_page = pdfreader.pages[0]
count = 0
for image_file in first_page.images:
    with open(str(count) + image_file.name,"wb") as fp:
        fp.write(image_file.data)
        count = count + 1

PyPDF2 Official page
Get more post on Python, PySpark

Post Views: 15

How to merge multiple PDF files using Python?
Use case : If you have multiple files for example chapter wise question papers etc.…
Python-How to extract multiple words between two strings-(Extracting word between {})
Here we will see how to extract string between two specific character/string. This is a…
Python : Program to get all the files with full path, modified after a specific date.
This program uses the os.walk() function to iterate through all files and directories in the…
How to read data from AWS Secrets Manager using Python ?
Python programmers can utilise the boto3 library, which is the AWS SDK for Python, to…
Python 3.11.0 is now available
Major new features of the Python 3.11 series, are Include Fine-Grained Error Locations in Tracebacks.…
Python : Understanding traceback.format_exc() in Python
In Python, the traceback module provides functions for working with tracebacks, which are snapshots of…
How to convert a Python object to JSON data using the json module ?
Python comes with a built-in module called json; pip is not required to instal it.…
Ways to get the distribution of a column in Python, depending on the type of data
There are several ways to get the distribution of a column in Python, depending on…
Python : How to remove background of an image using Python
In this article we will see how can we remove background of an image using…
How to access hive using Python (Source code )
Use case : If you want to do some scheduling or some automation , we…