How to Work With a PDF in Python
Andrew Stephen
6 Lessons
31m
intermediate
The Portable Document Format, or PDF, is a file format for presenting and exchanging documents reliably across operating systems. Although PDF was originally created by Adobe, it is now an open standard maintained by the International Organization for Standardization (ISO). You can work with an existing PDF in Python by using the PyPDF2 package.
PyPDF2 is a pure-Python package that you can use for many different types of PDF operations.
By the end of this course, you'll know how to:
- Extract document information from a PDF in Python
- Rotate pages
- Merge PDFs
- Split PDFs
- Add watermarks
- Encrypt a PDF
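As a rough preview of the first item in the list, the sketch below pulls basic document information out of a PDF. It is not taken from the course itself. It assumes PyPDF2 2.0 or later, where the reader class is named `PdfReader` and exposes `metadata` and `pages` (older releases used `PdfFileReader` with different method names). The helper names `summarize` and `summarize_file` are hypothetical.

```python
def summarize(reader):
    """Return basic document information from a PyPDF2-style reader object.

    `reader` is assumed to expose `.metadata` and `.pages`, as PyPDF2's
    PdfReader does in version 2.0 and later.
    """
    info = reader.metadata  # may be None when the PDF has no info dictionary
    return {
        "pages": len(reader.pages),
        "title": getattr(info, "title", None),
        "author": getattr(info, "author", None),
    }


def summarize_file(path):
    """Open the PDF at `path` and summarize it (hypothetical helper)."""
    # Imported here so that summarize() can be used without PyPDF2 installed.
    from PyPDF2 import PdfReader  # assumption: PyPDF2 >= 2.0

    with open(path, "rb") as f:
        # Read everything while the file is still open.
        return summarize(PdfReader(f))
```

Calling `summarize_file("charts.pdf")` on a file of your own (the filename here is only a placeholder) would return a dictionary with the page count and whatever title and author the document declares. Either value may be `None` if the PDF does not set it.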
2. History of PyPDF2 (03:50)
4. Merging and Splitting PDFs (04:16)
About Andrew Stephen
Andrew is an avid Pythonista and creates video tutorials for Real Python. He is a qualified robotics and mechatronics engineer who works for an engineering firm as a production engineer and loves his sport, music, gaming and learning.

mikesult on March 1, 2020
Thank you Andrew for a great and very useful tutorial. I learned a lot about working with PDFs. I use pdf files as music charts quite a bit and these techniques will be very useful to split, merge and organize charts from pdf books. I appreciate your links to additional resources too.