pymupdf-fitz

Here are 25 public repositories matching this topic...

vickypandey14 / Convert-PDF-into-Image-By-Python

This Python script converts each page of a PDF document into separate image files. It utilizes the PyMuPDF library (fitz) to handle PDF operations and the Python Imaging Library (PIL) for image processing.

python python-script pdf-converter pymupdf pymupdf-fitz

Updated Feb 22, 2024
Python

das-amlan / PDF_Image_Extractor_Web_App

Star

This is a simple web app that allows users to upload a PDF file, extract images from the PDF, and display the images in the web app.

python html flask fitz streamlit pymupdf-fitz

Updated Dec 1, 2024
Python

ifte110 / Serach_all_pdfs_by_string

Star

Search through all pdf files in a folder for a specific keyword or string of keywords.

python pymupdf-fitz pdfsearchtool

Updated Feb 27, 2025
Python

malavika-suresh / multiple_pdf_comparison

Star

This Python-based tool allows for efficient comparison of two or more PDF documents, highlighting the differences between them. It extracts and compares the words in the PDFs, ignoring whitespace differences, and highlights the changed, added, or missing words.

python pdf differences-detected difflib pdf-comparison text-comparison fitz comparison-tool multiple-pdfs pymupdf-fitz pdf-comparison-highlight-differences

Updated Jul 2, 2025
Python

pawankumar94 / graphscribe-table-extractor

Star

Graphscribe is an intelligent, LLM-powered document understanding system designed to extract structured insights from complex visual content such as statistical diagrams, charts, and graphs.

tesseract-ocr ocr-recognition pymupdf pymupdf-fitz langchain genai-chatbot gemini-flash qwen2-5

Updated Apr 21, 2025
Python

mcagriaksoy / diff_merge_pdf

Star

A tool for compare, merge, display difference and make OCR between the PDFs.

pdf-viewer pdf-generator pdf-merger ocr-recognition pdf-comparison x-ray-images ocr-text-reader diff-tool pdf-document-processor pdf-ocr-extraction pyqt6-desktop-application pymupdf-fitz pdf-ocr pdf-visual-testing diff-tool-pdf

Updated Jan 21, 2024
Python

Sazizi2025 / PDF-Founder

Star

Are you short on time?! Can't you search all the PDFs one by one for the content you want?! Well, PDF-Founder is here...

python pdf gui image tesseract rgb graphical tesseract-ocr easy-to-use image-generator snipping pdf-search-engine pymupdf pysimplegui pdf-search ptl pymupdf-fitz

Updated Jan 8, 2024
Python

devbm7 / QGen

Star

Question Generator System

nlp json ml transformers pandas python3 pytorch spacy wikipedia-api nltk smtp regular-expressions streamlit pymupdf-fitz t5-large

Updated Oct 21, 2024
Python

atthharvva / PDF-Form-Reader

Star

This Python script extracts information from PDF forms using OCR (Optical Character Recognition) and saves the extracted data into an Excel file. It is particularly designed for processing forms with checkboxes and textual fields. The script can handle variations in form structure and allows for easy customization to accommodate other PDF form type

python forms pillow pdf-forms openpyxl csv-export ocr-text-reader pdf-document-processor pymupdf-fitz graphical-checkboxes

Updated Jan 9, 2025
Python

OtenMoten / pdf-alchemist

Star

It's designed for transmuting PDFs into HTML. Harness the power of OCR, image processing, and web technologies to unlock the secrets within your PDF documents.

python pdf-converter pillow tesseract-ocr beautifulsoup4 pdf-document-processor dominate pymupdf-fitz tdqm

Updated Aug 9, 2024
Python

IglesiasT / comparador-pdfs

Star

python pdf-comparison pymupdf-fitz

Updated Aug 7, 2024
Python

ParthaPRay / pdf_text_extraction_json_section_subsection

Star

This repo contains codes for extraction of PDF text to JSON to show section number, section title, section body content, footnote

pdf json text regex extraction document article-extractor pymupdf-fitz

Updated Dec 23, 2024
Python

helgesander02 / TKFruitMG

Star

An ERP system that uses customtkinter as the GUI base, with a postgreSQL database and reportlab, win32print, and pymupdf-fitz design.

postgresql reportlab customtkinter pymupdf-fitz win32print

Updated Dec 5, 2023
Python

kalyaninagaraj / NFHS5

Star

Python code to read, retrieve, analyze, and plot district-level findings from official (pdf) publications of the 5th National Family Health Survey of India

principal-component-analysis kmeans-clustering geopandas matplotlib-pyplot tutorial-python nfhs5 pymupdf-fitz

Updated May 24, 2022
Jupyter Notebook

MelinaNorton / JournalAnalyzer

Star

Python tool for extracting, summarizing & embedding PDF journals—recommends best-fit publications via LangChain & GPT-4.1

python openai text-summarization research-tool academic-journals document-embedding pymupdf document-embeddings pdf-processing gpt-4 pymupdf-fitz llm llms chatgpt langchain langchain-python

Updated Jun 28, 2025
Python

Jatin-s16 / Resume-check-portal-for-candidates

Star

A Streamlit-based application that enables job seekers to evaluate and enhance their resumes by analyzing alignment with specific job descriptions, providing actionable insights for improvement.

python nlp regex cosine-similarity spacy-nlp streamlit sentence-transformers pymupdf-fitz

Updated Apr 8, 2025
Jupyter Notebook

Deepcoders30 / AI-CHATPDF

Star

ChatPDF is a web application that lets users upload PDFs and ask questions about their content.

javascript typescript reactjs fastapi pymupdf-fitz langchain faiss-vector-database groq-integration

Updated Jul 5, 2025
TypeScript

bilalhameed248 / PDF-Document-Extraction

Star

Python PDF-to-HTML Converter: Transforming PDF Documents into Structured HTML Tags. - Feb 2022 - Jun 2023

python pdf parser parsing extraction python3 document fitz pymupdf pymupdf-fitz

Updated Nov 5, 2023
Python

nngel / PDF-thumbnail-service

Star

A production-ready FastAPI microservice that functions as a PDF thumbnail generator, converting the first page of PDF files to optimized PNG thumbnails.

python pdf image thumbnail-generator fastapi vercel pymupdf-fitz

Updated Jun 5, 2025
Python

micheldpd24 / rag_aph_hippocrate

Star

RAG / Chatbot IA sur les Aphorismes d'Hippocrate

docker flask mistral rag pymupdf-fitz llm ollama rag-chatbot faiss-cpu hippocrate medecine-ancienne assistant-ia

Updated May 29, 2025
Jupyter Notebook

Improve this page

Add a description, image, and links to the pymupdf-fitz topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pymupdf-fitz topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pymupdf-fitz

Here are 25 public repositories matching this topic...

vickypandey14 / Convert-PDF-into-Image-By-Python

das-amlan / PDF_Image_Extractor_Web_App

ifte110 / Serach_all_pdfs_by_string

malavika-suresh / multiple_pdf_comparison

pawankumar94 / graphscribe-table-extractor

mcagriaksoy / diff_merge_pdf

Sazizi2025 / PDF-Founder

devbm7 / QGen

atthharvva / PDF-Form-Reader

OtenMoten / pdf-alchemist

IglesiasT / comparador-pdfs

ParthaPRay / pdf_text_extraction_json_section_subsection

helgesander02 / TKFruitMG

kalyaninagaraj / NFHS5

MelinaNorton / JournalAnalyzer

Jatin-s16 / Resume-check-portal-for-candidates

Deepcoders30 / AI-CHATPDF

bilalhameed248 / PDF-Document-Extraction

nngel / PDF-thumbnail-service

micheldpd24 / rag_aph_hippocrate

Improve this page

Add this topic to your repo