Extracts PDF content using RapidOCR.
Updated May 7, 2025 - Python
Sample code for the Datalogics C++, Java, and .NET interfaces of the Adobe PDF Library
A powerful and user-friendly tool based on OCRmyPDF, offering a seamless GUI for conversion of image-based PDFs into searchable text.
Sample code for the Datalogics C++ interface of the Adobe PDF Library
Simple frontend for OCRmyPDF (Windows only).
Sample code for the Datalogics .NET interface of the Adobe PDF Library
Sample code for the Datalogics Java interface of the Adobe PDF Library, set up to build with Maven
Sample code for the Datalogics .NET Framework interface of the Adobe PDF Library
This UiPath project, developed during the STGI Hackathon, automates resume screening for HR teams. It extracts emails with a specified subject, saves the attached PDF resumes, and uses Tesseract OCR for data extraction. The extracted data is used to fill a form, and at end of day (EOD) an audit report with insights and a CSV of responses are generated and sent to a specified email address.
ocr resume
Example Django (Python) project containing modules for OCR, PDF to OCR PDF conversion, text similarity/dissimilarity, and PDF to PNG conversion.
This repository contains examples of performing OCR on PDF documents in an ASP.NET Core web application and Azure App Service using the Syncfusion .NET OCR library.
Content-aware file sorter with OCR for NAS (Synology/QNAP/Docker-friendly).
This project aims to use OpenAI function calling to extract the required data from several kinds of invoice PDF files.
Adobe PDF Library Samples in Kotlin
Batch process all PDF files in a folder to make them searchable with OCR using ocrmypdf and a simple PowerShell script. Output files are saved in an 'output' subfolder. Perfect for Windows users needing fast PDF text recovery.
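The batch workflow described above (every PDF in a folder, results written to an 'output' subfolder) can be sketched in Python rather than PowerShell. This is only an illustration, not the repository's script: the function names `plan_ocr_jobs` and `run_ocr_jobs` are hypothetical, and it assumes the `ocrmypdf` command-line tool is installed and on the PATH, invoked in its basic `ocrmypdf input.pdf output.pdf` form.

```python
import subprocess
from pathlib import Path


def plan_ocr_jobs(folder):
    """Pair each PDF directly inside `folder` with a target path in `folder/output`.

    Hypothetical helper: it only computes paths and touches nothing on disk.
    """
    folder = Path(folder)
    out_dir = folder / "output"
    pdfs = sorted(p for p in folder.glob("*.pdf") if p.is_file())
    return [(pdf, out_dir / pdf.name) for pdf in pdfs]


def run_ocr_jobs(folder):
    """Run the ocrmypdf CLI once per PDF and return the inputs that failed.

    Assumes `ocrmypdf` is on the PATH; a non-zero exit code counts as a failure.
    """
    jobs = plan_ocr_jobs(folder)
    if jobs:
        # Create the 'output' subfolder only when there is something to write.
        (Path(folder) / "output").mkdir(exist_ok=True)
    failed = []
    for src, dst in jobs:
        result = subprocess.run(["ocrmypdf", str(src), str(dst)])
        if result.returncode != 0:
            failed.append(src)
    return failed
```

Calling `run_ocr_jobs("scans")` would process every PDF in a hypothetical `scans` folder. Planning is kept separate from execution so the input/output pairing can be inspected without running OCR.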