Wheelodex — pdf2image — Reverse Dependencies

Reverse Dependencies of pdf2image

The following projects have a declared dependency on pdf2image:

llmreflect — a package for llm self-reflection
llmvm-cli — Command Line LLM with client-side tools support.
llmware — An enterprise-grade LLM-based development framework, tools, and fine-tuned models
lmdx-flow — Python toolkit for document information extraction using LMDX
mim-ocr — Tool for using different OCR engines and process their results using common data structures.
mis-scan-handler — Processing scans of machine-readable TrustMed documents
mkdocs-annexes-integration — A MkDocs plugin transforming annexes files into images to be integrated in markdown pages
mkdocs-pdf2image-plugin — An MkDocs plugin to convert the first page of a pdf to an image
mmda — MMDA - multimodal document analysis
motionpdf — A script built on Tesseract-OCR for converting .pdf to .txt
multilingual-pdf2text — A python library for extracting text from PDFs without losing the formatting of the PDF content.
napari-pdf-reader — Reader for PDF files
NstudyPy — Useful NStudyPy tools
ocr-joplin-notes — Add OCR data to Joplin notes
ocr-pdf-jpg-png — Tesseract OCR with OpenCV preprocessing and auto-correct.
ocrdataextractor — Proteus data extractor File
ocrpy — unified interface to google vision, aws textract, azure & tesseract OCR tools.
OCRUSREX — OCRUSREX takes a PDF (either by path or as a file-like object) and makes it searchable using Tesseract 4. It has an enterprise-friendly license.
Ocrversion1 — no summary
Ocrversion2 — no summary
odoo-addon-document-quick-access — Document quick access
odoo-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
odoo11-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
odoo12-addon-document-quick-access — Document quick access
odoo12-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
odoo13-addon-document-quick-access — Document quick access
odoo13-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
odoo14-addon-document-quick-access — Document quick access
odoo14-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
Oilele — Comic book visualizer
opencopilot-ai — OpenCopilot Backend
opticr — expose a single interface and API to a few OCR tools
oracle-of-ammon — CLI tool for creating Search APIs.
paddle-pipelines — Paddle-Pipelines: An End to End Natural Language Processing Development Kit Based on PaddleNLP
papermage — Papermage. Casting magic over scientific PDFs.
papermerge-core — Open source document management system for digital archives
parsee-pdf-reader — no summary
pdf-binder — A tool for preparing PDFs for bookbinding
pdf-converter-nixx — no summary
pdf-orientation-corrector — A Python module to automatically detect and correct the orientation of pages in PDF documents.
pdf-scrapper — Pdf Scrapping interface
pdf-to-cb — PDF to Comic Book format
pdf-watermark — A python CLI tool to add watermarks to a PDF
pdf2dataset — Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extracting text and images
pdf2dcm — A PDF to Dicom Converter
pdf2image-cli — pdf2image port to a CLI version
pdf2ppt — A tool to convert PDF documents to PPTX format with an adjustable DPI setting.
pdf2pptx-cli — convert pdf to 1200 dpi image ppt
pdf2sb — Upload PDF file to Gyazo as images then convert Scrapbox format
pdf2table — pdf2table is a powerful Python tool designed to streamline the extraction of tabular data from PDF documents.
pdf2txt — A better pdf to text extraction toolkit
pdf2up — A small utility to generate fairly high resolution preview images of PDFs suitable for viewing or sharing to social media
PdfCC — PDF cropper & compressor: removes unwanted noise from pdf and compresses them
PDFCompareTrueDiff — A PDF comparison tool which helps to view the differences side-by-side
PdfDarkMode — Converts PDFs to have a grey background to be easier on the eyes
pdfdarkness — A command line tool for calculating the darkness of the pages of PDF files
pdfner — Information extraction and named-entity recognition for indexing PDFs
pdfpad — no summary
PDFScraper — PDF text and table search
pdfshot — A Python CLI to export pages from PDF files as images.
pdfToImg — Easily convert PDF to Image from command line
pdftoprompt — Python library to abbreviate a PDF file to GPT 8k prompt length
pdftty — A PDF viewer for the terminal
pih-tls — Shared tools for PIH module
platform-gen-ai — This is pipeline code for accelerating solution accelerators
pm4ngs — PM4NGS generates a standard organizational structure for Next Generation Sequencing (ngs) data analysis
polybiblioglot — A tool to translate scanned books
pressurecooker — A collection of utilities for media processing.
pydoxtools — This library contains a set of tools in order to extract and synthesize structured information from documents
PyLexia — no summary
pynada — Python client for NADA API
pypdfops — A utility library for pdf manipulation
pytesseract-cli — A pytesseract wrapper enabling OCR on images and directories.
python-ocr — Input Adaptor to verify file extension
python-slides — A Python package for slideshows.
rbclassifydoc — Classify documents using rule based approach
reading4listeners — A deep-learning powered application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!
readyocr — A nice OCR package for Amazon Textract and Google Document AI
refuel-autolabel — Label, clean and enrich text datasets with LLMs
Ret2GPT — Ret2GPT: Advanced AI-powered binary analysis tool leveraging OpenAI's LangChain technology, revolutionizing CTF Pwners' experience in binary file interpretation and vulnerability detection.
ricecooker — API for adding content to the Kolibri content curation server
salt-viewer — Simple (archived) image viewer
sci-annot-eval — The evaluation component of the sci-annot framework
SDSParser — Extract chemical data from Safety Data Sheet documents
seckerwiki — A collection of scripts used to manage my personal Foam workspace
sermos-tools — Sermos Tools
sheatless — A python library for extracting parts from sheetmusic pdfs
sherlockpipe — Search for Hints of Exoplanets fRom Lightcurves Of spaCe based seeKers
SigProfilerAssignment — Mutational signatures attribution and decomposition tool
SimorghOCR — A simple OCR application using CustomTkinter, Tesseract, and EasyOCR.
sourav-easyocr — Input Adaptor to verify file extension
sourav-tesseract — Input Adaptor to verify file extension
spacypdfreader — A PDF to text extraction pipeline component for spaCy.
study-buddy — A simple package for parsing PDFs to text for Linux and Mac OS
studytool — Command lines for study
styled-prose — Generate images and thumbnails based on bitmap transformations of rendered prose
svgdigitizer — svgdigitizer is a Python library and command line tool to recover the measured data underlying plots in scientific publications.
syize — A tool kit package
tabledetector — End-to-End table structure detector
tableParser — It extracts the table data from the pdf
