Reverse Dependencies of pdf2image
The following projects have a declared dependency on pdf2image:
- llmreflect — a package for llm self-reflection
- llmvm-cli — Command Line LLM with client-side tools support.
- llmware — An enterprise-grade LLM-based development framework, tools, and fine-tuned models
- lmdx-flow — Python toolkit for document information extraction using LMDX
- mim-ocr — Tool for using different OCR engines and process their results using common data structures.
- mis-scan-handler — Processing scans of machine-readable TrustMed documents
- mkdocs-annexes-integration — A MkDocs plugin transforming annexes files into images to be integrated in markdown pages
- mkdocs-pdf2image-plugin — An MkDocs plugin to convert the first page of a pdf to an image
- mmda — MMDA - multimodal document analysis
- motionpdf — A script built on Tesseract-OCR for converting .pdf to .txt
- multilingual-pdf2text — A python library for extracting text from PDFs without losing the formatting of the PDF content.
- napari-pdf-reader — Reader for PDF files
- NstudyPy — A NStudyPy useful tools
- ocr-joplin-notes — Add OCR data to Joplin notes
- ocr-pdf-jpg-png — Tesseract OCR with OpenCV preprocessing and auto-correct.
- ocrdataextractor — Proteus data extractor File
- ocrpy — unified interface to google vision, aws textract, azure & tesseract OCR tools.
- OCRUSREX — OCRUSREX takes a PDF (either by path or as a file-like object) and makes it searchable using Tesseract 4. It has an enterprise-friendly license.
- Ocrversion1 — no summary
- Ocrversion2 — no summary
- odoo-addon-document-quick-access — Document quick access
- odoo-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
- odoo11-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
- odoo12-addon-document-quick-access — Document quick access
- odoo12-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
- odoo13-addon-document-quick-access — Document quick access
- odoo13-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
- odoo14-addon-document-quick-access — Document quick access
- odoo14-addon-document-quick-access-folder-auto-classification — Auto classification of Documents after reading a QR
- Oilele — Comic book visualizer
- opencopilot-ai — OpenCopilot Backend
- opticr — expose a single interface and API to few OCR tools
- oracle-of-ammon — CLI tool for creating Search APIs.
- paddle-pipelines — Paddle-Pipelines: An End to End Natural Language Proceessing Development Kit Based on PaddleNLP
- papermage — Papermage. Casting magic over scientific PDFs.
- papermerge-core — Open source document management system for digital archives
- parsee-pdf-reader — no summary
- pdf-binder — A tool for preparing PDFs for bookbinding
- pdf-converter-nixx — no summary
- pdf-orientation-corrector — A Python module to automatically detect and correct the orientation of pages in PDF documents.
- pdf-scrapper — Pdf Scrapping interface
- pdf-to-cb — PDF to Comic Book format
- pdf-watermark — A python CLI tool to add watermarks to a PDF
- pdf2dataset — Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extracting text and images
- pdf2dcm — A PDF to Dicom Converter
- pdf2image-cli — pdf2image port to a CLI version
- pdf2ppt — A tool to convert PDF documents to PPTX format with an adjustable DPI setting.
- pdf2pptx-cli — convert pdf to 1200 dpi image ppt
- pdf2sb — Upload PDF file to Gyazo as images then convert Scrapbox format
- pdf2table — pdf2table is a powerful Python tool designed to streamline the extraction of tabular data from PDF documents.
- pdf2txt — A better pdf to text extraction toolkit
- pdf2up — A small utility to generate fairly high resolution preview images of PDFs suitable for viewing or sharing to social media
- PdfCC — PDF cropper & compressor: removes unwanted noise from pdf and compresses them
- PDFCompareTrueDiff — A PDF comparison tool which helps to view the differences side-by-side
- PdfDarkMode — Converts PDFs to have a grey background to be easier on the eyes
- pdfdarkness — A command line tool for caluclating the darkness of the pages of PDF files
- pdfner — Information extraction and named-entity recognition for indexing PDFs
- pdfpad — no summary
- PDFScraper — PDF text and table search
- pdfshot — A Python CLI to export pages from PDF files as images.
- pdfToImg — Easily convert PDF to Image from command line
- pdftoprompt — Python library to abbreviate a PDF file to GPT 8k prompt length
- pdftty — A PDF viewer for the terminal
- pih-tls — Shared tools for PIH module
- platform-gen-ai — This is pipeline code for accelerating solution accelerators
- pm4ngs — PM4NGS generates a standard organizational structure for Next Generation Sequencing (ngs) data analysis
- polybiblioglot — A tool to translate scanned books
- pressurecooker — A collection of utilities for media processing.
- pydoxtools — This library contains a set of tools in order to extract and synthesize structured information from documents
- PyLexia — no summary
- pynada — Python client for NADA API
- pypdfops — A utility library for pdf manupulation
- pytesseract-cli — A pytesseract wrapper enabling OCR on images and directories.
- python-ocr — Input Adaptor to verify file extension
- python-slides — A Python package for slideshows.
- rbclassifydoc — Classify documents using rule based approach
- reading4listeners — A deep-learning powered application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!
- readyocr — A nice package OCR for Amazon Textract and Google Document AI
- refuel-autolabel — Label, clean and enrich text datasets with LLMs
- Ret2GPT — Ret2GPT: Advanced AI-powered binary analysis tool leveraging OpenAI's LangChain technology, revolutionizing CTF Pwners' experience in binary file interpretation and vulnerability detection.
- ricecooker — API for adding content to the Kolibri content curation server
- salt-viewer — Simple (archived) image viewer
- sci-annot-eval — The evaluation component of the sci-annot framework
- SDSParser — Extract chemical data from Safety Data Sheet documents
- seckerwiki — A collection of scripts used to manage my personal Foam workspace
- sermos-tools — Sermos Tools
- sheatless — A python library for extracting parts from sheetmusic pdfs
- sherlockpipe — Search for Hints of Exoplanets fRom Lightcurves Of spaCe based seeKers
- SigProfilerAssignment — Mutational signatures attribution and decomposition tool
- SimorghOCR — A simple OCR application using CustomTkinter, Tesseract, and EasyOCR.
- sourav-easyocr — Input Adaptor to verify file extension
- sourav-tesseract — Input Adaptor to verify file extension
- spacypdfreader — A PDF to text extraction pipeline component for spaCy.
- study-buddy — A simple package for parsing PDFs to text for Linux and Mac OS
- studytool — Command lines for study
- styled-prose — Generate images and thumbnails based on bitmap transformations of rendered prose
- svgdigitizer — svgdigitizer is a Python library and command line tool to recover the measured data underlying plots in scientific publications.
- syize — A tool kit package
- tabledetector — End-to-End table structure detector
- tableParser — It extract the table data from the pdf