Reverse Dependencies of PyMuPDF
The following projects have a declared dependency on PyMuPDF:
- 3m — 3m
- AdyanUtils — Special package
- afipcaeqrdecode — Package to decode and extract invoice metadata from an AFIP CAE qr code link
- agl-ocr-reader — OCR API: This OCR API is an application for extracting text from images and PDF files. It is built using Flask, a Python web framework. It utilizes the pytesseract OCR library, pymupdf and the PIL library for image processing.
- airclick — airclick 相关python包
- alacorder — Alacorder retrieves case detail PDFs from Alacourt.com and processes them into data tables suitable for research purposes.
- alldata — This is a Package in which you can Extract Images,Text and Tables from 1 package
- anbani — Georgian alphabet and language utilities for Natural Language Processing, script conversion and more.
- api2openai — Create a Python package.
- aradf — For converting pdf documents to txt files
- arcan — An AI web3 tooling platform for the decentralized customization and enhancement of AI agents
- arcanum-newspaper-segmentation-client — Client for Arcanum's Newspaper Segmentation API
- archive-hocr-tools — hOCR (streaming) parsers and writers
- archive-pdf-tools — Internet Archive PDF compression tools
- arxiv-summarizer — A happy toolkit for arxiv paper summarization and understanding.
- ascript — airclick 相关python包
- aus-council-scrapers — no summary
- ausbildungsnachweise-utils — Utilities to generate Ausbildungsnachweise PDFs from human readable input formats.
- Auto-Research — Geberate scientific survey with just a query
- autogluon.multimodal — AutoML for Image, Text, and Tabular Data
- autogluon-tonyhu-test.multimodal — AutoML for Image, Text, and Tabular Data
- autopipeline — no summary
- axa-fr-splitter — AXA France file splitter package
- b2cloud — ヤマト運輸株式会社が提供する送り状発行システムB2クラウドをpythonで利用するパッケージ
- bbrc-pyxnat — XNAT in Python
- bechdelai — Automating the Bechdel test and its variants for feminine representation in movies with AI
- Bio-Epidemiology-NER — Recognize bio-medical entities from a text corpus
- biochatter — Backend library for conversational AI in biomedicine
- bisheng-langchain — bisheng langchain modules
- bisheng-unstructured — ETLs fro LLMs
- BLA2 — This is a Package in which you can Extract Images,Text and Tables from 1 package
- bluewave — Python script to analyze the similarity of two PDFs
- bnw-tools — Tools developed in the BorgNetzWerk project for the extraction, analysis and publication of knowledge.
- BookerPdfTool — iBooker/ApacheCN 知识库抓取工具
- browsr — TUI File Browser App
- Bs-Extractor — This repository contains a Python program designed to execute Optical Character Recognition (OCR) and Facial Recognition on images.
- BsSalary-Extractor — This repository contains a Python program designed to execute Optical Character Recognition (OCR) and Facial Recognition on images.
- BuoyanText — Normalizing English and Chinese Text
- burdoc — Advanced PDF parsing for python
- camel-ai — Communicative Agents for AI Society Study
- CanD — Create complex layouts for scientific figures in matplotlib
- cardimpose — Impose multiple copies of a pdf onto a larger document.
- casparser — (Karvy/Kfintech/CAMS) Consolidated Account Statement (CAS) PDF parser
- chat-research — Use ChatGPT to accelerator your research.
- chatiq — A versatile Slack bot using GPT & Weaviate-powered long-term memory to accomplish various tasks.
- ChatLLM — Create a Python package.
- ChatSQL — Create a Python package.
- chemrel — A project which focuses on automating and transferring chemical data extraction using span categorization and relation extraction models.
- chichitk — Python UI library built upon Tkinter
- chinese-pdf-divider — divide chinese pdf file into blocks within 512
- chutoro — no summary
- cista — Dropbox-like file server with modern web interface
- civilpy — Civil Engineering Tools in Python
- clearedge — no summary
- closeai — Create a Python package.
- clown-sort — Sort screenshots based on rules or through individual review.
- cnmv-data — Extracción desde PDF de la cartera de inversión reportada por Fondos de Inversión a la CNMV
- cognee — Cognee - is a library for enriching LLM context with a semantic layer for better understanding and reasoning.
- colibrie — Colibrie is a blazing fast tool to extract tables from PDFs
- colorblind_pdf — A package to process PDFs for testing colorblind accessibility.
- colrev — CoLRev: An open-source environment for collaborative reviews
- comicbox-pdffile — A ZipFile like API for PyMuPDF
- comicpy — Tool to create CBR or CBZ files, supports PDF, ZIP, RAR files.
- compiloor — no summary
- concall-tools — Tools to extract information from concall transcripts
- cornsnake — Wrap common Python utilities for working with files, git, ZIP, lists, processes, dates and times.
- correpy — CorrePy (Corretagem Python) é uma lib responsável por parsear notas de corretagem no padrão B3 (Sinacor) e retornar os dados no formato JSON.
- CPM-Bee — Create a Python package.
- cpm-live — Create a Python package.
- csv2img — Package to save CSV as PNG
- cv-xtractor — A Python package for extracting information from CVs (resumes).
- D2F2 — Tool to convert image folders into files
- data-alchemy — Package to process documents of any format
- datatc — Automate every-day interactions with your data.
- Datev-Splitter — Splits Datev-Reports into single pdf files per Personalnummer.
- deknp — Extending from npm
- desktop-env — The package provides a desktop environment for setting and evaluating desktop automation tasks.
- dewy — Knowledge base service.
- df-extract — DecisionFacts Extraction Library extracts content from PDF, PPTX, Docx, png, jpg., and convert as structured JSON data.
- doc-analyzer — no summary
- doc-extractor — no summary
- doc-intel — Your solution to cleansing PDF documents for preprocessing for NLP
- doc-loader — Given werkzeug.FileStorage, fastapi.UploadFile or str file path as input it converts any image files(.pdf, .jpg, .png, .tiff) into list of PIL or numpy objects
- doc2data — Integrated document processing with machine learning.
- docrx — search in documents
- document-contents-extractor — A simple script to extract contents section from a PDF or DJVU document
- documentocr-verdict — This repository contains a Python program designed to execute Optical Character Recognition (OCR)
- docusign-integration — no summary
- dodfminer — no summary
- dragon-tools — A small package od tools
- DriverPAC3120 — Driver PAC3120
- easyofd — easy operate OFD
- easypdfheading — PDF subheadings finder with text.A package that allows to find subheadings in a PDF.
- ebib — ebib is a bibliography manager system aimed to work with Gitlab/Github pages
- edge-pdf — pdf工具库
- ehiden — no summary
- emreader — no summary
- enex2notion — Import Evernote ENEX files to Notion
- epub-image-helper — This tool allows you to easily convert specified photos and images into EPUB e-book format, making it accessible for family and friends. It can be used to create monthly or yearly photo collections for children and transform travel photos into e-books.
- evadb — EvaDB AI-Relational Database System