Reverse Dependencies of PyMuPDF
The following projects have a declared dependency on PyMuPDF:
- ebib — ebib is a bibliography manager system aimed to work with Gitlab/Github pages
- edge-pdf — pdf工具库
- ehiden — no summary
- emreader — no summary
- enex2notion — Import Evernote ENEX files to Notion
- epub-image-helper — This tool allows you to easily convert specified photos and images into EPUB e-book format, making it accessible for family and friends. It can be used to create monthly or yearly photo collections for children and transform travel photos into e-books.
- evadb — EvaDB AI-Relational Database System
- expdf2txt — PDF to TXT
- ezdxf — A Python package to create/manipulate DXF drawings.
- fadoudou2 — Awesome OCR toolkits based on PaddlePaddle (8.6M ultra-lightweight pre-trained model, support training and deployment among server, mobile, embeded and IoT devices)
- FanTeX — A TeX editor for scientific writing.
- farm-haystack — LLM framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data.
- fbl — FBL is tool to find broken links in articles and files
- fendi — An auto py builder for ChatBots on top of streamlit app's - LLaMa's powered APS
- fetcharoo — A Python library for downloading files from a webpage, with support for recursion depth and optional merging.
- fiction-dl — A content downloader, capable of retrieving works of (fan)fiction from the web and saving them in a few common file formats.
- fillpdf — A Library to fill and flatten pdfs
- findmyfile — This package allows you to search a directory for documents that match keywords
- fitz-utils — Extra functions for use with pymupdf module
- flushai — SDK for Flush AI (flushai.cloud)
- fsearchpy — This package searches for a specified text pattern in various documents, including office and PDFs and prints results including file name, path and size while displaying progress and completion time
- GeneralAgent — General Agent: From LLM to Agent
- genius-chatbot — Use huggingface models to create an intelligent and scalable chatbot
- geniusrise-vision — Huggingface bolts for geniusrise
- getpaper — getpaper - papers download made easy!
- gigachain — Building applications with LLMs through composability
- gigachain-community — Community contributed LangChain integrations.
- gpt-editor-utils — no summary
- gpt-pdf-md — A Python package that utilizes GPT-4V and other tools to convert PDFs into Markdown files.
- GPT-PDF-Reader — A Python package that utilizes GPT-4V and other tools to extract and process information from PDF files
- gpt-researcher — GPT Researcher is an autonomous agent designed for comprehensive online research on a variety of tasks.
- graffle2pdftex — A command line utility that exports omnigraffle canvases files to pdf_tex.
- greenpass — Scriptable green pass verifier
- h2ogpt — no summary
- hammadml-gpu — Hammad Python ~ Machine Learning
- hammer-sh — A package containing useful methods for my masterthesis
- handprint — Run handwritten text recognition services on images of documents
- Highlighted-PDF-2-Anki-FlashCards — No description yet
- hipdf — Highlight the first word of English sentences in PDF file.
- horbach-cli — no summary
- hough — Skew detection and correction in scanned images
- ibott-files — This packages crates a simple way to work with, files, folders, images and pdfs.
- ihk-ausbildungsnachweis-utilities — Utilities to generate IHK Ausbildungsnachweise PDFs from human readable input format and sign them.
- img2table — img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing
- imgtovar — Extracting structured variables from image data
- indico_toolkit — A package to support Indico IPA development
- injected-utils — no summary
- insights-extractor — Efficient PDF analysis, text extraction, preprocessing, and pattern recognition with customizable configurations and utilities.
- iscc-sdk — SDK for creating ISCCs (International Standard Content Codes)
- joinpdf — A simple cli that merge multiple PDF files.
- khoj-assistant — An AI copilot for your Second Brain
- knovleks — no summary
- knowmine — Knowledge mining package
- kv-pdf-processor — no summary
- LangaraCourseInfo — Langara Course Information Aggregator
- langchain — Building applications with LLMs through composability
- langchain_1111_Dev_cerebrum — Building applications with LLMs through composability
- langchain-by-johnsnowlabs — Building applications with LLMs through composability
- langchain-community — Community contributed LangChain integrations.
- langchain-upstage — An integration package connecting Upstage and LangChain
- langchain-utils — Utilities built upon the langchain library
- langchain-xfyun — 在LangChain中流畅地使用讯飞星火大模型
- langchaincoexpert — Building applications with LLMs through composability
- langchainmsai — Building applications with LLMs through composability
- langchainn — Building applications with LLMs through composability
- langplus — Building applications with LLMs through composability
- langroid — Harness LLMs with Multi-Agent Programming
- lbpextract — A tool to convert La Banque Postale's account documents (PDFs) into CSV files.
- Leer-PDFR370 — Librería para leer ficheros PDFs y extraer la información en formato str
- legal-pre-processing — Pre processing tools for documents with legal content.
- Lense — For QandA
- lib-funciones — Libreria de funciones usadas en física experimental II UNLP
- linkrot — Extract metadata and URLs from PDF files
- llama-index-readers-file — llama-index readers file integration
- llm-parse — Parse data from documents optimised for downstream llm tasks.
- llm2openai — Create a Python package.
- llm4data — LLM4Data is a Python library designed to facilitate the application of large language models (LLMs) and artificial intelligence for development data and knowledge discovery.
- llmopenai — Create a Python package.
- localretriever — A simple Python package
- lost-cat-images — Lost Cat (images) is a package for image related uris and tools
- lost-cat-office — Lost cat Office is a package containing office document parsers
- Map-Helper — A Tool for Helping with Flood Maps
- markdown-pdf — Markdown to pdf renderer
- marker-pdf — Convert PDF to markdown with high speed and accuracy.
- mdpdf — Python command line application to convert Markdown to PDF.
- mediqbox-loadpdf — A mediqbox component for extracting figures and tables from PDF files
- mkdocs-thumbnails — An MkDocs plugin. Generates thumbnails of PDF files and YouTube links.
- mlopspython-extraction — Extraction package for MLOpsPython project
- mocksign — Easily simulate printing, hand-signing and scanning of documents, inspired by FalsiSign.
- modulomuysimpleoscar — funcion es_primo
- monopoly-sg — PDF parsing for Singaporean banks
- moodleteacher — A Moodle client library for teachers.
- msanalyzer — Analyze XPS report files generated by Mastersizer 2000
- nafigator — Python package to convert spaCy and Stanza documents to NLP Annotation Format (NAF)
- neuralpit — NeuralPit SDK
- nonebot-paddle-ocr — nonebot_paddle_ocr
- nonebot-plugin-chatpdf — A nonebot plugin for chatpdf
- notion-export-prettify — no summary
- Ocr-Req — An example python package
- ocrmypdf — OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched