Reverse Dependencies of pyPdf
The following projects have a declared dependency on pyPdf:
- a-data-processing — A library that prepares raw documents for downstream ML tasks.
- ab-data-processing — Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
- add_staves — Add empty staves for your analysis to your score.
- aerospace-chatbot — Aerospace engineering chatbot and AI tools.
- Agatta — Three-item analysis python package
- agentforge — AI-driven task automation system
- ai-enterprise-agent — AI Agent simplifies the implementation and use of generative AI with LangChain.
- aideml — Autonomous AI for Data Science and Machine Learning
- aidriver — AIDriver
- ailingbot — An all-in-one solution to empower your IM bot with AI.
- aio-agents — Opinionated template for building llm agents
- aisdc — Tools for the statistical disclosure control of machine learning models
- aiutil — A utils Python package for data scientists.
- akasha-terminal — document QA package using langchain and chromadb
- alfeios — Enrich your command-line shell with Herculean cleaning capabilities
- amazon-textract-helper — Amazon Textract Helper tools
- amazon-textract-overlayer — Amazon Textract Overlay tools
- amazon-textract-pipeline-pagedimensions — Amazon Textract Pipeline Component to add page dimensions to page block types
- amical — no summary
- analysta-index — Extension of Langchain loaders, llms and retrievers for Analysta
- applyllm — A python package to apply opensource LLM in local CUDA environment
- arac — Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
- arxiv-dl — Command-line arXiv Papers Downloader. Citation extraction and PDF naming automation.
- assert-files — Assert files in test automation
- astromodule — Astronomy Tools
- atksh-utils — atksh's utils
- aus-council-scrapers — no summary
- auto-find-date-pdf — A simple lib to find dates from any txt/ pdf/ docx/ rtf source. For documentation see
- auto-learn-gpt — autoML for training and inference Deep Learning model
- axoden — axoden simplifies the quantification of axonal projections in neuroscience.
- azcam — Acquisition and analysis package for scientific imaging
- azure-ai-generative — Microsoft Azure Machine Learning Client Library for Python
- azureml-rag — Contains Retrieval Augmented Generation related utilities for Azure Machine Learning and OSS interoperability.
- bblocks — A package with tools to download and analyse international development data. These tools are meant to be the building blocks of further analysis.
- benchllm — Tool for testing LLMs
- beyondllm — Beyond LLM is an toolkit to Build Experiment Evaluate and Observe RAG pipelines
- bibtheque — Bibliography management tool.
- bioimageio-chatbot — Your Personal Assistant in Computational BioImaging.
- bisheng-pyautogen — Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
- blacksquare — A package for creating crossword puzzles
- blanks-gen — blanks_gen is a script that wraps tectonic to generate blanks for [Chgk](https://en.wikipedia.org/wiki/What%3F_Where%3F_When%3F)
- bonecommand — 一个用于提升效率的命令行工具
- botc-tokens — A collection of command line utilities for creating, updating, and grouping tokens for Blood on the Clocktower.
- botrun-pdf-to-text — no summary
- boursobank — Parses BoursoBank account statements.
- brevia — Extensible API and framework to build your Retrieval Augmented Generation (RAG) and Information Extraction (IE) applications with LLMs
- cadenai — no summary
- camelot-fork — Camelot Fork
- camelot-py — PDF Table Extraction for Humans.
- cannlytics — 🔥 Cannlytics is a suite of tools that you can use to wrangle, standardize, and analyze cannabis data
- chat-cli-anything — Chat with anything on cli.
- chat-with-mlx — A Retrieval-augmented Generation (RAG) chat interface with support for multiple open-source models, designed to run natively on MacOS and Apple Silicon with MLX.
- chellow — Web Application for checking UK energy bills.
- chromadb-data-pipes — ChromaDB Data Pipes 🖇️ - The easiest way to get data into and out of ChromaDB
- cliriculum — A python cli tool to rapidly create an html or PDF resume
- cloai — A CLI for OpenAI's API
- clonwn-sort — Sort screenshots based on rules or through individual review.
- clown-sort — Sort screenshots based on rules or through individual review.
- cocpyth — Command line interface to generate Call of Cthulhu characters
- cognee — Cognee - is a library for enriching LLM context with a semantic layer for better understanding and reasoning.
- comicon — A simple comic conversion library between CBZ/EPUB/MOBI/PDF
- comicpy — Tool to create CBR or CBZ files, supports PDF, ZIP, RAR files.
- confirms — Comprehension of trade term sheets and confirmations
- conflare — conformal retreival augmented generation with LLMs
- ConnectedPapersExtractor — A package for creating summaries based on https://www.connectedpapers.com/.
- convince — Better instruction following for large language models
- cortex-cli — Nearly Human Cortex CLI for interacting with model functions.
- cpp-aws-s3-pdf — This package createds a PDF file from selected s3 objects
- crewcal — Convert an airline crew schedule pdf into iCalendar format.
- cvfe — Canada Visa Forms (5257e and 5645e) Extractor.
- cvScore — A CLI application that scores CVs using keywords.
- cyber-signature — no summary
- cybrex — Researching AI
- danoliterate — Benchmark of Generative Large Language Models in Danish
- data-alchemy — Package to process documents of any format
- dbgpt — DB-GPT is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure.
- deep-translator — A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators
- deepdoctection — Repository for Document AI
- desktop-env — The package provides a desktop environment for setting and evaluating desktop automation tasks.
- distyll-info — Information parsing assistant
- docchat — no summary
- docindex — A package for fast persistent storage of multiple document embeddings and their metadata into Pinecone for production-level RAG.
- DocsChat — Chat with your docs using langchain in a streamlit app with mistral or llama in ollama.
- doms_databasen — Scraper and PDF text processor for domsdatabasen.dk
- dr-doc-search — Search through a document using a chat interface
- dr-service — no summary
- drafthorse — Python ZUGFeRD XML implementation
- dreamai — 🔂
- dreamai-pdf — Library based on DreamAI for parsing PDFs
- dreamai-ray — DreamAI platform leveraging RAY.
- drivers — Unix input drivers for Software 2.0
- drivescanner — Scan your filesystem to look for files that are a potential GDPR risk
- dspygen — A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.
- ebank — no summary
- ebanktool — no summary
- ecoinvent-interface — Unofficial client for interfacing with ecoinvent database
- edc-pdf-reports — Report classes using reportlab/pdf for clinicedc/edc projects
- edi-energy-scraper — a scraper to mirror edi-energy.de
- eidolon-ai-sdk — An open source sgent service SDK
- embedchain — Simplest open source retrieval (RAG) framework