Reverse Dependencies of langdetect
The following projects have a declared dependency on langdetect:
- KolaViz — Compute a collective dynamics from MOOC's discussion forums.
- langmo — toolbox for various tasks in the area of vector space models of computational linguistic
- langsearch — Easily create semantic search based LLM applications on your own data
- language-remote — no summary
- learnware — The learnware package supports the submission, usability testing, organization, identification, deployment, and reuse of learnware.
- libprocess — no summary
- libretranslate — Free and Open Source Machine Translation API. Self-hosted, no limits, no ties to proprietary services.
- lidtk — Language identification Toolkit
- lighteval — A lightweight and configurable evaluation package
- lilac — Organize unstructured data
- lilacai — Organize unstructured data
- lm-eval — A framework for evaluating language models
- magi-dataset — Convenient access to massive corpus of GitHub repositories
- MeetupAPI — Use the combined power of the official Meetup API and a web scraper to implement Meetup into your project.
- mlprimitives — Pipelines and primitives for machine learning and data science.
- multiocr — no summary
- mvodb — Rename and move files using metadata from online databases.
- myspokenlanguagedetection — Spoken language identification with CNN and RNN - Improved Version: accuracy up
- news-please — news-please is an open source easy-to-use news extractor that just works.
- nitter-miner — cli utility for data mining https://nitter.net/serach
- nlnormaliz — Natural language normalizer for documents in Python
- nlp — HuggingFace/NLP is an open library of NLP datasets.
- nlp-900 — NLP
- nlp-rake — Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python
- nlp-text-cleaner — Clean the text for NLP project
- nlpyutil — Personal usual utils for python
- nltokeniz — Natural language tokenizer for documents in Python
- noba-mauve — Unit test your writing
- nonebot-plugin-vits-tts — nonebot-plugin-vits-tts
- oarepo-documents — OARepo rdm records data model
- oarepo-doi-resolver — DOI resolver for OARepo
- omdenalore — AI for Good library
- onegov.search — Elasticsearch integration for OneGov Cloud
- onnxtr — Onnx Text Recognition (OnnxTR): docTR Onnx-Wrapper for high-performance OCR on documents.
- opencompass — A comprehensive toolkit for large model evaluation
- openseneca — OpenSeneca
- osint — Collection of Open Source Intelligence (OSINT) tools
- otmt — Tools for determining if web archive collecions are Off-Topic
- ovos-lang-detector-classics-plugin — average plugin classifications for language detection
- own-knowledge-gpt — Custom Knowledge GPT
- paddle-pipelines — Paddle-Pipelines: An End to End Natural Language Proceessing Development Kit Based on PaddleNLP
- pdf2ebook — PDF to ebook
- pdf2mp3 — Converts PDF to MP3 using Google Text-to-Speech
- PDFScraper — PDF text and table search
- picklepie — a Python Package
- plagdef — A tool which makes life hard for students who try to make theirs simple.
- platform-gen-ai — This is pipeline code for accelerating solution accelerators
- poise-cli — Poise, a CLI for retrieving quotes on Goodreads
- pptx-tools — A power point tools
- principledinvestigator — not yet
- pulpfiction — A simple utility tool to detect non-English comments in code
- py-string-tool — useful additional string functions
- pydatamail-ml — pydatamail_ml - Machine Learning extension for pydatamail
- pydetex — An application that transforms LaTeX code to plain text
- pydoxtools — This library contains a set of tools in order to extract and synthesize structured information from documents
- pyGenealogicalTools — Genealogical tools
- pykoko — KOKO is an easy-to-use entity extraction tool
- python-claude-api — The python package that returns Response of Anthropic Claude through API.
- python-doctr — Document Text Recognition (docTR): deep Learning for high-performance OCR on documents.
- python-flickr-mirroring — CLI for mirroring flickr photos of a specific user
- python-golos — Python library for Golos blockchain
- pywidgets — Lightweight utility package for common computer vision tasks.
- QueryRewriter — no summary
- quicktranslate — translate with youdao,baidu and google
- ragtime — Ragtime 🎹 is an LLMOps framework to automatically evaluate Retrieval Augmented Generation (RAG) systems and compare different RAGs / LLMs
- rejected-article-tracker — Utility package to track if a journal article has been published somewhere.
- rpunct — An easy-to-use package to restore punctuation of text.
- rstojnic-tfds-nightly — tensorflow/datasets is a library of datasets ready to use with TensorFlow.
- rtst — no summary
- ryder — News reader for Python
- sampleReddit — Take snowball samples of Reddit data
- school-transport-application-form-tool — School Transport Application Form Tool
- searchdatamodels — no summary
- sentiment-analysis — Sentiment analysis for paragraph or sentence
- serey — A python library for the Serey blockchain
- sermos-tools — Sermos Tools
- shareberry — A python shareberry library.
- SimilarityText — Find the similarity between two texts using AI
- sis-great-ai — Transform your prototype AI code into production-ready software.
- sky — AI powered scraping in Python 3
- small-web-dataset — Process all the RSS and Atom feeds from the Small Web feeds list, validate them, generate statistics and eventually more.
- SmiToText — test processing
- social-analyzer — API, CLI & Web App for analyzing & finding a person's profile across 300+ social media websites (Detections are updated regularly)
- soft-404 — A classifier for detecting soft 404 pages
- sosse — Selenium Open Source Search Engine
- SourceRank — no summary
- spacy-langdetect — Fully customizable language detection pipeline for spaCy
- spacy-language-detection — Fully customizable language detection for spaCy pipeline
- splunk-appinspect — Automatic validation checks for Splunk Apps
- steem — Official python steem library.
- steep-steem — Fork of official python STEEM library.
- TabNamesCat — CategorizationTabNames
- tcb-sheet-tools — A Collection of Utilities. Not even can be described.
- TDMYSA — scores
- tensorflow-datasets — tensorflow/datasets is a library of datasets ready to use with TensorFlow.
- Terminalia — A Python Library for Command Line Interface (CLI) Development
- text-util-en-pt — Python project for text cleaning. Some specifics for English and Portuguese languages.
- textcl — Text preprocessing package for use in NLP tasks
- textLSP — Language server for text spell and grammar check with various tools.
- textnormaliser — A python package that runs a series of operations over text to decorate a corpus