Reverse Dependencies of editdistance
The following projects have a declared dependency on editdistance:
- addok — Search engine for address. Only address.
- allennlp-semparse — A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP
- allosaurus — a multilingual phone recognizer
- amazon-textract-textractor — A package to use AWS Textract services.
- amphetype — Advanced typing practice program
- arivo.om — Util functions for om docker containers
- asrp — no summary
- asrtoolkit — The GreenKey ASRToolkit provides tools for automatic speech recognition (ASR) file conversion and corpora organization.
- autofj — Auto-Program Fuzzy Similarity Joins Without Labeled Examples
- bartide — A Python package to extract, correct and analyze nucleotide barcodes from sequenced reads.
- benchmarkstt — A library for benchmarking AI/ML applications.
- beymax — A high-level, functional programming wrapper to discord.py
- brainwalk — Spatial graph embeddings for ObsidianMD
- camel-tools — A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
- cf-nlp — ClowdFlows natural language processing module
- chat-with-functions — no summary
- click-fuzzy — Fuzzy command matching and aliases for click
- climoji — A cli emoji finder
- codegen-metrics — Package for computation of code generation metrics
- contextualSpellCheck — Contextual spell correction using BERT (bidirectional representations)
- CyberU — Your spider tool based on selenium and an even general platform based on Python.
- deepa2 — Cast NLP data as multiangular DeepA2 datasets and integrate these in training pipeline
- dlc2action — tba
- doc-curation — A package for curating doc file collections, with ability to sync with youtube and archive.org doc items.
- dyslexic-readability — A readability scoring library tailored to the specific needs of Turkish dyslexic readers.
- eDOCr — OCR for Engineering Mechanical Drawings
- emlangkit — Emergent Language Analysis Toolkit
- espnet — ESPnet: end-to-end speech processing toolkit
- ethpwn — A swiss army knife package to help with ethereum smart contract exploit interaction, designed with CTF challenges in mind. Some might call it a set of pwn tools for ethereum exploitation.
- ExpoSeq — A pacakge which provides various ways to analyze NGS data from phage display campaigns
- fastpat — USPTO patent data fetcher and parser
- fings — Filters for Next Generation Sequencing
- flexs — FLEXS: an open simulation environment for developing and comparing model-guided biological sequence design algorithms.
- fonduer — Knowledge base construction system for richly formatted data.
- fonetika — Phonetics algorithms (Soundex and Metaphone) for russian, english, finnish and estonian languages
- funasr — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funcodec — FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec
- fuzzyjoin — Join two tables by a fuzzy comparison of text columns.
- game-prediction2 — Game prediction, simplified. No edits, no GCP. Everything's just `AsyncIterables` in and out.
- gchar — Game character manager.
- generic-iterative-stemmer — A generic language stemming utility, dedicated for gensim word-embedding.
- genet — GenET: Genome Editing Toolkit
- greynirseq — Natural language processing for Icelandic
- grounder — Estimate the quality and factual correctness of natural language text to help ground language models such as BERT, GPT-J, GPT-Neo, ChatGPT, and Bard.
- ilakkani — a tower of spellcheckers for Tamil
- image-ocr — A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
- immuneML — immuneML is a software platform for machine learning analysis of immune receptor repertoires.
- innatis — A library of useful custom Rasa components
- inspora-rasa-utilities — Some Rasa NLU components for making my life easier
- json-merger — Python module that is able to merge json record objects.
- jupyter-ascending — no summary
- keras-ocr — A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
- knowknow-amcgail — Analyzing the evolution of ideas using citation analysis
- ksu — Implementation of the KSU compression algorithm https://www.cs.bgu.ac.il/~karyeh/compression-arxiv.pdf
- librelingo-json-export — Export LibreLingo courses in the JSON format used by the web app
- linguistics — Python library for natural language processing
- llmuses — Eval-Scope: Lightweight LLMs Evaluation Framework
- mead-audio8 — MEAD Audio
- mgefinder — A toolbox for identifying mobile genetic element (MGE) insertions from short-read sequencing data of bacterial isolates.
- miRBaseMiner — Mining the miRNA annotation in miRBase for comprehensive understanding in miRNA annotation reference before implementing in miRNA study.
- mmf — mmf: a modular framework for vision and language multimodal research.
- mothertongues — Mother Tongues Dictionaries dictionary creation tool
- mountaintop — make research work more friendly
- namematch — Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets
- nemo-text-processing — NeMo text processing for ASR and TTS
- nemo-toolkit — NeMo - a toolkit for Conversational AI
- nmt — Neural Machine Translation for NLPIA 2nd Edition
- nmtpytorch — Sequence-to-Sequence Framework in Pytorch
- noise2read — Turn noise to read
- ntils-yuxin-wang — NLP utils
- ocrd-cor-asv-ann — sequence-to-sequence translator for noisy channel error correction
- ocrstack — A simple OCR package
- onegov.core — Contains code shared by all OneGov applications.
- openmodule — Libraries for developing the arivo openmodule
- orbis-plugin-scoring-wl-harvest-scorer — The Weblyzard Harvest Scroring plugin for Orbis
- paddlespeech — Speech tools and models based on Paddlepaddle
- paiargparse — no summary
- panphon — Tools for using the International Phonetic Alphabet with phonological features
- pdf-struct — Logical structure analysis of visually structured documents.
- phonepiece — a multilingual phone tokenizer
- preppipe — Document to Visual Novel generator
- proteinflow — Versatile pipeline for processing protein structure data for deep learning applications.
- pug-nlp — Python Natural Language Processing by and for the Python User Group in Portland, OR
- pyastsim — Detect similarities between Python source files
- pydyno — Dynamic analysis of systems biology models
- pytesstrain — Collection of utilities for Tesseract OCR training
- pytokenjoin — pyTokenJoin is a library containing efficient algorithms that solve the set similarity join problem with maximum weighted bipartite matching.
- pywer — A simple Python package to calculate word error rate (WER).
- rna-seq-tools — simple functions for manipulating sequences and secondary structures in pandas dataframe format
- ru-soundex — Soundex algorithm for russian, english and finnish languages
- s3prl-vc — Voice conversion toolkit based on S3PRL: Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
- seqio — SeqIO: Task-based datasets, preprocessing, and evaluation for sequence models.
- seqio-nightly — SeqIO: Task-based datasets, preprocessing, and evaluation for sequence models.
- seqm — Utilities for calculating sequence metrics.
- similarity — Python library for measuring string similarity
- spacy-ke — Keyword extraction with spaCy
- SRRec — Short Reads Rectification
- subhaashita — A package for curating subhaashita-s (quotes) in various languages.
- sumerian-ner — Sumerian Named Entity Recognition
- t5 — Text-to-text transfer transformer
1
2