Reverse Dependencies of librosa
The following projects have a declared dependency on librosa:
- a62-emotion — A model for emotion classification based on text and audio.
- aaapi — Another Audio API - Collection of audio and music processing API with massive amount of dependencies
- achatbot — An open source chat bot for voice (and multimodal) assistants
- acids-msprior — MSPRIOR: A multiscale prior model for realtime temporal learning
- acids-rave — RAVE: a Realtime Audio Variatione autoEncoder
- acoustic-odometry — Acoustic Odometry library
- acousticdistance — implements acoustic distance measures between two audio snippets with a MFCC-features and an ANN-features
- activity-detection-evaluation — Library for evaluating activity detection
- adapter-transformers — A friendly fork of HuggingFace's Transformers, adding Adapters to PyTorch language models
- AdonisAI — AdonisAI is python library to build your own AI virtual assistant with natural language processing.
- adversarial-robustness-toolbox — Toolbox for adversarial machine learning.
- africanwhisper — A framework for fast fine-tuning and API endpoint deployment of Whisper model specifically developed to accelerate Automatic Speech Recognition(ASR) for African Languages.
- ai-gradio — A Python package for creating Gradio applications with AI models
- aideml — Autonomous AI for Data Science and Machine Learning
- aimet-ml — Python package of frequently used modules for ML developments in AIMET..
- airunner — Run local opensource AI models (Stable Diffusion, LLMs, TTS, STT, chatbots) in a lightweight Python GUI
- aisfx — Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.
- aivoifu — Easy and fast AI Waifu voice generation
- aiwaifu-vts-controller — no summary
- aladindb — 🔮 Super-power your database with AI 🔮
- alcokit — package to do MIR and learn generative models on top of librosa and pytorch
- alexandra-ai-eval — Evaluation of finetuned models.
- align-phonemes — Phoneme Aligner
- alignments — A library for aligning audio and text
- allin1 — All-In-One Music Structure Analyzer
- amen — Algorithmic music remixing
- amt-augpy1.0 — Python augmentation toolkit for Automatic Music Transcription datasets
- amt-tools — Machine learning tools and framework for automatic music transcription
- analisi-canti — Analisi canti
- analyzeAudio — Measure one or more aspects of one or more audio files.
- annolid — An annotation and instance segmentation-based multiple animal tracking and behavior analysis package.
- april-asr — Offline open source speech recognition API based on next-generation Kaldi
- armory-testbed — Adversarial Robustness Test Bed
- artbox — ArtBox is a tool set for handling multimedia files.
- articubench — articubench - An Articulatory Speech Synthesis Benchmark
- ArtNex — ArtNex is a deep learning framework exploring the innovative fusion of art and technology.
- as-seg — Package for the segmentation of autosimilarity matrices. This version is related to a stable vesion on PyPi, for installation in MSAF.
- asr-deepspeech — ASRDeepspeech (English / Japanese)
- asrassessment — Provides Phoneme Error Rate & Visualisation Assessment
- asrecognition — ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
- asrp — no summary
- ast-slizovskaia — Test exercise AST model on the ESC-50 dataset
- asteroid-filterbanks — Asteroid's filterbanks
- attenuate — Real-time raw speech enhancement with deep state-space modeling
- AudAugio — Augments audio for machine learning
- audclas — Package that uses extracted audio classification model from neonbjb/DL-Art-School - for filtering fine audio files.
- audearch — audearch is a simple music search system
- audio-analysis-lib — A Python library for audio analysis, similar to librosa.
- audio-cat — audio splitter and labeler
- audio-classification-features — Complete Package for Audio Classification
- audio-file-translator — audio-file-translator. For Windows, macOS, and Linux, on Python 3
- audio-offset-finder — Find the offset of an audio file within another audio file
- audio-separator — Easy to use audio stem separation, using various models from UVR trained primarily by @Anjok07
- audio-separator-ui — Easy to use audio stem separation with a UI, using various models from UVR trained primarily by @Anjok07
- Audio-Similarity — Audio similarity metrics for audio tasks
- audio-sleuth — an open-source framework for detecting audio generated from generative systems
- audio-slicer — Automatically segregates audio and cleans up muted parts
- audio-snippets — Simple snippets for audio analysis
- audio2anki — Convert audio and video files into Anki flashcard decks with translations
- audio2chat — Generate chat data from multi-speaker audio files
- AudioAugment — Audio data augmentation tool for machine learning projects
- AudioAugmentor — Python package for simple application of wide range of audio augmentations.
- AudioCarver — A library built to carve out minimum seams from an audio file
- audiodiffusion — Generate Mel spectrogram dataset from directory of audio files.
- audiodl — Audio Deep learning
- AudioFeaturizer — Takes audio as input and returns computed features as a dataframe
- audioFX — Audio effects library.
- audioic — AudioIC Project
- audioinfo-ecrit — no summary
- audioldm — This package is written for text-to-audio generation.
- audioldm-eval — This package is written for the evaluation of audio generation model.
- audioldm2 — This package is written for text-to-audio/music generation.
- audiomate — Audiomate is a library for working with audio datasets.
- audiomentations — A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
- audiomodels — audio models package with semantic features identification
- audioperm — Audioperm, a python library for generating different permutations of audible segments from audio files.
- audiosample — AudioSample is an optimized numpy-like audio manipulation library, created for researchers, used by developers.
- audiosegment — Wrapper for pydub.AudioSegment for additional methods.
- audiosr — This package is written for text-to-audio/music generation.
- audiossl — no summary
- audiotree — Audio data loading and augmentations in JAX
- audioviz — An user-friendly music information retrieval tools interfacing with Google Colab
- audiozen — Audio ZEN is a library for audio/speech signal processing.
- audtorch — Deep learning with PyTorch and audio
- augaudio — A simple audio data augmentation package
- augly — A data augmentations library for audio, image, text, & video.
- augmentaudio — A simple audio data augmentation package
- augmolino — augmentation for audio based datasets for machine learning
- aukit — audio toolkit
- auralis — This is a faster implementation for TTS models, to be used in highly async environment
- auraloss — Collection of audio-focused loss functions in PyTorch.
- auto-highlighter-py — automatically clip moments from twitch VODs
- autochord — Automatic Chord Recognition library
- autodl-gpu — Automatic Deep Learning, towards fully automated multi-label classification for image, video, text, speech, tabular data.
- automated-rhythm-generation — ARG: Automated Rhythm Generation. Let's generate rhythm game maps automatically!
- avn — Package for zebra finch song analysis.
- avr — AVR is a voice anti-spoofing system that uses deep learning models to detect spoofed audio files.
- azureml-evaluate-mlflow — Contains the integration code of AzureML Evaluate with Mlflow.
- bambird — BAM, unsupervised labelling function to extract and cluster similar animal vocalizations together
- Bangla-Speech2Text2Speech — Bangla Speech to Text & Text to Speech.