Reverse Dependencies of librosa
The following projects have a declared dependency on librosa:
- cerbogen — A Python package for CerboTech
- charmory — Adversarial Robustness Evaluation Library
- chbpm — A Python library for BPM adjustment in audio files
- chenhuiming — a private package
- ChildProject — LAAC@LSCP
- ChimpDrummingDetector — A program for detecting chimpanzee drumming in long-term rainforest recordings
- chisel4ml — A Chisel based hardware generation library for highly quantized neural networks.
- chord-extractor — Python library for extracting chords from multiple sound file formats
- chtk — A data science toolkit for working with Clone Hero charts.
- CLTranscriptor — Wav2Vec2-based transcriptor fine tuned on chilean lessons
- code-video-generator — Generate videos that walkthrough code
- cody-adapter-transformers — A friendly fork of HuggingFace's Transformers, adding Adapters to PyTorch language models
- common-ml-functions — A python package for out-of-the-box ML solutions
- commonvoice-pinyin — Common Voice Chinese dataset for PyTorch, Pinyin, and TTS
- conch-sounds — Analyze acoustic similarity in Python
- concrete-ml-extensions-brevitas — Quantization-aware training in PyTorch
- conversationalnlp — Your main project
- coqui-tts — Deep learning for Text to Speech.
- covid-detection — this a covid detection with audio ml model
- crepe-notes — Post-processing for CREPE to turn f0 pitch estimates into discrete notes e.g. MIDI
- crowsetta — A Python tool to work with any format for annotating animal vocalizations and bioacoustics data
- cstx — Description of the cstx package
- ctc-chroma — CTC-based chroma feature exractors
- d3net-spleeterweb — Unofficial Python package of D3Net implementation by Sony Research AI, used in Spleeter Web.
- danspeech — Speech recognition for Danish
- das — DAS
- das_unsupervised — Tools for unsupervised classification of acoustic signals.
- dasp-pytorch — Differentiable audio processors in PyTorch.
- datasets — HuggingFace community-driven open-source library of datasets
- dawnet-client — DAWNet client enables remote execution of python code triggered from a DAW.
- dcase-models — Python library for rapid prototyping of environmental sound analysis systems
- ddsp — Differentiable Digital Signal Processing
- deep-copilot — gyw toolkits
- deeprhythm — A fast, accurate Tempo Predictor
- DeepSpectrum — no summary
- DeepSpectrumLite — no summary
- deepss — DeepSS
- deepss_unsupervised — Tools for unsupervised classification of acoustic signals.
- delta-nlp — DELTA is a deep learning based natural language and speech processing platform.
- descript-audiotools — Utilities for handling audio.
- desktop-env — The package provides a desktop environment for setting and evaluating desktop automation tasks.
- dfpwm — DFPWM convertor for Python
- dienen — Train deep neural networks using configuration files
- diffsptk — Speech signal processing modules for machine learning
- diffusers — State-of-the-art diffusion in PyTorch and JAX.
- diffusers-unchained — Diffusers
- diffusersv — State-of-the-art diffusion in PyTorch and JAX.
- djmix — no summary
- dls-rvc — no summary
- dreamsound — DreamSound Class for CNN Activation Layer Sonification
- dscleaner — A Python Library to Clean, Preprocess and Convert audio datasets
- easy-vc-dev — no summary
- elixir-client — Elixir client enables remote execution of python code triggered from a Crucible Plugin on the Signals & Sorcery platform.
- elpis — A library to perform automatic speech recognition with huggingface transformers.
- emocodes — A library designed to accompany the EmoCodes system.
- EmotiVoice — EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
- emphases — Crowdsourced and Automatic Speech Prominence Estimation
- emvoice — Extract emotion expression-related voice features from audio.
- Endeless — Create playlists with seamless music transitions.
- english-asr — An Automatic Speech Recognition(ASR) for English language trained on LibriSpeech dataset using Conformer.
- ertk — Tools for process emotion recognition datasets, extracting features, and running experiments.
- espnet — ESPnet: end-to-end speech processing toolkit
- espnet-onnx — ONNX Wrapper for ESPnet
- everyvoice — Text-to-Speech Synthesis for the Speech Generation for Indigenous Language Education Small Teams Project
- evoclearn-core — Core tools for Early Vocal Learning simulation.
- example-pkg-zyxstudycs — Try package python code
- exordium — Collection of utility tools and deep learning methods.
- fast-tts — no summary
- fastaudio — Audio Module for fastai version 2
- FastAudioVisal — A command tool to deal with the recognition in audiovisual domain. It is pipline tool for all of the work.
- FastAudioVisual — A command tool to deal with the recognition in audiovisual domain. It is pipline tool for all of the work.
- fastproaudio — End-to-end audio with fastai
- fastxtend — Train fastai models faster (and other useful tools)
- faunanet — faunanet - A bioacoustics platform for the analysis of animal sounds with neural networks based on birdnetlib
- fedot-ind — Time series analysis framework
- ffmpeg-python-utils — Python scripts constructing ffmpeg commands and running them by subprocess.
- fftrack — FFTrack is a Python-based music recognition tool that allows users to identify songs from audio input.
- fish-audio-preprocess — Preprocess audio data
- flax-addons — flax addons
- flowtron — Flowtron library
- flwr-datasets — Flower Datasets
- fouriax — A jax port of auraloss
- Fourmodels — A package for comparing four machine learning models
- frogger — no summary
- ftis — The finding things in stuff package.
- funasr — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funasr-onnx — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funasr-runtime — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funasr-torch — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funaudio — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funcodec — FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec
- fungpt — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funllm — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funnmt — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funspeaker — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funspeech — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funtts — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- furchain — FurChain is an innovative toolkit for creating and interacting with digital personas, complete with voice cloning and role-playing capabilities. It offers a suite of tools for real-time voice manipulation, chatbot creation, and text-based RPG adventures, all while being open-source and operable offline.
- fuzzy-muffler — mp3|ogg|wav audio muffler/fuzifier
- GailBot — GailBot API