Reverse Dependencies of librosa
The following projects have a declared dependency on librosa:
- djmix — no summary
- dls-rvc — no summary
- dmf-utils — DMF's Python package providing reusable functionalities for neuroscience research.
- dnn-tts-torch — This is a library consisting of pre-trained models for the synthesis of Russian and English speech
- dodrio — Data Package for TTS
- dora-gradio — dora-gradio
- dorothy-cci — A Creative Computing Python Library for Interactive Audio Generation and Audio Reactive Drawing
- dreamsound — DreamSound Class for CNN Activation Layer Sonification
- dsbundle — Streamline your data science setup with dsbundle in one effortless install.
- dscleaner — A Python Library to Clean, Preprocess and Convert audio datasets
- e2tts-mlx — Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS in MLX
- easy-vc-dev — no summary
- eeg-ml-pipeline — A Python package for EEG machine learning analysis, classification, and visualization.
- eend — End-to-End Neural Diarization
- elixir-client — Elixir client enables remote execution of python code triggered from a Crucible Plugin on the Signals & Sorcery platform.
- elpis — A library to perform automatic speech recognition with huggingface transformers.
- emocodes — A library designed to accompany the EmoCodes system.
- EmotiVoice — EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
- emphases — Crowdsourced and Automatic Speech Prominence Estimation
- emvoice — Extract emotion expression-related voice features from audio.
- Endeless — Create playlists with seamless music transitions.
- english-asr — An Automatic Speech Recognition(ASR) for English language trained on LibriSpeech dataset using Conformer.
- epanns — Categorise sounds within an audio file
- ertk — Tools for process emotion recognition datasets, extracting features, and running experiments.
- espnet — ESPnet: end-to-end speech processing toolkit
- espnet-onnx — ONNX Wrapper for ESPnet
- eternalblue — A diarization package
- everyvoice — Text-to-Speech Synthesis for the Speech Generation for Indigenous Language Education Small Teams Project
- evoclearn-core — Core tools for Early Vocal Learning simulation.
- example-pkg-zyxstudycs — Try package python code
- exordium — Collection of utility tools and deep learning methods.
- explosion-distance-estimator — Estimate the distance to an explosion from video and audio analysis
- f5-tts — F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
- fasr — FASR: Fast Automatic Speech Recognition
- fast-tts — no summary
- fastaudio — Audio Module for fastai version 2
- FastAudioVisal — A command tool to deal with the recognition in audiovisual domain. It is pipline tool for all of the work.
- FastAudioVisual — A command tool to deal with the recognition in audiovisual domain. It is pipline tool for all of the work.
- fastproaudio — End-to-end audio with fastai
- fastrtc — Stream images in realtime with webrtc
- fastrtc_canary — The realtime communication library for Python - fastrtc with Nvidia's Canary STT
- fastrtc-jp — A module kit for Fast RTC in Japanese
- fastrtc-kroko — The realtime communication library for Python. With support for Kroko-ASR model.
- fastrtc-moonshine-onnx — Fork of moonshine_onnx on pypi. Speech recognition for live transcription and voice commands with the Moonshine ONNX models.
- fastxtend — Train fastai models faster (and other useful tools)
- faunanet — faunanet - A bioacoustics platform for the analysis of animal sounds with neural networks based on birdnetlib
- faunanet-record — Audio Recording Facilities for the isparrow package
- fedot-ind — Time series analysis framework
- ffmpeg-python-utils — Python scripts constructing ffmpeg commands and running them by subprocess.
- fftrack — FFTrack is a Python-based music recognition tool that allows users to identify songs from audio input.
- filemac — Open source Python CLI toolkit for conversion, manipulation, Analysis of files (All major file operations)
- fish-audio-preprocess — Preprocess audio data
- fish-speech-lib — Fish Speech pipeline as library so you don't need to webui.
- fishsound-finder — Python software to automatically detect fish sounds in passive acoustic recordings
- flashtts — A Fast TTS toolkit
- flax-addons — flax addons
- flowtron — Flowtron library
- flwr-datasets — Flower Datasets
- fouriax — A jax port of auraloss
- Fourmodels — A package for comparing four machine learning models
- frogger — no summary
- ftis — The finding things in stuff package.
- funasr — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funasr-onnx — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funasr-runtime — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funasr-torch — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funaudio — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funcodec — FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec
- fungpt — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funllm — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funnmt — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funspeaker — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funspeech — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- funtts — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- furchain — FurChain is an innovative toolkit for creating and interacting with digital personas, complete with voice cloning and role-playing capabilities. It offers a suite of tools for real-time voice manipulation, chatbot creation, and text-based RPG adventures, all while being open-source and operable offline.
- fuzzy-muffler — mp3|ogg|wav audio muffler/fuzifier
- GailBot — GailBot API
- gallama — An opinionated Llama Server engine with a focus on agentic tasks
- GAMuT — Granular Audio Musaicing Toolkit for Python
- genaibook — Utilities for 'Hands-On Generative AI with Transformers and Diffusion Models' (upcoming)
- geniusrise-audio — audio bolts for geniusrise
- gft — GFT (general fine-tuning) A Little Language for Deepnets: 1-line programs for fine-tuning, inference and more
- gft-cpu — GFT (general fine-tuning) A Little Language for Deepnets: 1-line programs for fine-tuning, inference and more
- gllm-docproc-binary — A library for orchestrating the processing of document. Typically in a Gen AI applications (but not limited to just Gen AI).
- gnes — GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network.
- goofi — Real-time neuro-/biosignal processing and streaming pipeline.
- GPT-SoVITS-Infer — Inference code for GPT-SoVITS
- gpt-sovits-python — Python wrapper for fast inference with GPT-SoVITS
- gradio-webrtc — Stream images in realtime with webrtc
- graphite-datasets — tensorflow/datasets is a library of datasets ready to use with TensorFlow.
- grinding-lib — Demo Python Library for Siemens Grinding project
- guitarsounds — A python package to analyze and visualize harmonic sounds
- gurulearn — Comprehensive ML library for model analysis, computer vision, medical imaging, and audio processing with enhanced features including confidence metrics and flowbot integration (modularity introduced) used lazy loader to fix slow loading updated lisence
- h2ogpt — no summary
- hallooworldgk — A basic hello package
- hay-say-common — Constants and methods that are shared between the Hay Say UI and various Docker containers it communicates with.
- hear-savi — An HEAR API for SAVI AudioCNN
- hearbaseline — Holistic Evaluation of Audio Representations (HEAR) 2021 -- Baseline Model
- hezar — Hezar: The all-in-one AI library for Persian, supporting a wide variety of tasks and modalities!
- HMBasr — Automatic Speech Recognition