Reverse Dependencies of librosa
The following projects have a declared dependency on librosa:
- pec-dss — Paralinguistic Event Classification from Diarized Speaker Segments
- penn — Pitch Estimating Neural Networks (PENN)
- perch-hoplite — Tooling for agile modeling on large machine perception embedding databases.
- persianttsfarsi — A Python library for Persian text-to-speech using Microsoft Azure service.
- phasefinder — rotational beat estimation model
- piano-transcription-inference — Piano transcription inference toolbox
- pianoputer — Use your computer keyboard as a "piano"
- pipepal — PipePal is a Python package that simplifies building pipelines for speech and voice analysis.
- pitch-detectors — collection of pitch detection algorithms with unified interface
- pitchsqueezer — Robust pitch tracker for speech, using synchrosqueezing and spectral autocorrelation
- pliers — Multimodal feature extraction in Python
- plixkws — Plug-and-Play Multilingual Few-shot Spoken Words Recognition
- ploppie — A high-level, stupid-simple Pythonic LiteLLM abstraction layer for implementing simple chat workflows, with tools.
- plot-wav — no summary
- Plotting-funcs — Auxiliary functions for plotting purposes mainly.
- podcast-teaser — Automatically generate engaging audio teasers from podcast episodes
- PolUVR — Easy to use audio stem separation with a UI, using various models from UVR trained primarily by @Anjok07
- polyglotdb — no summary
- pop-autocar3 — AIoT AutoCar3 library for pop
- pop-serbot2 — AIoT serbot2 library for pop
- ppacls — Audio Classification toolkit on PaddlePaddle
- ppaudio — An audio classification toolkit based on PaddlePaddle for detecting abnormal sounds
- ppgan — Awesome GAN toolkits based on PaddlePaddle
- ppgs — Phonetic posteriorgrams
- ppser — Speech Emotion Recognition toolkit on PaddlePaddle
- ppvits — VITS toolkit on PaddlePaddle
- praudio — Complex preprocessing of entire audio datasets with 1 command
- precountify — A tool for pre-countifying
- promonet — Prosody Modification Network
- prowav — The package for preprocessing wave data
- pruna — Smash your AI models
- psy-detector — A real-time sigh detection system using audio processing
- pumpp — A practically universal music pre-processor
- puretalk — Text-to-Speech (TTS) with natural human voice involves converting written text into spoken words using advanced machine learning models. These models are trained to produce speech that closely mimics the nuances, intonations, and rhythms of human speech, making the output sound more natural and lifelike.
- py-data-juicer — A One-Stop Data Processing System for Large Language Models.
- py3-ttsmms — Text-to-speech with The Massively Multilingual Speech (MMS) project
- pyabelab — Libraries that are likely to be used frequently in ABELAB.
- pyampact — pyAMPACT (Python-based Automatic Music Performance Analysis and Comparison Toolkit) is a python package that links symbolic and audio music representations to facilitate score-informed estimation of performance data in audio as well as general linking of symbolic and audio music representations with a variety of annotations.
- Pyara — Library for audio classification
- pyaudioaugment — no summary
- pyaudioclassification — Dead simple audio classification
- pyAudioKits — Powerful Python audio workflow support based on librosa and other libraries
- pyclarity — Tools for the Clarity Challenge
- pyfinch — A python package for analyzing neural & bioacoustics signals from songbirds
- pyfoal — Python forced aligner
- PyHa-test — A python package for automatically detecting species and comparing to ground truth
- pylatentsync — A python package for LatentSync
- pylights — Module used to change the color and brightness of lights to the beat of an audio file
- pyliz-ai — Library to interact with local/remote LLM.
- pymatchmaker — A package for real-time music alignment
- pymcd — Calculate Mel-Cepstral Distortion (MCD)
- pymixing — A simple daw in python.
- pymouth — Live2D Mouth-sync artifact
- pymss — Python package for music source separation.
- pymusickit — A Python package for music analysis. Keyfinder Forked from "https://github.com/jackmcarthur/musical-key-finder"
- pymusiclooper — Repeat music endlessly and create seamless music loops, with play/export/tagging support.
- pyneuralfx — A python package for neural audio effect
- pysaten — Detect silence segment from speech signal.
- pyscreech — PyScreech - Audio Performance Library
- pysilero — no summary
- pysoundtool — A research-based framework for exploring sound as well as machine learning in the context of sound.
- python-dataset — no summary
- pyvad — py-webrtcvad wrapper for trimming speech clips
- pyvoice — A real-time speech-to-text transcription tool using machine learning (NumPy), PyQt6, and faster-whisper.
- pyw2v2 — Simple wav2vec2 wrapper
- qai-hub-models — Models optimized for export to run on device.
- quantumaudio — A Python package for building Quantum Representations of Digital Audio. Developed by Moth Quantum.
- qwen-omni-utils — Qwen Omni Language Model Utils - PyTorch
- radtts — RADTTS library
- rapid-paraformer — Tool of speech recognition.
- realbook — Realbook, a library to make using audio on TensorFlow easier.
- regsets — A collection of regression datasets with PyTorch-like dataset classes.
- resemble-enhance — Speech denoising and enhancement with deep learning
- Resemblyzer — Analyze and compare voices with deep learning
- rest-api-supporter — Rest api supporter
- reviutils — A common library frequently used on python
- rlmc — Python utils for AI 🚀
- robobo-emotion — Library for detecting emotions in images and audio using Robobo
- rstojnic-tfds-nightly — tensorflow/datasets is a library of datasets ready to use with TensorFlow.
- rt-pie — Real Time PItch Estimator
- rtst — no summary
- rtvamp — Vamp plugin host for real-time audio feature analysis
- rtvc — Real-Time Voice Conversion GUI
- runes-client — Runes client enables remote execution of python code triggered from a Crucible Plugin on the Signals & Sorcery platform.
- ruptures — Change point detection for signals in Python.
- ruth-text-to-speech — A Python CLI for Ruth NLP
- ruth-tts-converter — A Python CLI for Ruth NLP
- ruth-tts-converter-python — A Python CLI for Ruth NLP
- rvc — An easy-to-use Voice Conversion framework based on VITS.
- rvc-dv-clone — Retrieval-based Voice Conversion library
- rvc-infer — Python wrapper for inference with rvc
- rvc-inferpy — Easy tools for RVC Inference
- rwave — no summary
- s3a-decorrelation-toolbox — Decorrelation algorithm and toolbox for diffuse sound objects and general upmix
- s3prl-vc — Voice conversion toolkit based on S3PRL: Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
- sadtalker-z — sadtalker
- sagemaker-huggingface-inference-toolkit — Open source library for running inference workload with Hugging Face Deep Learning Containers on Amazon SageMaker.
- saigen-dep-test — A test of using dependencies
- saigen-dep-test-with-poetry — no summary
- samosila-core — no summary