Reverse Dependencies of openai-whisper
The following projects have a declared dependency on openai-whisper:
- aibo — aibo: AI partner that can run offline
- aij — AI Journalist
- algorin-cli — Access to GPT-3 and document processing from the command line.
- aniemore — Aniemore (Artem Nikita Ilya EMOtion REcognition) is a library for emotion recognition in voice and text for the Russian language.
- asrp — no summary
- audio-journal — CLI tool to transcribe audio and store it in Notion
- audiomind — no summary
- AudioSummariser — Summarises the text generated from audio files for quicker resolution. The audio files are currently customer support recordings, but the use case can be extended to other domains. Sentiment is analysed and depicted visually.
- audiotranscription — no summary
- auto-subtitle-llama — Automatically generate, translate and embed subtitles into your videos
- autocut-sub — Cut video by subtitles
- autotranscribe — An auto transcription service for YouTube and normal videos.
- blindai — BlindAI Core / API is an open-source and easy-to-use Python library allowing you to query AI models with assurances that your private data will remain private
- bnw-tools — Tools developed in the BorgNetzWerk project for the extraction, analysis and publication of knowledge.
- buzz-captions — no summary
- conversations — no summary
- convopilot — An AI tool to help users better navigate conversations.
- copypy — Video Transcription
- corava — Python project for development of a Conversation Optimized Robot Assistant (CORA). CORA is a voice assistant that is powered by openai's chatgpt for both user intent detection as well as general LLM responses.
- deepsearchai — no summary
- digestvid — A tool to transcribe and summarize video content.
- DubSplitter — an easy tool to split dubs based on given silence
- easy-whisper — An easy to use adaption of OpenAI's Whisper, with both CLI and (tkinter) GUI, faster processing even on CPU, txt output with timestamps.
- easy-whisper-local — no summary
- echo-artistry — EchoArtistry is an innovative tool that transforms spoken words into captivating visual stories.
- essence-extractor — Unleash the power of content transformation with EssenceExtractor, a dynamic tool that turbocharges your workflow, turning YouTube videos into engaging, readable blog posts in a snap!
- farm-haystack — LLM framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data.
- farm-haystack-speech2text — Haystack node to convert audio files into Documents.
- fish-audio-preprocess — Preprocess audio data
- foxlator-lib — Library backend for foxlator
- frogbase — FrogBase simplifies the download-transcribe-embed-index workflow for multi-media content. It does so by linking content from various platforms with speech-to-text models, image & text encoders and embedding stores.
- funasr — FunASR: A Fundamental End-to-End Speech Recognition Toolkit
- GailBot — GailBot API
- gpt3discord — A Chat GPT Discord bot
- hspylib-askai — HomeSetup - AskAI
- jac-speech — no summary
- JarvisAI — JarvisAI is a Python library to build your own AI virtual assistant with natural language processing.
- khoj-assistant — An AI copilot for your Second Brain
- langs-vall — vall-e-x package for a language translation project
- langsearch — Easily create semantic search based LLM applications on your own data
- live_illustrate — Live-ish illustration for your role-playing campaign
- live-transcribe — Real-time audio transcription. Runs OpenAI's Whisper locally.
- liveTranscriberGenx — To do live transcription.
- llsubtitles — Use OpenAI's whisper to generate subtitles in multiple languages for the purpose of language learning
- luis-v-subtitler — A Python package to use AI to subtitle any video in any language
- lyrics-transcriber — Automatically create synchronised lyrics files in ASS and MidiCo LRC formats with word-level timestamps, using Whisper and lyrics from Genius and Spotify
- manim-voiceover — Manim plugin for all things voiceover
- Marketingtool — A tool module to help you do marketing
- mexca — Emotion expression capture from multiple modalities.
- mmdiary — Multimedia Diary Tools
- mosamaticdesktop — Desktop tool for analyzing medical images
- opaw — Unofficial python wrapper of OpenAI API.
- opendatagen — Data preparation system to build controllable AI system
- oraculo — A project to use Sentence Transformers and embeddings to make a pocket search engine
- podcast-summarizer — Summarizes podcasts.
- proto-clip-toolkit — A simple toolkit from the Proto-CLIP demo that provides speech recognition, part-of-speech tagging and real-world robot demo APIs.
- puddl — no summary
- purpose-transcribe — A CLI tool for transcribing audio files
- pyaligner — Automatic audio transcriber and audio-text aligner
- pychatgpt-gui — pyChatGPT GUI - is an open-source, low-code python GUI wrapper providing easy access and swift usage of Large Language Models (LLMs) such as ChatGPT, AutoGPT, LLaMa, GPT-J, and GPT4All with custom-data and pre-trained inferences.
- pyscreech — PyScreech - Audio Performance Library
- qai-hub-models — Models optimized for export to run on device.
- quilbert — Friendly ai voice assistant
- s3prl-vc — Voice conversion toolkit based on S3PRL: Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
- scrit — command line tool to manage transcription of system audio with openai whisper
- selenium-recaptcha — reCaptcha v2 solver for selenium
- speechtoolkit — ML for Speech presents SpeechToolkit, a unified, all-in-one toolkit for TTS, ASR, VC, & other models.
- speechtotext-python — Python package to benchmark speech2text models.
- spotify-translator — Generate lyric translations and transcriptions from Spotify URLs using OpenAI's Whisper model.
- stream-translator-gpt — Command line tool to transcribe & translate audio from livestreams in real time
- subaligner — Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers.
- symbolicai — A Neuro-Symbolic Framework for Python
- tafrigh — Transcribe text and create SRT and VTT files using Whisper models and wit.ai technology.
- talk-summarizer — Python library to summarize talks
- talk2pdf — no summary
- talkgpt4all — A voice chatbot based on GPT4All and OpenAI Whisper, running on your PC locally
- testgailbot002 — GailBot API
- testgailbotapi — GailBot Test API
- testgailbotapi001 — GailBot Test API
- Tetra-Model-Zoo — Models optimized for export to run on device.
- transcription-diff — Speech to transcription comparison
- uniteai — AI, Inside your Editor.
- vall-e-x — An open source implementation of Microsoft's VALL-E X zero-shot TTS
- verbatim — high quality multi-lingual speech to text
- video2sub — Transcribes video/audio/url to subtitles.
- VocalForge — Your one-stop solution for voice dataset creation
- whisper-clipboard — A basic TUI for transcribing audio to your clipboard using OpenAI's whisper models.
- whisper-dictation — no summary
- whisper-live — A nearly-live implementation of OpenAI's Whisper.
- whisper-mic — Whisper for your microphone
- whisper-pyannote-fusion — Fuse whisper and pyannote results
- whisper-s2t — An Optimized Speech-to-Text Pipeline for the Whisper Model.
- whisper-timestamped — Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word timestamps, access to language detection confidence, several options for Voice Activity Detection (VAD), and more.
- whisper-voice-commands — Execute scripts with Whisper for your microphone
- whisper2subs — Transcribes audio using Whisper and translates it using DeepL.
- whisperer-ml — Go from raw audio to a text-audio dataset with OpenAI's Whisper
- whisperspeech — An Open Source text-to-speech system built by inverting Whisper
- yena — My yena Python package
- youtube2srt — Generates high-quality subtitles in SRT format for YouTube videos using openai-whisper by processing the audio content. Provides an option to translate the generated subtitles into English.
- yt-audio-collector — Create a Hindi-language dataset for speech recognition from YouTube