m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Pythonspeechspeech-recognitionspeech-to-textwhisperasr
This is stars and forks stats for /m-bain/whisperX repository. As of 05 Mar, 2024 this repository has 5685 stars and 495 forks.

WhisperX This repository provides fast automatic speech recognition (70x realtime with large-v2) with word-level timestamps and speaker diarization. ⚡ī¸ Batched inference for 70x realtime transcription using whisper large-v2 đŸĒļ faster-whisper backend, requires <8GB gpu memory for large-v2 with beam_size=5 đŸŽ¯ Accurate word-level timestamps using wav2vec2 alignment đŸ‘¯â€â™‚ī¸ Multispeaker ASR using speaker diarization from pyannote-audio...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
christianhaitian/arkosShellErlangPython890+1700
imihajlow/ccpuAssemblyPythonVerilog59020
aixed/WeChat-HookCJavaScriptC++83402830
streetpea/chiaki4deckCC++Kotlin3100180
Noble-Mushtak/Advent-of-CodeRustHaskellScala4000
wechaty/python-wechaty-getting-startedMakefilePython1580520
GoogleCloudPlatform/kubernetes-engine-samplesHCLPythonShell1.1k01.1k0
fgui/trytond-hyton_holidayHyPython0000
DerpFest-AOSP/vendor_derpMakefileShellPython30520
adobe-research/custom-diffusionPythonShell1.6k01110