Insanely Fast Whisper - Transcribe 300 minutes (5 hours) of audio in less than 98 seconds (github.com) en
Whisper Large-v3 Release (github.com) en
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification....
The weights for Show-1 have been released (github.com) en
Abstract:...