Skip to main content

OpenAI Whisper Transcription

You can transcribe content on the supercomputers using OpenAI’s “Whisper” Speech-to-Text model.

Below is an example transcribing a sample wav file with GPU acceleration (required for this mamba env):

$ salloc -G 1 -p htc --mem=20G -t 240
$ module load mamba/latest
$ source activate openai-whisper
$ wget https://voiceage.com/wbsamples/in_mono/Conference.wav
$ whisper Conference.wav
[00:00.000 --> 00:01.000] This is Peter.
[00:01.000 --> 00:02.000] This is Johnny.
[00:02.000 --> 00:03.000] Kenny.
[00:03.000 --> 00:04.000] Good job.
[00:04.000 --> 00:04.880] We just wanted to take a minute to thank you.