This tutorial is written with macOS in mind. The same should also be possible to accomplish on Windows, but the exact steps will be different.

1. Downloading the audio tracks

2. Converting audio to text with whisper.cpp

Whisper is an automatic speech recognition (ASR) model created by OpenAI. We can install an optimized open-source version on our computers:

Alternative