Point Whisper at any audio file. The first run downloads the model (142MB for base, 1.5GB for turbo). Start with base to verify everything works, then switch to turbo for real use.
$ whisper meeting.mp3 --model base
Useful flags:
$ whisper meeting.mp3 --model base --language en # skip auto-detect, faster
$ whisper meeting.mp3 --model base --task translate # any language → English
$ whisper meeting.mp3 --model base --output_format txt # text only, no subtitles
What you should see
100%|████████████| 142M/142M [00:08<00:00]
[00:00.000 --> 00:05.200] The meeting started at nine.
[00:05.200 --> 00:11.400] Action items for this week...
✓ done.
CheckpointDid Whisper print timestamped lines to the terminal?