Instantcasts -- podcast transcripts and summaries in seconds

Learn how to process an hour‑long podcast in under ten seconds using optimized Whisper inference, covering model tweaks, audio storage, chunking, prompting, and diarization.

Overview

I’ve been working on a fun side project around fast Whisper inference that takes the URL to a podcast and, in <10 seconds for an hour-long show, generates a transcript and summary. The actual application is super basic, but it showcases some advanced stuff around optimizing Whisper inference at both a model level and an infra level (e.g. where does the podcast audio file live? it matters!)

Tech stack