A transcribed meeting has a text transcript produced from the recording. The spoken content is now machine-readable and searchable, but the transcript is raw — speaker labels may be wrong, filler words clutter the text, and formatting is whatever the transcription tool produced.
Transcription is the step that converts meeting content from audio or video into text. Quality varies by tool: Whisper produces bare text with timestamps; Fathom adds speaker labels and AI summaries; Otter provides real-time transcription with varying accuracy. Regardless of source, the raw transcript needs editorial attention before it can serve as a reliable reference.
Transcribed → Cleaned: The meeting advances once cleanup has normalized speaker names, calibrated filler words (removed or reduced to a natural rhythm), and standardized formatting. The /transcript-cleanup skill handles this step.
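
To make the three cleanup operations concrete, here is a minimal Python sketch. It is not the implementation of the /transcript-cleanup skill, whose internals are not described here. It assumes each transcript line has the form `Speaker: text` with no timestamps, and the alias map, filler list, and function names (`SPEAKER_ALIASES`, `FILLER_RE`, `clean_line`, `clean_transcript`) are all hypothetical.

```python
import re

# Hypothetical alias map: raw speaker labels (lowercased) -> canonical names.
SPEAKER_ALIASES = {
    "speaker 1": "Alice",
    "alice w.": "Alice",
    "speaker 2": "Bob",
}

# Illustrative filler list; a real cleanup pass would tune this so that
# speech keeps a natural rhythm instead of stripping every hesitation.
FILLER_RE = re.compile(r"\b(?:um+|uh+|erm+)\b[,.]?\s*", re.IGNORECASE)

# Assumed line shape: "Speaker: text".
LINE_RE = re.compile(r"^\s*(?P<speaker>[^:]+?)\s*:\s*(?P<text>.*)$")


def clean_line(line):
    """Normalize the speaker label, drop fillers, and tidy whitespace."""
    match = LINE_RE.match(line)
    if not match:
        # No speaker label found: only trim surrounding whitespace.
        return line.strip()
    raw_speaker = match.group("speaker")
    # Unknown speakers are kept as-is rather than guessed.
    speaker = SPEAKER_ALIASES.get(raw_speaker.lower(), raw_speaker)
    text = FILLER_RE.sub("", match.group("text"))
    text = re.sub(r"\s+", " ", text).strip()
    if text:
        # Re-capitalize in case a removed filler started the sentence.
        text = text[0].upper() + text[1:]
    return f"{speaker}: {text}"


def clean_transcript(raw):
    """Clean every non-blank line and join with single newlines."""
    return "\n".join(clean_line(l) for l in raw.splitlines() if l.strip())
```

For example, `clean_line("speaker 1:  um, so I think uh we should ship")` yields `"Alice: So I think we should ship"`. Speaker labels missing from the alias map pass through unchanged, which keeps the sketch from inventing attributions that the raw transcript does not support.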
Processing track design from [[Linear Processing Stages for Meetings]]↑. Track placement within the three-track status system from [[Status Lifecycle Tracks]]↑.