Why Video Podcasting Has Basically Taken Over (and What It Means for Audio-Only Shows)

Not that long ago, podcasting was a purely audio thing. You'd record into a mic, export an MP3, upload it somewhere, and that was that. The idea of anyone watching a podcast was a bit of a joke — like watching someone listen to the radio. Then YouTube happened to podcasting, and now the whole industry is dealing with a shift it didn't fully see coming.

Here's where things stand: as of late 2025, 53% of new weekly podcast listeners in the US say they prefer watching a podcast over just listening to one. That number was 30% back in April 2022. In roughly three years, video went from a nice-to-have to the default expectation for a huge chunk of the audience. And if you look at platform rankings, YouTube is now the number-one podcast destination in the country, ahead of Spotify and Apple Podcasts — which is wild when you think about the fact that YouTube wasn't even really considered a "podcast platform" five years ago.

So what actually drove this? A few things collided at once.

First, the Joe Rogan effect isn't going away. When millions of people watched Rogan and his guests across a three-hour table for years, it normalized the visual format. The audience learned to read body language, facial reactions, and the chemistry between people in a way that audio alone couldn't deliver. That expectation got baked into the format.

Second, content creators who were already building on YouTube started doing interview-style shows and discovered it worked really well there. People like Andrew Huberman, Lex Fridman, and the guys behind My First Million built massive audiences partly because YouTube's recommendation engine treated their content like any other video — meaning it pushed it to people who'd never heard of them before. That kind of discoverability just doesn't exist on Apple Podcasts or Spotify the same way.

Third, and maybe most practically: short-form clips. When you record a video podcast, you can clip out five, ten, fifteen moments per episode and post them to Instagram Reels, YouTube Shorts, TikTok, LinkedIn. If you're audio-only, you can do audiograms — those static image things with a waveform bouncing — but they perform nowhere near as well as an actual video clip. So video podcasters suddenly had a social media engine built into their production process, while audio-only shows were working twice as hard for half the reach.

That said, declaring audio dead is way overstating it. There are still massive audio-first podcasts doing extremely well, and the listening context matters a huge amount. People at the gym, in the car, on a walk — they're not watching anything. Audio consumption is still growing in absolute terms, even if video is growing faster. The global podcast audience hit around 619 million listeners in 2026, up from where it was the year before, and most of those people aren't sitting at a desk staring at a screen while they listen.

What the shift really means is that the production decision has gotten more loaded. Starting a new show today, you basically have to ask yourself whether you're building for ears or eyes (or both), because those are genuinely different choices that affect your studio setup, your editing workflow, your thumbnail strategy, your clip approach, and where you're going to be posting.

The shows that are adapting the best tend to do one of a few things. Some go all-in on video-first and treat the audio as almost a byproduct — they optimize for YouTube pacing, they invest in good lighting, they obsess over thumbnails. Others keep audio at the center but add a "watch" version with a static or lightly-animated frame, mostly to capture that discovery channel on YouTube without rebuilding their whole workflow. And a smaller group is doing genuine multi-platform distribution where they're creating platform-specific versions of the same content.

One thing that's become increasingly clear: video podcasting is up to 50% more engaging than audio-only uploads on YouTube. That's a significant engagement difference, and it feeds directly into how the algorithm treats your content. A video podcast that holds attention well gets recommended more, which builds the audience faster, which gives you better data on what's working, which makes the show better. It compounds.

For audio-only diehards, the counterargument usually goes like this: not everyone can or wants to be on camera. There are plenty of podcasters who are genuinely better communicators when they're not thinking about how they look. The camera adds anxiety for some people. And there are formats — long, dense, research-heavy episodes — where the visual element adds almost nothing.

Fair points, all of them. But the discoverability gap is getting harder to ignore. If YouTube is pushing podcast content into feeds and your show isn't there at all, you're ceding an entire discovery channel. That doesn't automatically kill an audio-only show, but it does mean your growth strategy has to work harder everywhere else.

The honest answer for most people starting out in 2026 is: at least record video, even if you're not fully committing to a YouTube strategy yet. You can always use it later. You can't go back and add cameras to episodes you already recorded. And the cost of adding video to a professional setup is often much lower than people assume — especially when you're already booking a proper studio with lighting and multi-camera rigs built in.

The medium is still podcasting. The screen is just optional — but it's an option that's getting harder to leave on the table.

Previous
Previous

The Rise of the Corporate Podcast and What It's Quietly Changing About Marketing

Next
Next

Solo vs. Co-Host vs. Interview: What the Data Actually Says About Format