Why Video Podcasting Has Basically Taken Over (and What It Means for Audio-Only Shows)

There's a moment in most industries where a technology shift stops being a trend and becomes the baseline — the new floor that everyone's working from, whether they acknowledge it or not. That moment happened in podcasting sometime around 2023 or 2024, and it had to do with video. The podcast world went from being almost entirely an audio medium to being something more complicated, more visual, and in many ways more competitive than most podcasters were prepared for. And if you're still treating your podcast like it's 2017 — record some audio, drop it in a feed, move on — you're operating on a set of assumptions that the industry has genuinely moved past.

Let's start with the actual numbers because they're hard to argue with. YouTube is now the number-one podcast platform in the United States. Full stop. It holds about 33% market share among weekly podcast listeners — well ahead of Spotify at 26% and Apple Podcasts at 14%. Those three platforms together account for about 64% of all US podcast listening. YouTube reported over one billion monthly podcast viewers in early 2025. One billion people. That's not a niche behaviour or an edge case. That's a mainstream shift in how people consume content that was once almost exclusively audio.

And the preference data is even more striking when you break it down by age. Among listeners under 35, the preference for watching rather than just listening to podcasts skews dramatically toward video. Roughly 53% of new weekly podcast listeners in the US say they prefer watching rather than hearing a podcast as of late 2025. That number was just 30% in April 2022. Three years. The shift happened in three years. And it happened because several large forces converged at roughly the same moment.

The Joe Rogan Effect (But Not in the Way You Think)

Everyone credits Joe Rogan with the video podcast revolution, and there's some truth to it, but the causal story is more interesting than "Rogan did video so everyone else did too." Rogan's show normalized a specific visual grammar — two or more people seated around a table in a long, unfiltered conversation, shot with multiple cameras, published in full on YouTube. The visual experience wasn't cinematic or produced in any Hollywood sense. It was simple, almost documentary-like. But what it communicated was realness. Viewers could watch how the host and guest related to each other, read body language, see reactions that audio alone couldn't carry. That grammar became the template.

What Rogan demonstrated wasn't just that video podcasting was possible — it was that video added something genuinely valuable to long-form conversation content. A three-hour conversation on audio requires enormous trust in the speakers' ability to hold attention without visual anchoring. The same conversation on video can be consumed in a completely different cognitive mode. You can look away and come back. You can catch a gesture or an expression that changes the meaning of what was just said. The video layer creates redundancy that makes the content easier to absorb over long durations.

Other major shows followed — Lex Fridman's interviews, My First Million, Huberman Lab, Tim Ferriss, the call her daddy era, all the major business and culture shows. But what really changed the industry wasn't the biggest shows going video. It was the medium-sized and independent shows going video, discovering that YouTube's recommendation engine treated their content like any other video on the platform — which meant it could be pushed to people who had never heard of the show before.

Why YouTube Changes the Discovery Math Entirely

This is the most important practical reason for the shift, and it's the one that gets talked about least in terms of its specific implications. Audio podcast platforms — Apple Podcasts, Spotify, even Pocket Casts — are subscription-first environments. Their primary function is to surface content to people who have already opted in or who are actively searching. Discovery through algorithm is limited. Spotify has improved this with curated playlists and podcast recommendations, but neither Spotify nor Apple can do what YouTube does: show your content to someone who has never heard of you based purely on behavioral signals that suggest they might like it.

YouTube's recommendation engine is one of the most powerful content distribution mechanisms ever built. It watches what people watch, for how long, what they watch next, what they search for, what they skip — and it uses all of that to decide what to put in front of people next. When a video podcast episode gets recommended to someone who has never heard of the show, that person becomes a potential listener. When a short clip from that episode gets posted to YouTube Shorts and it performs well, it reaches a completely different population of people who are being pushed content, not seeking it.

Audio podcast platforms cannot offer this. Getting discovered on Apple Podcasts is hard. The search is functional but not powerful. The "Top Charts" are dominated by massive established shows. Algorithmic recommendation outside of search is minimal. If you're building an audience on audio platforms alone, your growth depends almost entirely on word of mouth, cross-promotion, and PR. All of those things are valuable and necessary — but they have a ceiling that YouTube's algorithm doesn't impose in the same way.

This explains why shows that publish on YouTube consistently report discovering that a meaningful percentage of their new audience finds them there rather than through audio platforms. The video layer isn't just adding a visual dimension — it's opening a new discovery channel that reaches people who would otherwise never encounter the show.

The Short-Form Clip Machine

The other major driver of the video podcast shift is something so practical it might seem almost too obvious: short-form clips. When you record a video podcast, every interesting moment — the sharp insight, the funny exchange, the surprising counterargument, the emotional reveal — is automatically a potential 60-90 second vertical video clip for Instagram Reels, TikTok, YouTube Shorts, and LinkedIn. You didn't have to do anything extra to create that content. It already exists in the footage you shot.

For audio-only podcasters, the equivalent of clips are audiograms — static images with an animated waveform and captioned text. They're fine. They do the job of communicating that there's audio content attached. But they perform nowhere near as well as actual video clips in terms of reach and engagement. When platforms like TikTok, Instagram, and YouTube Shorts are all optimization for video content, an audiogram competes at a significant disadvantage against actual video on those platforms.

The math of this is significant. A video podcast that generates 10 clips per episode has 10 chances at algorithmic discovery per week on short-form platforms, entirely separate from the main episode's performance. Some clips will land with modest numbers. Occasionally one will break out and introduce the show to tens of thousands of people who had no previous awareness of it. The cumulative effect of this, compounded over dozens or hundreds of episodes, creates a discovery asset that audio-only shows simply can't replicate through equivalent effort.

What About Audio-Only? Is It Actually Dead?

No — and this is worth saying clearly because the video evangelists can get pretty absolutist about it. Audio-only podcasting is not dead. The global podcast audience hit around 619 million listeners in 2026 with continued year-over-year growth. Listening during commutes, exercise, cooking, and other "eyes-occupied" activities isn't going anywhere, because those activities still happen and people still want audio company during them.

The real change is that the question "should I add video?" is no longer an optional consideration. It's a strategic calculation with real consequences. The shows that aren't on YouTube aren't dead, but they're working without one of the most powerful discovery tools available in the medium. Their growth has to come from other sources — and those other sources are all harder to scale.

There's also a budget and capacity reality that matters. Not every podcaster has access to a multi-camera studio setup with professional lighting. For someone recording in a home office with a single USB mic and no video gear, adding video is a meaningful operational jump, not just a toggle to flip. The honest answer for those creators is that audio-first with clips wherever possible is a reasonable starting point — and that upgrading to a proper video setup when the show has enough traction to justify the investment is a real path.

But the window for launching an audio-only show and expecting meaningful organic growth without a video presence is narrowing. The shows that are building audiences fastest right now are almost universally multi-platform with a strong YouTube component. That's not a coincidence.

What Video Podcasting Actually Changes in Practice

For podcasters who make the video transition, the production implications are more significant than they might initially appear. It's not just "add cameras." A video podcast that's going to live on YouTube needs to be designed for a viewer, not just a listener — which means lighting, set design, camera angles, host positioning, and visual composition all become things you have to make decisions about. The audio-for-audio workflow and the audio-for-video workflow look completely different.

Good video podcast lighting means the subject is well-lit without harsh shadows, the background is intentional rather than accidental, and the image has a visual quality that communicates professionalism without needing to be cinematic. A three-point lighting setup, a well-dressed background with some depth, and cameras positioned at eye level — these things matter because they determine whether a viewer's visual experience of the episode is pleasant and trustworthy or distracting and low-effort.

Multi-camera shooting matters because it creates visual interest for the full episode and gives the editor something to work with. Cutting between a wide shot and close-ups, reacting shots and talking shots, keeps the visual experience from being static in a way that actively drives viewer fatigue over a 60-minute episode. A single locked-off camera for a two-hour conversation asks the viewer to watch essentially a static image for two hours. Some shows get away with this because the content is compelling enough — but it's a tax on the viewer's attention that doesn't need to be paid.

The editing workflow for video podcasts is also more demanding than audio editing. Syncing multiple camera angles, color grading for consistency, generating captions, cutting clips for social — all of this takes more time than a clean audio edit. Most podcasters who go video end up either investing in an editing team or working with a studio that handles production, because the operational load of doing it yourself is substantial.

The Platform Landscape Is Shifting to Meet This Reality

One of the clearest signals that video podcasting isn't a trend but a structural shift is what the major platforms have done in response. Apple Podcasts, which resisted video for years, began supporting video episodes in early 2026. Spotify continued investing heavily in its video podcast infrastructure. YouTube launched dedicated podcast features — a podcast section in the app, podcast-specific playlists, host attribution tools — that signaled clearly how seriously the platform takes this category.

The global podcast advertising market is projected to hit around $5 billion in 2026, up nearly 20% year-over-year. A meaningful share of that is video podcast advertising. Advertisers who follow audience attention are increasingly willing to pay for video podcast placements because the context — lean-forward, engaged, high-trust — is valuable for brand building in ways that programmatic digital ads aren't.

The direction of travel is clear enough that it's worth orienting around it explicitly: the future of podcasting is multi-platform, video-included, clip-driven, and YouTube-aware. That doesn't mean audio is an afterthought, but it does mean that audio-only strategies need to be very deliberate about the growth mechanisms they're relying on, since they're working without the most powerful one in the current ecosystem.

Making the Practical Decision

If you're starting a podcast today, the question isn't whether to do video — it's what level of video investment makes sense for your current stage. Recording in a professional studio that handles multi-camera video, lighting, and audio simultaneously is the highest-quality, most efficient path. You show up, the gear is set up, you record, everything is captured. You leave with raw footage that can be edited into a full episode, a YouTube upload, clips for every platform, and a full visual record of the conversation.

For established audio-only shows considering the switch, the transition is operationally significant but manageable with the right support. The audience you've already built is an asset — they'll follow you to video if you communicate the shift clearly and maintain the content quality they came for. The new audience you'll find through YouTube is genuinely incremental.

For everyone in the middle — thinking about starting something, evaluating whether to upgrade an existing show — the data points in one direction. Approximately 50.6% of podcast shows now post full video content on YouTube as of 2026, up 130% from 2022 levels. The shows that haven't made this move are in a shrinking minority, and the gap in discovery capacity between video and audio-only shows is getting wider, not narrower.

The audio medium isn't going away. But the most strategic move in podcasting today is treating audio as part of a broader content system, not the whole thing.

Previous
Previous

The Psychology Behind Why Some Podcasts Feel Impossible to Stop Listening To

Next
Next

The Real Reason Most Podcasts Die Before Episode 10