This Week's Sponsor:

SoundSource

New Year, New Audio Setup: SoundSource 6 from Rogue Amoeba


Posts tagged with "podcasts"

How I Used Claude to Build a Transcription Bot that Learns From Its Mistakes

Step 1: Transcribe with parakeet-mlx.

Step 1: Transcribe with parakeet-mlx.

[Update: Due to the way parakeet-mlx handles transcript timeline synchronization, which can result in caption timing issues, this workflow has been reverted to use the Apple Speech framework. Otherwise, the workflow remains the same as described below.]

When I started transcribing AppStories and MacStories Unwind three years ago, I had wanted to do so for years, but the tools at the time were either too inaccurate or too expensive. That turned a corner with OpenAI’s Whisper, an open-source speech-to-text model that blew away other readily available options.

Still, the results weren’t good enough to publish those transcripts anywhere. Instead, I kept them as text-searchable archives to make it easier to find and link to old episodes.

Since then, a cottage industry of apps has arisen around Whisper transcription. Some of those tools do a very good job with what is now an aging model, but I have never been satisfied with their accuracy or speed. However, when we began publishing our podcasts as videos, I knew it was finally time to start generating transcripts because as inaccurate as Whisper is, YouTube’s automatically generated transcripts are far worse.

VidCap in action.

VidCap in action.

My first stab at video transcription was to use apps like VidCap and MacWhisper. After a transcript was generated, I’d run it through MassReplaceIt, a Mac app that lets you create and apply a huge dictionary of spelling corrections using a bulk find-and-replace operation. As I found errors in AI transcriptions by manually skimming them, I’d add those corrections to my dictionary. As a result, the transcriptions improved over time, but it was a cumbersome process that relied on me spotting errors, and I didn’t have time to do more than scan through each transcript quickly.

That’s why I was so enthusiastic about the speech APIs that Apple introduced last year at WWDC. The accuracy wasn’t any better than Whisper, and in some circumstances it was worse, but it was fast, which I appreciate given the many steps needed to get a YouTube video published.

The process was sped up considerably when Claude Skills were released. A skill can combine a script with instructions to create a hybrid automation with both the deterministic outcome of scripting and the fuzzy analysis of LLMs.

Transcribing with yap.

Transcribing with yap.

I’d run yap, a command line tool that I used to transcribe videos with Apple’s speech-to-text framework. Next, I’d open the Claude app, attach the resulting transcript, and run a skill that would run the script, replacing known spelling errors. Then, Claude would analyze the text against its knowledge base, looking for other likely misspellings. When it found one, Claude would reply with some textual context, asking if the proposed change should be made. After I responded, Claude would further improve my transcript, and I’d tell Claude which of its suggestions to add to the script’s dictionary, helping improve the results a little each time I used the skill.

Over the holidays, I refined my skill further and moved it from the Claude app to the Terminal. The first change was to move to parakeet-mlx, an Apple silicon-optimized version of NVIDIA’s Parakeet model that was released last summer. Parakeet isn’t as fast as Apple’s speech APIs, but it’s more accurate, and crucially, its mistakes are closer to the right answers phonetically than the ones made by Apple’s tools. Consequently, Claude is more likely to find mistakes that aren’t in my dictionary of misspellings in its final review.

Managing the built-in corrections dictionary.

Managing the built-in corrections dictionary.

With Claude Opus 4.5’s assistance, I rebuilt the Python script at the heart of my Claude skill to run videos through parakeet-mlx, saving the results as either a .srt or .txt file (or both) in the same location as the original file but prepended with “CLEANED TRANSCRIPT.” Because Claude Code can run scripts and access local files from Terminal, the transition to the final fuzzy pass for errors is seamless. Claude asks permission to access the cleaned transcript file that the script creates and then generates a report with suggested changes.

A list of obscure words Claude suggested changing. Every one was correct.

A list of obscure words Claude suggested changing. Every one was correct.

The last step is for me to confirm which suggested changes should be made and which should be added to the dictionary of corrections. The whole process takes just a couple of minutes, and it’s worth the effort. For the last episode of AppStories, the script found and corrected 27 errors, many of which were misspellings of our names, our podcasts, and MacStories. The final pass by Claude managed to catch seven more issues, including everything from a misspelling of the band name Deftones to Susvara, a model of headphones, and Bazzite, an open-source SteamOS project. Those are far from everyday words, but now, their misspellings are not only fixed in the latest episode of AppStories, they’re in the dictionary where those words will always be corrected whether Claude’s analysis catches them or not.

Claude even figured out "goti" was a reference to GOTY (Game of the Year).

Claude even figured out “goti” was a reference to GOTY (Game of the Year).

I’ve used this same pattern over and over again. I have Claude build me a reliable, deterministic script that helps me work more efficiently; then, I layer in a bit of generative analysis to improve the script in ways that would be impossible or incredibly complex to code deterministically. Here, that generative “extra” looks for spelling errors. Elsewhere, I use it to do things like rank items in a database based on a natural language prompt. It’s an additional pass that elevates the performance of the workflow beyond what was possible when I was using a find-and-replace app and later a simple dictionary check that I manually added items to. The idea behind my transcription cleanup workflow has been the same since the beginning, but boy, have the tools improved the results since I first used Whisper three years ago.


MacStories Setups Update: An Apple Gear Refresh, Dual-Screen Gaming, and HomeKit

Our desk setups. Federico (left) and John (right).

Our desk setups. Federico (left) and John (right).

As we head into the final weeks of 2025, Federico and I figured it would be a good time to update the MacStories Setups page. There’s an ebb and flow to the gear and apps we test each year, and as the fall OS update season fades into the past, it’s not unusual for one or both of us to take stock of our setup and make changes. That’s been very true for both of us this year, but in different ways.

Federico has been focused on simplifying his hardware setup and testing a long list of apps and services. In contrast, I’ve made fewer gear cuts, focusing more on strategic changes to the gadgets I use and settling on a core set of work apps.

The result is that Federico’s hardware setup changes have primarily been updates to his Apple and portable gaming gear. He made the transition from the iPhone 16 Pro Max to the iPhone Air, and couldn’t be happier with the result. He also replaced the M4 iPad Pro with the latest M5 model and moved from the AirPods 4 to the AirPods Pro 3.

Both of us ditched our previous Apple Vision Pro head strap solutions for the Apple Dual Knit Band, which has been a big upgrade. It’s comfortable, and having one dial to adjust both bands is both clever and far simpler than other solutions I’ve tried.

Ayn Thor.

Ayn Thor.

Federico also added the Ayn Thor to his handheld gaming lineup. The Thor, which I also bought this fall, is a dual-screen OLED gaming handheld that runs Android. It’s perfect for emulating dual-screen systems like the Nintendo DS and 3DS, but it has also been excellent for game streaming and testing the emerging world of emulating SteamOS on Android. If game tinkering is your thing and this sounds intriguing, we have two episodes of NPC: Next Portable Console that go in-depth on the Ayn Thor.

Read more


Podcast Rewind: Cozy Sports, Wallpaper Wars, Southern Cooking, and Bad Bunny

Enjoy the latest episodes from MacStories’ family of podcasts:

Comfort Zone

Niléane gets sporty and frosty, Chris is drowning in new Apple products, and Matt oversees the battle for the nicest iPhone wallpaper (where everyone wins).


MacStories Unwind

This week, Federico tries to recover from a rough week, John brings some Southern cooking to the show, Federico has a long backlog of videogames to attend to, and John has an Apple TV+ show and an album to share.


Magic Rays of Light

Sigmund is joined by 9to5Mac editor-in-chief Chance Miller to highlight Paul Greengrass’s emotional California fire triumph-over-disaster movie, Apple Music’s next Super Bowl half-time show headliner, and the indefinite postponement of The Savant.

Read more


Podcast Rewind: From AirPods Pro 3 to AYANEO’s Budget Handheld and Pok Pok’s Backstory

Enjoy the latest episodes from MacStories’ family of podcasts:

AppStories

This week, Federico and John follow up after a week with new Apple hardware and dig into watchOS and visionOS 26.

On AppStories+, John is mixing up his link and data organization systems - again.

This episode is sponsored by:

  • Claude – Get 50% off Claude Pro, including access to Claude Code.

NPC: Next Portable Console

This week, AYANEO introduces it’s most affordable handheld yet, but it’s not part of its budget line of devices. Plus, Qualcomm’s new chips pack a punch and the GDP Win 5 pack a punch, while Sony expands its Pulse line with desktop speakers.

This week on NPC XL, Federico’s disenchantment with the Sony PS5 kicks off a conversation about what’s been a very weird console generation.


First, Last, Everything

This week, Jonathan is joined by Esther Huybreghts. Esther is the Co-founder of the multi-award-winning app, PokPok. Since the app launched in 2021, it’s won an App Store award, an Apple Design award, and many other commendations. Pok Pok was also recently included on TIME’s list of the 100 Most Influential Companies of 2025. Pok Pok is a really special app, and Esther talked about its inception, plans for the future, and staying strong against negative – and sometimes hateful – feedback.

Read more


Introducing First, Last, Everything and Cozy Zone

The latest additions to the MacStories podcast family.

The latest additions to the MacStories podcast family.

Tomorrow is Apple’s big day, but today is ours. We’re pleased to introduce you to a brand new podcast, and more of one of your favorite shows.

First, Last, Everything

First, Last, Everything is a brand new MacStories podcast that explores people’s relationships with the technology that has shaped their lives. Each week, Jonathan Reed – who is a regular contributor to MacStories Weekly, our Club community manager on Discord, and the author of our annual watchOS reviews and other stories – gets to know a guest through three pieces of tech:

  • the one that first inspired them,
  • their latest obsession, and
  • the thing that’s meant everything to them.

From apps and computers to weird gadgets and more, Jonathan and listeners get to know someone from the unique perspective of the technology they love. Plus, in each episode, Jonathan shares an aside about a bit of technology you may have never heard of during the ‘Something’ segment.

Jonathan is a great interviewer and has a fantastic slate of guests lined up for season one of First, Last, Everything. You’ll hear new perspectives from familiar voices as well as brand new ones. Gadgets are fun, but it’s the people who use them that bring them to life through their stories. If you’re curious about technology, how others use it, and how new perspectives could inform your tech use, First, Last, Everything is your kind of show.

First, Last, Everything is a seasonal podcast, with new episodes releasing every Tuesday throughout the season. Season one includes eight episodes that will take the show well into the fall. Then, after a brief break, the podcast will be back with a new slate of interviews.

As someone who cares a lot about the human side of technology, I’m absolutely in love with this show. Federico and I dabbled with a similar concept years ago, so I’m really happy to see Jonathan putting his own spin on the idea and bringing together such a great lineup of guests that I’d love to spoil for everyone but will keep as a surprise.

You can listen to the first episode with YouTuber Tom Hitchins, whom, coincidentally, Federico and I hung out with a lot at WWDC this year, today. You can find Tom on his YouTube channel, Byte Review, or his website, Kiroku.co.uk, where he’s making calm, aesthetic videos about Apple products, creating wallpapers and Lightroom presets, and selling stickers and other great products. I think you’ll love the interview.

As we’re fond of saying, you can get First, Last, Everything wherever you get your podcasts. The show is still rolling out across the Internet, so it may show up in some places before others, but you can always visit macstories.net/podcasts/first-last-everything for links to the show on Apple Podcasts, Overcast, Pocket Casts, Spotify, and good, old-fashioned RSS. Plus, you can follow the show on Mastodon and Bluesky.

Cozy Zone

We’re equally excited to bring you Cozy Zone, a members-only extension of Comfort Zone hosted by Matt Birchler, Niléane, and Chris Lawley. It was a no-brainer to add Comfort Zone to MacStories, and the reaction from listeners has been beyond our expectations. It’s the perfect blend of information and entertainment sprinkled with the hosts’ weird obsessions.

In each weekly bonus episode of Comfort Zone, Matt, Niléane, and Chris invite listeners to join them in the Cozy Zone, where they’ll cover extra topics, invent wilder challenges and games, and share all their great (and not-so-great) takes on tech. The show is an excellent way to get to know the hosts better and participate in the fun you already enjoy on Comfort Zone. Here’s the trailer.

The first episode of Cozy Zone is being released publicly so everyone can get a taste of it right now. You’ll find it today in the Comfort Zone feed and on YouTube. Going forward, Cozy Zone will be audio-only for now, but if we hear from enough listeners who want it, we’ll work on making video versions a regular option, too. The first episode is available to watch here:

Episode 1 of Cozy Zone is available in the Comfort Zone audio-only podcast feeds too.

You can get cozy with the Comfort Zone crew for just $5/month or $50/year, which not only makes the bonus episodes possible, but supports Comfort Zone, too. The gang has already recorded the first few episodes, and they’re great. We think you’ll love them as well. To keep up with Cozy Zone, be sure to follow the existing Comfort Zone accounts on Mastodon and Bluesky.

Finally, thanks to everyone who reads MacStories and listens to our shows. It means a lot to us. We have no shortage of ideas of ways we can make MacStories even better than it was yesterday or is today, but without you all reading, listening, and spreading the word about what we do to your friends and family, it wouldn’t be possible. The MacStories community is strong and the kind of foundation that has allowed us to weather fundamental shifts in the online media world. Thank you for all you’ve done and continue to do for MacStories.


Relay Launches Its 2025 Fundraiser for St. Jude Children’s Research Hospital

Today, Relay kicked off its annual fundraiser for St. Jude Children’s Research Hospital to help combat childhood cancer. We’d love it if MacStories readers joined us in supporting this cause.

You can make a donation by visiting here.

As Stephen Hackett explains on 512 Pixels:

[St. Jude] is leading the way the world understands, treats and defeats childhood cancer, and other life-threatening diseases. If that sounds expensive, it’s because it is, but the bill isn’t passed to the families of patients. At St. Jude, no child is denied treatment based on race, religion or a family’s ability to pay.

Relay, which has raised over $4 million for St. Jude over the years, will be raising money through the end of September. Thanks in advance for checking out Relay’s fundraiser for St. Jude and to all who donate.

Permalink

Initial Notes on iPadOS 26’s Local Capture Mode

Now this is what I call follow-up: six years after I linked to Jason Snell’s first experiments with podcasting on the iPad Pro (which later became part of a chapter of my Beyond the Tablet story from 2019), I get to link to Snell’s first impressions of iPadOS 26’s brand new local capture mode, which lets iPad users record their own audio and video during a call.

First, some context:

To ensure that the very best audio and video is used in the final product, we tend to use a technique called a “multi-ender.” In addition to the lower-quality call that’s going on, we all record ourselves on our local device at full quality, and upload those files when we’re done. The result is a final product that isn’t plagued by the dropouts and other quirks of the call itself. I’ve had podcasts where one of my panelists was connected to us via a plain old phone line—but they recorded themselves locally and the finished product sounded completely pristine.

This is how I’ve been recording podcasts since 2013. We used to be on a call on Skype and record audio with QuickTime; now we use Zoom, Audio Hijack, and OBS for video, but the concept is the same. Here’s Snell on how the new iPadOS feature, which lives in Control Center, works:

The file it saves is marked as an mp4 file, but it’s really a container featuring two separate content streams: full-quality video saved in HEVC (H.265) format, and lossless audio in the FLAC compression format. Regardless, I haven’t run into a single format conversion issue. My audio-sync automations on my Mac accept the file just fine, and Ferrite had no problem importing it, either. (The only quirk was that it captured audio at a 48KHz sample rate and I generally work at 24-bit, 44.1KHz. I have no idea if that’s because of my microphone or because of the iPad, but it doesn’t really matter since converting sample rates and dithering bit depths is easy.)

I tested this today with a FaceTime call. Everything worked as advertised, and the call’s MP4 file was successfully saved in my Downloads folder in iCloud Drive (I wish there was a way to change this). I was initially confused by the fact that recording automatically begins as soon as a call starts: if you press the Local Capture button in Control Center before getting on a call, as soon as it connects, you’ll be recording. It’s kind of an odd choice to make this feature just a…Control Center toggle, but I’ll take it! My MixPre-3 II audio interface and microphone worked right away, and I think there’s a very good chance I’ll be able to record AppStories and my other shows from my iPad Pro – with no more workarounds – this summer.

Permalink

Video Production, Automation Hardware, Gaming, and More: My Desk Setup Update

Earlier today, we published an episode of AppStories covering the latest changes that Federico and I have made to our desk setups and other gear. I last updated my desk setup in July, before we started recording a video version of NPC: Next Portable Console. Since then, I’ve also started publishing bonus NPC videos and hardware reviews on the MacStories YouTube channel, which has driven the bulk of the changes in my setup.

On the hardware side, the changes include the addition of Elgato mounts, namely the Elgato Flex Arm and Solid Arm, along with a second Master Mount and Mini Key Light. I also purchased a Moment Multi Threaded Mount that lets me connect my iPhone 16 Pro Max to the Elgato Flex Arm using a strong MagSafe magnet for filming top-down shots. And I switched my podcast recording headphones to the Moondrop ARIA 2 In-Ear Headphones because the Shure model I used before didn’t stay in my ears.

Making the most of a small space..

Making the most of a small space..

The rest of my computing setup hasn’t changed much. I removed my M2 MacBook Air because the 11” M4 iPad Pro has replaced it for most of my lightweight portable needs. I’ve also added the Logitech MX Creative Console and Logitech POP Icon Keys keyboard. I’ve grown to like the Creative Console more than the Loupedeck Live for push-button automation, but for now, I still use both. I’m not entirely sure if I’ll stick with the POP Icon Keys keyboard because, as much as I like the automation that’s been baked into it, I prefer the feel of the Magic Keyboard, but I haven’t shelved it yet. I’ve also abandoned the Belkin BoostCharge Pro Wireless Charger because of a recent recall. Instead, I’ve switched to a Native Union Voyage Magnetic Wireless Charger that I’ve been testing and will review soon.

On the videogame side of things, I’ve gravitated to Anbernic devices, adding the Anbernic RG40XXV and Anbernic RG28XX to my setup. Along with the Anbernic RG35XXSP that was already on my list, I now have a nice range of retro handheld sizes and form factors. I have others as well, but these are my current go-to devices. I also added the Nintendo Switch Lite (Hyrule Edition) because I’m a sucker for anything Zelda and I wanted a smaller Switch for traveling.

The Mobapad M6 HD Joycons with my OLED Switch and the Switch Lite Hyrule Edition.

The Mobapad M6 HD Joycons with my OLED Switch and the Switch Lite Hyrule Edition.

I’ve tried to be good and stop buying controllers, but there have been some great ones released lately, so my list now includes the Mobapad M6 HD Joycons for my OLED Switch and the GameSir G8+. The G8+ is a lot like the GameSir G8 Galileo that it replaced except that it has Bluetooth and works with the iPhone, iPad mini, and Nintendo Switch; I love it.

Finally, I’ve made several changes to my app lineup. There are more Apple system apps on the list, which is usually the case after a long summer of testing macOS betas. Plus, there are several video recording and editing apps on the list now, including Final Cut Pro for the iPad and Mac, Final Cut Camera, a Stupid Raisins plugin for doing split-screen video, and Camo Studio.

That’s it for now. However, I’m already thinking about some fall and winter projects that will mean more changes to my setup, especially with respect to video production tools.


New NPC and AppStories Additions to the MacStories YouTube Channel

We’ve begun publishing more than podcasts on YouTube. In addition to MacStories’ six podcasts, we’ve added bonus content for NPC: Next Portable Console listeners. Whether you watch the video version of the show or not, we kicked off a playlist of bonus content this week with two videos.

The first is an unboxing and closer look at the Nintendo Switch Lite Hyrule Edition. I’ve had the Switch Lite for a little over a week and love it.

We also published a short video follow-up to one of the news stories we covered on the most recent episode of NPC. It turns out that the mysterious Nintendo device that appeared in the FCC’s records wasn’t quite what we thought it might be, and Brendon got photos from the Nintendo Store in New York, which Brendon also wrote about on Wavelengths.

YouTube’s playlist feature is also a great way to put together themed sets of past episodes of shows like AppStories, which has over 400 episodes. We’re working on more of these, but this week, we added a playlist of all 43 interview episodes we’ve done so far, as well as a collection of our classic Pick 2 episodes where Federico and I spotlight recent apps we’ve been trying.