Posts tagged with "featured"

Hands-On: How Apple’s New Speech APIs Outpace Whisper for Lightning-Fast Transcription

Late last Tuesday night, after watching F1: The Movie at the Steve Jobs Theater, I was driving back from dropping Federico off at his hotel when I got a text:

Can you pick me up?

It was from my son Finn, who had spent the evening nearby and was stalking me in Find My. Of course, I swung by and picked him up, and we headed back to our hotel in Cupertino.

On the way, Finn filled me in on a new class in Apple’s Speech framework called SpeechAnalyzer and its SpeechTranscriber module. Both the class and module are part of Apple’s OS betas that were released to developers last week at WWDC. My ears perked up immediately when he told me that he’d tested SpeechAnalyzer and SpeechTranscriber and was impressed with how fast and accurate they were.

It’s still early days for these technologies, but I’m here to tell you that their speed alone is a game changer for anyone who uses voice transcription to create text from lectures, podcasts, YouTube videos, and more. That’s something I do multiple times every week for AppStories, NPC, and Unwind, generating transcripts that I upload to YouTube because the site’s built-in transcription isn’t very good.

What’s frustrated me with other tools is how slow they are. Most are built on Whisper, OpenAI’s open source speech-to-text model, which was released in 2022. It’s cheap at under a penny per one million tokens, but isn’t fast, which is frustrating when you’re in the final steps of a YouTube workflow.

An SRT file generated by Yap.

An SRT file generated by Yap.

I asked Finn what it would take to build a command line tool to transcribe video and audio files with SpeechAnalyzer and SpeechTranscriber. He figured it would only take about 10 minutes, and he wasn’t far off. In the end, it took me longer to get around to installing macOS Tahoe after WWDC than it took Finn to build Yap, a simple command line utility that takes audio and video files as input and outputs SRT- and TXT-formatted transcripts.

Yesterday, I finally took the Tahoe plunge and immediately installed Yap. I grabbed the 7GB 4K video version of AppStories episode 441, which is about 34 minutes long, and ran it through Yap. It took just 45 seconds to generate an SRT file. Here’s Yap ripping through nearly 20% of an episode of NPC in 10 seconds:

Replay

Next, I ran the same file through VidCap and MacWhisper, using its V2 Large and V3 Turbo models. Here’s how each app and model did:

App Transcripiton Time
Yap 0:45
MacWhisper (Large V3 Turbo) 1:41
VidCap 1:55
MacWhisper (Large V2) 3:55

All three transcription workflows had similar trouble with last names and words like “AppStories,” which LLMs tend to separate into two words instead of camel casing. That’s easily fixed by running a set of find and replace rules, although I’d love to feed those corrections back into the model itself for future transcriptions.

Once transcribed, a video can be used to generate additional formats like outlines.

Once transcribed, a video can be used to generate additional formats like outlines.

What stood out above all else was Yap’s speed. By harnessing SpeechAnalyzer and SpeechTranscriber on-device, the command line tool tore through the 7GB video file a full 2.2× faster than MacWhisper’s Large V3 Turbo model, with no noticeable difference in transcription quality.

At first blush, the difference between 0:45 and 1:41 may seem insignificant, and it arguably is, but those are the results for just one 34-minute video. Extrapolate that to running Yap against the hours of Apple Developer videos released on YouTube with the help of yt-dlp, and suddenly, you’re talking about a significant amount of time. Like all automation, picking up a 2.2× speed gain one video or audio clip at a time, multiple times each week, adds up quickly.

Whether you’re producing video for YouTube and need subtitles, generating transcripts to summarize lectures at school, or doing something else, SpeechAnalyzer and SpeechTranscriber – available across the iPhone, iPad, Mac, and Vision Pro – mark a significant leap forward in transcription speed without compromising on quality. I fully expect this combination to replace Whisper as the default transcription model for transcription apps on Apple platforms.

To test Apple’s new model, install the macOS Tahoe beta, which currently requires an Apple developer account, and then install Yap from its GitHub page.


iOS 26, iPadOS 26, and Liquid Glass: The MacStories Overview

During today’s WWDC 2025 keynote, held in person at Apple Park and streamed online, Apple unveiled a considerable number of upgrades to iOS and iPadOS, including a brand-new design language called Liquid Glass. This new look, which spans all of Apple’s platforms, coupled with a massive upgrade for multitasking on the iPad and numerous other additions and updates, made for packed releases for iOS and iPadOS.

Let’s take a look at everything Apple showed today for Liquid Glass, iOS, and iPadOS.

Read more


macOS Tahoe: The MacStories Overview

At its WWDC 2025 keynote held earlier today, Apple officially announced the next version of macOS, macOS Tahoe. As per the company’s naming tradition over the past decade, this new release is once again named after a location in California. This year, however, to unify the version numbers across all its operating systems, Apple has decided to align the new release with the upcoming year. This is why the version number for macOS Tahoe will be macOS 26, directly up from last year’s macOS 15.

macOS 26 features the brand-new Liquid Glass design language, which Apple is also rolling out across iOS, iPadOS, visionOS, watchOS, and tvOS. But macOS Tahoe doesn’t stop there. In addition to the flashy new look, Apple has introduced many features, ranging from a supercharged new version of Spotlight and intelligent actions in Shortcuts to new Continuity and gaming-focused features for the Mac.

Here’s a recap of everything that Apple showed off today for macOS Tahoe.

Read more


Apple Intelligence Expands: Onscreen Visual Intelligence, Shortcuts, Third-Party Apps, and More

Source: Apple.

Source: Apple.

One of the big questions heading into today’s WWDC keynote was how Apple would address its AI efforts. After a splashy introduction last year followed by a staggered rollout and the eventual delay of the more personalized Siri, it was unclear how much focus the company would put on Apple Intelligence during its big announcement video.

Surprisingly, they came right out of the gate with a segment on Apple Intelligence, even going so far as to mention the fact that the more personalized Siri needed more time; it’s slated to be released “in the coming year.” But SVP of Software Craig Federighi also said that Apple Intelligence had progressed with more capable and efficient models and teased that more Apple Intelligence features would be revealed throughout the presentation. Rather than dedicating a significant portion of the keynote just to AI features, the company returned to a platform-centered structure for the rest of the video and mentioned Apple Intelligence as it related to each OS.

In its second year, Apple Intelligence is set to expand in more ways than one. Perhaps most excitingly, third-party developers will soon have access to Apple Intelligence’s on-device foundation model, enabling them to implement AI features in their apps that work offline in a privacy-respecting way. And because the framework is local, it will be available to developers at no additional cost with no API fees.

Read more


From the Creators of Shortcuts, Sky Extends AI Integration and Automation to Your Entire Mac

Sky for Mac.

Sky for Mac.

Over the course of my career, I’ve had three distinct moments in which I saw a brand-new app and immediately felt it was going to change how I used my computer – and they were all about empowering people to do more with their devices.

I had that feeling the first time I tried Editorial, the scriptable Markdown text editor by Ole Zorn. I knew right away when two young developers told me about their automation app, Workflow, in 2014. And I couldn’t believe it when Apple showed that not only had they acquired Workflow, but they were going to integrate the renamed Shortcuts app system-wide on iOS and iPadOS.

Notably, the same two people – Ari Weinstein and Conrad Kramer – were involved with two of those three moments, first with Workflow, then with Shortcuts. And a couple of weeks ago, I found out that they were going to define my fourth moment, along with their co-founder Kim Beverett at Software Applications Incorporated, with the new app they’ve been working on in secret since 2023 and officially announced today.

For the past two weeks, I’ve been able to use Sky, the new app from the people behind Shortcuts who left Apple two years ago. As soon as I saw a demo, I felt the same way I did about Editorial, Workflow, and Shortcuts: I knew Sky was going to fundamentally change how I think about my macOS workflow and the role of automation in my everyday tasks.

Only this time, because of AI and LLMs, Sky is more intuitive than all those apps and requires a different approach, as I will explain in this exclusive preview story ahead of a full review of the app later this year.

Read more


Early Impressions of Claude Opus 4 and Using Tools with Extended Thinking

Claude Opus 4 and extended thinking with tools.

Claude Opus 4 and extended thinking with tools.

For the past two days, I’ve been testing an early access version of Claude Opus 4, the latest model by Anthropic that was just announced today. You can read more about the model in the official blog post and find additional documentation here. What follows is a series of initial thoughts and notes based on the 48 hours I spent with Claude Opus 4, which I tested in both the Claude app and Claude Code.

For starters, Anthropic describes Opus 4 as its most capable hybrid model with improvements in coding, writing, and reasoning. I don’t use AI for creative writing, but I have dabbled with “vibe coding” for a collection of personal Obsidian plugins (created and managed with Claude Code, following these tips by Harper Reed), and I’m especially interested in Claude’s integrations with Google Workspace and MCP servers. (My favorite solution for MCP at the moment is Zapier, which I’ve been using for a long time for web automations.) So I decided to focus my tests on reasoning with integrations and some light experiments with the upgraded Claude Code in the macOS Terminal.

Read more


Notes on Mercury Weather’s New Radar Maps Feature

Since covering Mercury Weather 2.0 and its launch on the Vision Pro here on MacStories, I’ve been keeping up with the weather app on Club MacStories. It’s one of my favorite Mac menu bar apps, it has held a spot on my default Apple Watch face since its launch, and last fall, it added severe weather notifications.

I love the app’s design and focus as much today as I did when I wrote about its debut in 2023. Today, though, Mercury Weather is a more well-rounded app than ever before. Through regular updates, the app has filled in a lot of the holes in its feature set that may have turned off some users two years ago.

Today, Mercury Weather adds weather radar maps, which was one of the features I missed most from other weather apps, along with the severe weather notifications that were added late last year. It’s a welcome addition that means the next time a storm is bearing down on my neighborhood, I won’t have to switch to a different app to see what’s coming my way.

Zooming out to navigate the globe.

Zooming out to navigate the globe.

Radar maps are available on the iPhone, iPad, and Mac versions of Mercury Weather; they offer a couple of different map styles and a legend that explains what each color on the map means. If you zoom out, you can get a global view of Earth with your favorite locations noted on the map. Tap one, and you’ll get the current conditions for that spot. Mercury Weather already had an extensive set of widgets for the iPhone, iPad, and Mac, but this update adds small, medium, and large widgets for the radar map, too.

A Mercury Weather radar map on the Mac.

A Mercury Weather radar map on the Mac.

With a long list of updates since launch, Mercury Weather is worth another look if you passed on it before because it was missing features you wanted. The app is available on the App Store as a free download. Certain features require a subscription or lifetime purchase via an in-app purchase.


Inside Airbnb’s App Redesign: An AppStories Interview with Marketing and Design Leads

Last week, I was in LA for Airbnb’s 2025 Summer Release. As part of the day’s events, Federico and I interviewed Jud Coplan, Airbnb’s Vice President of Product Marketing, and Teo Connor, Airbnb’s Vice President of Design, for AppStories to talk about the new features and app the company launched. It was a great conversation that you can watch on YouTube:

or listen to the episode here:

Last week’s launch was a big one for Airbnb. The company debuted Services and reimagined and expanded Experiences. Services are the sort of things hotels and resorts offer that you used to give up when booking an Airbnb stay. Now, however, you can book a chef, personal trainer, hair stylist, manicurist, photographer, and more. Better yet, you don’t have to book a stay with an Airbnb host to take advantage of services. You can schedule services in your hometown or wherever you happen to be.

Experiences aren’t entirely new to Airbnb, but have been expanded and integrated into the Airbnb app in a way that’s similar to Services. Services allow you to get the most out of a trip from locals who know their cities best, whether that’s a cultural tour, dining experience, outdoor adventure, or something else.

Chef Grace explaining how to serve sadza.

Chef Grace explaining how to serve sadza.

While I was in LA, I prepared a meal alongside several other media folks from around the world. Our instructor was Chef Kuda Grace from Zimbabwe at Flavors from Afar. We made sadza with peanut butter and mustard greens and then sat down together to compare notes from the day’s events, tell stories about our dining experiences, and get to know each other better.

The evening was a lot of fun, but what struck me most about it was something we touched upon in this week’s episode of AppStories. The goal of Airbnb’s redesigned app is to get you to leave it and go out into the world to try new things. It reduces the friction and anxiety of taking the plunge into something new and emphasizes social interactions in the real world instead of on a screen. In 2025, that’s unusual for an app from a big company, and it was fascinating to talk to Teo and Jud about how they and their teams set out to accomplish that goal.

I like Airbnb’s redesigned app a lot. It’s playful, welcoming and easy to use. What remains to be seen is whether Airbnb can pull off what it’s set out to accomplish. It isn’t the first company to try to pair customers with local services and experiences. Nor is it Airbnb’s first attempt at experiences. However, the app is a solid foundation, and if my experience at dinner in LA was any indication, I suspect Airbnb may be onto something with Services and Experiences.

Disclosure: The trip to LA to conduct my half of this interview was paid for by Airbnb.

Permalink

Federico’s Latest Automation Academy Lesson: Building a Better Web Clipper with Shortcuts and AI

A webpage saved with Universal Clipper.

A webpage saved with Universal Clipper.

I share Federico’s frustration over saving links. Every link may be a URL, but their endpoints can be wildly different. If like us, you save links to articles, videos, product information, and more, it’s hard to find a tool that handles every kind of link equally well.

That was the problem Federico set out to solve with Universal Clipper, an advanced shortcut that automatically detects the kind of link that’s passed to it, and saves it to a text file, which he accesses in Obsidian, although any text editor will work.

Universal Clipper integrates with the Obsidian plugin Dataview, too.

Universal Clipper integrates with the Obsidian plugin Dataview, too.

Universal Clipper, which Federico released yesterday as part of his Automation Academy series for Club MacStories Plus and Premier members, is one of his most ambitious shortcuts that draws on multiple third-party apps, services, and command line tools in an automation that works as a standalone shortcut or as a function that can send its results to another shortcut. As Federico explains:

I learned a lot in the process. As I’ve documented on MacStories and the Club lately, I’ve played around with various templates for Dataview queries in Obsidian; I’ve learnedhow to take advantage of the Mac’s Terminal and various CLI utilities to transcribe long YouTube videos and analyze them with Gemini 2.5; I’ve explored new ways to interact with web APIs in Shortcuts; and, most recently, I learned how to properly prompt GPT 4.1 with precise instructions. All of these techniques are coming together in Universal Clipper, my latest, Mac-only shortcut that combines macOS tools, Markdown, web APIs, and AI to clip any kind of webpage from any web browser and save it as a searchable Markdown document in Obsidian.

Although the shortcut may be complex, the best part of Federico’s post is how easy it is to follow. Along the way, you’ll learn a bunch of techniques and approaches to Shortcuts automation that you can adapt for your own shortcuts, too.

Automation Academy is just one of many perks that Club MacStories Plus and Club Premier members enjoy including:

  • Weekly and monthly newsletters 
  • A sophisticated web app with search and filtering tools to navigate eight years of content
  • Customizable RSS feeds
  • Bonus columns
  • An early and ad-free version of our Internet culture and media podcast, MacStories Unwind
  • A vibrant Discord community of smart app and automation fans who trade a wealth of tips and discoveries every day
  • Live Discord audio events after Apple events and at other times of the year

On top of that, Club Premier members get AppStories+, an extended, ad-free version of our flagship podcast that we deliver early every week in high-bitrate audio.

Use the buttons below to learn more and sign up for Club MacStories+ or Club Premier.

Join Club MacStories+:

Join Club Premier:

Permalink