My MacOS Shortcut & Macro Library – Raycast, Keyboard Maestro, and More

Our work is reader-supported; if you buy through our links, we may earn an affiliate commission.

I’ve invested a lot of time into creating macros, shortcuts, and other ways of doing my work faster on my Mac.

Part of the reason I’ve done this is that I deal with RSI (Repetive Stress Injury) issues, particularly in my left hand.

Even though I practice good posture, have set up my workspace for proper ergonomics, and exercise like a maniac, I still spend a lot of time typing (as well as playing guitar), which can lead to overuse injuries. The less typing I can do, the better.

Beyond that, I’m also just fascinated with tools that can help me work faster and do more with less input. This post is a regularly-updated guide to many of the different utilities, shortcuts, and macros that I’ve developed specifically for my Mac computer that allow me to work faster.

Below I’ve created a major section for each app that I use. Within that section, I’ve listed all of the customizations, sub-apps, and shortcuts that I’ve created.

I think it’s worth noting that I “wrote” 95% of this post by talking to my Mac, not by actually typing.

MacWhisper is an app that uses OpenAI’s open-source Whisper model to transcribe audio to text. The model is extremely accurate and, at least on my M2 Mac Studio, very fast.

🎙️ MacWhisper
Quickly and easily transcribe audio files into text with OpenAI’s state-of-the-art transcription technology Whisper. Whether you’re recording a meeting, lecture, or other important audio, MacWhisper quickly and accurately transcribes your audio files into text.Features Easily record and transcribe audio files on your Mac System wide dictation with Whisper to replace Apple’s own dictation Just drag and drop audio files to get a high quality transcription Record directly from your microphone or any other input device on your Mac All transcription is done on your device, no data leaves your machine. This makes MacWhisper a great app for sensitive audio such as interviews. Save or export your transcripts as a .whisper file, which includes the original audio and all your transcription edits for easy sharing .srt & .vtt subtitles export as well as csv, dote, docx, pdf, markdown and html exports Metal and GPU support for extremely fast transcription Get accurate text transcriptions in seconds (up to ~30x realtime) Search the entire transcript and highlight words Audio playback synced to transcripts Supports 100 different languages Copy the entire transcript or individual sections Star/Favorite segments Compact mode (hide timestamps) Automatically remove ums, uhhs and other similar filler words Drag and drop directly from Voice Memos Edit and delete segments from the transcript Add up to two speakers manually Inline Video Player Video playback synced to subtitles View multiple language subtitles at once in the videoplayer Select transcription language (or use auto detect) Change playback speed from 0.5 to 3.0x (audio & video) Supported formats: mp3, wav, m4a, ogg, opus, mov and mp4 videos. Adjust whisper settings (beam search / greedy, beam size etc) Supports Tiny, Base and Small transcription qualities (and you can compare the transcription quality between all quality levels) MacWhisper Pro All above features Batch Transcribe as many files one after the other. Useful if you want to add subtitles to an entire season of a show, or if you have a lot of interviews to go through Support for WhisperKit and Distilled models Transcribe YouTube videos Support for OpenAI (ChatGPT), Anthropic (Claude), Groq, Ollama, Custom OpenAI API endpoints and Azure AI models for easy prompting Support Cloud Transcriptions through OpenAI and Groq Manually add speakers to your transcript for a cleaner export Menubar app for accessing Whisper anywhere from your Mac Global, access MacWhisper from anywhere in a spotlight type view for instant transcription and easy pasting into other apps ChatGPT integration (with your own API key) Ignore segments such as [SILENCE] from appearing in your transcripts Supports GPT4, GPT4 Turbo, GPT4o and GPT4o-mini as well as older models Anthropic Claude Integration (with your own API key) Record and transcribe system audio (to record meetings for example) Supports Tiny (English Only), Tiny, Base, Small, Medium and Large (V2 and V3) models Add your own custom GGML models Change the starting timestamp for the transcript Translate audio file into another language through Whisper (use the Medium or Large models, the results will not be perfect and I’m working on more advanced ways to do this) Translate the full transcript by adding your own (free) DeepL API key. Translate subtitles into different languages Inline and separate video player with subtitle and multiple translated subtitles support Transcribe podcasts by combining single track audio for each host (beta) One time payment, no subscription. Pay once and use forever. Higher priority support. I’ll try to email you back as soon as possible if you run into anything. If you’re a journalist, student or non-profit, send me an email at [email protected] and tell me about your work to get 50% off 🙂 If you purchase MacWhisper Pro and are not happy with it, let me know within 7 days what could be improved and I’ll refund you. After downloading MacWhisper you will have to fill in your license key to unlock all Pro features. If you want to purchase more than 20 licenses, or if you’re looking for an MDM deployment or something custom, please send an email to [email protected]+ Supported LanguagesMacWhisper can transcribe audio in the following languages:English, Chinese, German, Spanish, Russian, Korean, French, Japanese, Portuguese, Turkish, Polish, Catalan, Dutch, Arabic, Swedish, Italian, Indonesian, Hindi, Finnish, Vietnamese, Hebrew, Ukrainian, Greek, Malay, Czech, Romanian, Danish, Hungarian, Tamil, Norwegian, Thai, Urdu, Croatian, Bulgarian, Lithuanian, Latin, Maori, Malayalam, Welsh, Slovak, Telugu, Persian, Latvian, Bengali, Serbian, Azerbaijani, Slovenian, Kannada, Estonian, Macedonian, Breton, Basque, Icelandic, Armenian, Nepali, Mongolian, Bosnian, Kazakh, Albanian, Swahili, Galician, Marathi, Punjabi, Sinhala, Khmer, Shona, Yoruba, Somali, Afrikaans, Occitan, Georgian, Belarusian, Tajik, Sindhi, Gujarati, Amharic, Yiddish, Lao, Uzbek, Faroese, Haitian Creole, Pashto, Turkmen, Nynorsk, Maltese, Sanskrit, Luxembourgish, Myanmar, Tibetan, Tagalog, Malagasy, Assamese, Tatar, Hawaiian, Lingala, Hausa, Bashkir, Javanese, Sundanese.System RequirementsMacWhisper requires a lot of computer memory to work well. To use the Medium and Large models your Mac should have more than 8GB of RAM. Performance on older Intel based Macs can also be bad but I have not been able to test this properly.👨‍💻 Check out my other macOS utilities:OpenAI Bundle – Get all my OpenAI apps at a discounted rateMacGPT – Use ChatGPT on your Mac and from your menubarDetective – GPT Vision for macOSVoices – High Quality Text to Speech with OpenAIText Assistant – Generate useful text and manage your prompts with GPT and your own OpenAPI keyVivid – Double the brightness of your MacBook Pro by always using HDR modeForehead – Hide the Notch and round your MacBook cornersCooldown – Quickly toggle Low Power Mode from your menubarSpeedy – Fast Speedtest in your menubarPippo – Improve the Picture-in-Picture video player with seek controlsWhisper was made by building on top of all the hard work from Georgi Gerganov, check out his Whisper implementation here: https://github.com/ggerganov/whisper.cppPrivacy Policy and Terms of Use
goodsnooze.gumroad.com

I started using MacWhisper to transcribe my YouTube videos and create subtitle files, and I still use it for that purpose.

However, the real killer feature of this app is called Dictation. Once you’ve enabled it in the settings, you can hold down a customizable hotkey to enable dictation in a push-to-talk fashion. I personally use the right key, since I almost never use that key for anything else.

This lets you start speaking, and then transcribes your audio into text directly into the active app. I estimate that I do about 95% of my text input using this feature.

MacWhisper has a free version, which includes the Dictation feature. There’s also a Pro version, which you can buy for a one-time fee that gives you access to some more features, including the ability to run your transcribed text through an LLM like ChatGPT or Anthropic’s Claude.

I tested that LLM cleanup feature, but I find that in most cases, the raw output from Whisper is better. So I don’t use it now. However, it did give me the idea to develop custom macros with Keyboard Maestro that let me select and clean up text with an LLM on an as-needed basis.

There’s also a useful Find and Replace feature which allows you to specify words that should be replaced with other words in all transcripts. Whisper is already a very accurate model, but this allows you to make your transcripts even more accurate.

The Find and Replace setting in MacWhisper.

I should note that macOS has a built-in dictation feature which I’ve also tested. Here’s how it stacks up against MacWhisper as of December 2024.

macOS Dictation pros:

  • It’s included in macOS.
  • Transcription happens live. It’s not real-time, but you can see the text being entered as you’re speaking. MacWhisper requires you to let go of the dictation key before it will transcribe anything.

MacWhisper Dictation pros:

  • It’s currently more accurate, and it handles grammar and punctuation better than macOS Dictation.
  • I like being able to hold down the right key to trigger Dictation. Mac OS dictation requires either double tapping or hitting the Microphone button (F5), which is more of a reach on my keyboard.

I’ll also note that macOS has a separate voice control feature that can be turned on in System Settings → Accessibility → Voice Control. This is separate from Mac OS dictation, and will automatically transcribe any spoken audio without the need to toggle a hotkey. It also allows you to literally control your computer using a huge list of commands.

Currently, I prefer using MacWhisper dictation out of all these three options, but things may change in the future. I’m particularly interested in how Apple Intelligence might improve the quality of the Dictation feature’s transcription in the future.

Raycast is a super-handy launcher app that essentially replaces Spotlight on macOS, and gives you the ability to take a ton of actions directly from your keyboard.

The Raycast launcher.

Just like with Spotlight, you hit a hotkey (I use + Space) to bring up Raycast’s text input window. From there, you can type to filter through your actions, use the arrow keys to navigate, and hit Enter to run actions or launch programs.

Here are my most-used shortcuts and extensions:

  • Spotify Controls – MacOS doesn’t let you force your media keys to only control Spotify. So I’ve set + [ to Play and + ] to Pause using this extension.
  • Clipboard History – Included in Raycast. Retains my clipboard history. I can’t believe macOS hasn’t included a clipboard manager by default at this point.
  • Window Management – Included in Raycast. Lets me type commands like first half or maximize to manage the active window. There are many other window managers for macOS (Aerospace is a popular new one), but I’m happy with the commands in Raycast.
  • Launching Apps – Type to find and launch any app
  • Snippets – Much like a text expander, Raycast allows you to create snippets of text, then automatically paste them by typing shorter strings. For example, I use hth to paste Hope this helps! in our customer support community.
  • Text Shortcuts – This extension lets me run useful shortcuts to transform text. For Example, I Can Highlight Some Text and Run the Title Case Transformation on It. I like this extension, though I find myself steadily creating Keyboard Maestro macros for many of its actions, since using it requires more key presses than what I need to trigger a macro.
  • Color Picker – Spawns an eyedropper tool which lets me click any part of the screen to copy that pixel’s hexadecimal color value.

Interestingly, Raycast’s new Wrapped feature actually shows me which of these extensions I use the most:

The Raycast Wrapped feature, showing my top extensions and how many times I've opened them.

Raycast also has a Pro plan, which starts at $8/month. I have used it in the past, but my main use case was syncing all of my snippets between devices. That is a useful feature, but paying $8 a month for it seems a bit steep.

Keyboard Maestro is an incredibly powerful – and complex – automation tool for macOS. It allows you to create macros which you can trigger using keyboard shortcuts, scripts, or even via other macros.

These macros can do nearly anything, including:

  • Copying and pasting to and from the clipboard
  • Manipulating the contents of the clipboard
  • Launching, controlling, or closing apps
  • Controlling the actual OS (e.g. shutting down the system via a macro)
  • Reading and writing to the file system
  • Running custom scripts – including Shell scripts, AppleScript, JavaScript/Python scripts, etc.

Macros can also consist of several steps, and you can define custom variables to pass information from step to step.

Honestly, I feel like I haven’t even scratched the surface with Keyboard Maestro. If you really want to dig into the many, many macros you could build, check out the macro library in their community forum.

My main use case for Keyboard Maestro right now triggering a massive, multi-step macro that can use OpenAI’s gpt-4o-mini model to clean up my dictated transcripts from MacWhisper. It can even inject links to other pages on my site, using a list of links I’ve included in the system instructions.

The macro is fairly complex, and it also uses Apple Shortcuts in addition to Keyboard Maestro (thanks to Thought Asylum for sharing the initial Shortcut I used as a base to build mine). I’ve made a full video explaining how it works below:

Apple’s Shortcuts app is a powerful tool that can automate nearly anything, both on iOS and macOS.

I primarily use shortcuts on my phone, where I have shortcuts that can launch specific Notion pages directly from my home screen, as well as a shortcut that allows me to add tasks to Notion using my voice.

How to Create Tasks in Notion with Your Voice
A step-by-step guide on creating a voice command that can add multiple tasks to Notion at the same time, using ChatGPT. Works on iPhone and Android.
thomasjfrank.com

However, I also use this Notion Voice Tasks Shortcut on my Mac as well. While it is easy to open up Notion and create a new task, it’s even easier to launch the Shortcut from my Dock and brain-dump multiple tasks using my voice.

Additionally, I use Shortcuts in conjunction with Keyboard Maestro to run my AI text cleanup macro, which I detailed in the Keyboard Maestro section above.

I’ve been creating YouTube videos and podcasts for well over 10 years, and I’ve always been very persnickety about my audio quality.

In the past, I’ve always spent time in the post-production process adding effects like EQ, compression, and de-essing to the audio in my raw files. Recently, I’ve been working on optimizing that process by improving the quality of the audio as I record it.

On my streaming computer, this involved investing in a DBX286s channel strip, which is a hardware unit that can apply EQ, compression, and other effects directly to the signal coming from the microphone before it reaches my computer.

DBX 286s Microphone Preamp & Channel Strip Processor
$199.00
Buy Now
I'll earn a commission if you click this link and make a purchase at no additional cost to you.
12/28/2024 12:43 am GMT

This thing rocks, and I won’t be mad at all if you choose to buy it using my affiliate link above. That said, it ran me $200, and it’s a pretty big unit.

So when I wanted to figure out how to improve the default audio quality on my work computer (separate from the computer I use for streaming), I decided to figure out if I had any other options besides buying another DBX 286s.

That’s how I discovered Audio Hijack and Loopback. Together, these two apps allow me to send the input audio from my microphone through an effect chain, then to route that audio to any app I want. The result is podcast-quality audio in every app I use, from Slack huddles and Zoom calls to Loom videos.

To set this up, I first created an Audio Hijack session that uses my Scarlett 2i2 USB interface as the input device.

If you’re curious, the mic I’m using is a Shure SM7B – though I don’t recommend it for most people. It’s a famous mic – Michael Jackson recorded Thriller on it – but if you’re just talking into it, it’s overkill. It costs way more than you need to spend, and it’s also extremely gain-hungry – which means you pretty much have to run it into a Cloudifter or some other form of preamp unless you have an especially beefy audio interface. If you want a mic of similar quality, get the Shure MV7+ (which works via USB or XLR), or the cheaper, XLR-only MV7x if you already have an audio interface.

Audio Hijack showing my custom microphone enhancements session.

In Audio Hijack, I route the input from my audio interface into Neutron, which is an effects suite plugin made by Izotope. Like my microphone, this is absolute overkill. Neutron is meant for professional music production (I use it on my music), so using it just to boost the quality of my spoken-word audio is like hunting pheasant with an M1 Abrams tank.

In Neutron, I’ve built out a pretty simple effects rack:

  • Compressor
  • Equalizer
  • Sculptor (set to the Dialogue setting)
  • Limiter, limiting output to -3db
My Neutron configuration.

Audio Hijack actually comes with plugins that will let you do almost all of this without installing any VSTs. If you still want to use VSTs, you can find plenty of free ones that still rock, such as:

Whatever you choose, the principle is the same. Boost the signal a bit, use equalization and compression to shape the character of the audio, and limit the output so you don’t get any peaking.

The last thing I’ve done in Audio Hijack is set my session to run when Audio Hijack launches. I also have Audio Hijack launch at startup, which means the session is always running.

In Loopback, I’ve created a virtual device called Mic Enhancer Pass-Thru. I’ve routed this to my output channels and then set the Mic Enhancer Pass-Thru virtual device as the output in my Audio Hijack session.

My Mic Enhancer Pass-Thru virtual Device in Loopback.

The Mic Enhancer Pass-Thru virtual device shows up alongside all the other input devices I have connected to my computer, which means I can select it in any app as if it were a hardware input.

The audio input device list in Loom, showing my Mic Enhancer Pass-Thru device.

As a bonus, the fact that my audio is enhanced through Audio Hijack means that I can speak very quietly into my mic and still get near-perfect dictation with MacWhisper. I’ve gotten great results out of a barely-audible whisper.

In the near future, I’m planning on testing whether or not I can whisper into a portable headset (or even into my AirPods) in a public setting like a coffee shop.

I make a lot of screen recordings. Not only are the majority of my YouTube videos screen-recorded Notion tutorials these days, but I also create tons and tons of screen recordings to provide support to people who have bought my premium Notion templates (like Ultimate Brain).

For the latter use case, Loom is my secret weapon. Instead of shooting, editing, manually uploading, and sharing a video, Loom lets me simply start recording my screen. Once I’m done, the video is automatically uploaded to my Loom account, and the URL is on my clipboard.

Here’s an example:

Loom also automatically generates a transcript and adds subtitles to the video. It is the fastest way to create a video, especially when I need to share something I’m doing on my computer screen.

I don’t use Loom for everything; if I’m creating a really short video that I want to paste into Slack or into a blog post, I typically use CleanShot X instead. But for longer, tutorial-style videos where I want to show a face-cam and include narration, Loom is my go-to tool.

Cleanshot X is the best screenshot tool, period. I feel bad that Windows users can’t use it (they should use ShareX instead).

I take a lot of screenshots every single day. I’m often sharing screenshots in Slack, in responses to support tickets in our customer community, and of course in blog posts like this one.

Cleanshot lets me take those screenshots very quickly, and it also has tools for making quick annotations. I know this is going to sound extremely picky, but Cleanshot has the best arrows in the game – and to me, that matters.

Cleanshot has the best arrows in the game.

I can also quickly add borders to my screenshots, which makes them prettier and makes me happier.

Most importantly, Cleanshot can automatically copy screenshots to my clipboard. It can also record videos, and it’ll copy those to the clipboard as well.

Combined with Clop (detailed below), Cleanshot gives me an extremely quick workflow for creating high-quality, low-file-size screenshots and videos that I can paste anywhere. I’ve set my copy up with several hotkeys to perform different actions:

  • ++1 – Draw a rectangle to take a screenshot and copy it to the clipboard
  • ++2 – Draw a rectangle to take a screenshot and open the annotation tools
  • ++3 – Make a screen recording
  • ++4 – Take a screenshot of a specific app window and open the annotation tools
  • ++5 – Launch scrolling capture (scroll inside a drawn rectangle to create a long screenshot – handy for web pages)

Clop is a handy little utility that can automatically optimize the file sizes of images and screen recordings that I make with CleanShot.

I use the heck out of this tool as I’m always trying to make sure my site’s pages load as quickly as possible – and minimizing image and video file sizes goes a long way towards that purpose. Additionally, certain apps like Slack have a file attachment size limit. Automatically reducing the file size of screen recordings I make allows me to paste longer files (I don’t like using Loom for videos I share in Slack, as you can’t make a Loom video full-screen while still seeing the current threat).

I’ve used other file compression tools in the past, but what makes me love Clop is the fact that it can automatically optimize images and videos the moment I save them – or even add them to my clipboard.

This creates an extremely efficient workflow for creating and sharing screenshots:

  1. Take the screenshot with Cleanshot.
  2. Add annotations if needed.
  3. Copy the screenshot to the clipboard with Cleanshot.
  4. Clop automatically compresses the screenshot, placing the optimized version on the clipboard.
  5. Paste the screenshot anywhere.

🤔 Have an UB Question?

Fill out the form below and I’ll answer as soon as I can! ~Thomas

🤔 Have a Question?

Fill out the form below and I’ll answer as soon as I can! ~Thomas