Hermes
Local-first, privacy-focused meeting transcription for macOS.
No cloud services. No data leaves your device.
Hermes captures audio from your calls — Zoom, Google Meet, Microsoft Teams, FaceTime, or anything that outputs system audio — and your microphone simultaneously. It transcribes both streams locally using WhisperKit and displays a live transcript in a floating overlay that hovers above your call window.
Think of it as a personal, offline Granola — transcription only, no data leaves your Mac, ever.
-
Dual-Stream Capture
Captures system audio (remote participants) via Core Audio Taps and the microphone (you) via AVAudioEngine. Because each stream has a known source, speaker labels ("Me" vs. "Them") come for free, with no ML speaker identification needed.
-
On-Device Transcription
WhisperKit running on the Apple Neural Engine with the large-v3 model. Nothing leaves your Mac. No API keys, no subscriptions.
-
Floating Overlay
A transparent panel that hovers above your call window. Collapses to a tiny pill icon when you don't need it. Always on top, never in the way.
-
Completely Private
No network calls, no analytics, no telemetry, no cloud anything. Audio is captured, transcribed, and stored on your Mac — period.
-
Global Hotkey
Start and stop recording from anywhere with Cmd+Shift+R. No need to switch windows.
-
Session History
All transcripts are persisted locally via SwiftData. Browse past sessions and export as Markdown.
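To make the "Me" vs. "Them" labeling and the Markdown export concrete, here is a minimal sketch in Swift. It assumes each stream has already been transcribed into timestamped text. The names Speaker, Segment, mergeStreams, and markdown are hypothetical and exist only for illustration; they are not Hermes's actual data model or export format.

```swift
import Foundation

// Hypothetical types for illustration; not Hermes's actual data model.
enum Speaker: String {
    case me = "Me"      // segments transcribed from the microphone stream
    case them = "Them"  // segments transcribed from the system-audio stream
}

struct Segment {
    let speaker: Speaker
    let start: TimeInterval  // seconds since the session began
    let text: String
}

/// Tags each stream's segments with its source and interleaves them by start time.
func mergeStreams(mic: [(start: TimeInterval, text: String)],
                  system: [(start: TimeInterval, text: String)]) -> [Segment] {
    let mine = mic.map { Segment(speaker: .me, start: $0.start, text: $0.text) }
    let theirs = system.map { Segment(speaker: .them, start: $0.start, text: $0.text) }
    return (mine + theirs).sorted { $0.start < $1.start }
}

/// Renders merged segments as a Markdown transcript, one line per segment.
func markdown(title: String, segments: [Segment]) -> String {
    var lines = ["# \(title)", ""]
    for s in segments {
        let total = Int(s.start)
        let stamp = String(format: "%02d:%02d", total / 60, total % 60)
        lines.append("**\(s.speaker.rawValue)** [\(stamp)]: \(s.text)")
    }
    return lines.joined(separator: "\n")
}
```

The speaker label is decided purely by which stream a segment came from, which is why no speaker-identification model is involved in this sketch. One consequence is that all remote participants share the single "Them" label.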
-
Download the DMG or build from source.
-
Learn how to record, transcribe, and manage sessions.
-
How Hermes captures audio and transcribes it.
-
What data Hermes collects (nothing) and where it goes (nowhere).