Hex — Voice → Text

Press-and-hold a hotkey to transcribe your voice and paste the result wherever you're typing. Now with AI-powered summarization using local Ollama models!

Download Hex for macOS

Note: Hex is currently only available for Apple Silicon Macs.

I've opened-sourced the project in the hopes that others will find it useful! We rely on the awesome WhisperKit for transcription, and the incredible Swift Composable Architecture for structuring the app. The new AI summarization feature integrates with Ollama to provide intelligent summaries of your transcriptions using local language models. Please open issues with any questions or feedback! ❤️

Join our Discord community for support, discussions, and updates!

Features

Voice Transcription: Press-and-hold or double-tap hotkeys for flexible recording modes
AI Summarization: Generate intelligent summaries of selected transcriptions using local Ollama models
History Management: View, search, and manage all your transcriptions
Privacy-First: All processing happens locally on your machine
Hotkey Customization: Configure global hotkeys for seamless workflow integration

Instructions

Once you open Hex, you'll need to grant it microphone and accessibility permissions—so it can record your voice and paste the transcribed text into any application, respectively.

Recording Modes

Once you've configured a global hotkey, there are two recording modes:

Press-and-hold the hotkey to begin recording, say whatever you want, and then release the hotkey to start the transcription process.
Double-tap the hotkey to lock recording, say whatever you want, and then tap the hotkey once more to start the transcription process.

AI Summarization (Optional)

To use the AI summarization feature:

Install Ollama: Download and install Ollama on your Mac
Download a model: Run ollama pull llama3.2 (or your preferred model) in Terminal
Enable in Settings: Go to Hex Settings → Ollama and enable the integration
Configure: Set your Ollama base URL (default: http://localhost:11434) and select your model
Use: In the History tab, select multiple transcriptions and click "Generate Summary"

The AI summarization works entirely locally with your Ollama installation - no data is sent to external servers.

⚠️ Note: The first time you run Hex, it will download and compile the Whisper model for your machine. During this process, you may notice high CPU usage from a background process called ANECompilerService. This is macOS optimizing the model for the Apple Neural Engine (ANE), and it's a normal one-time setup step.

Depending on your CPU and the size of the model, this may take anywhere from a few seconds to a few minutes.

Project Structure

Hex is organized into several directories, each serving a specific purpose:

App/
- Contains the main application entry point (HexApp.swift) and the app delegate (HexAppDelegate.swift), which manage the app's lifecycle and initial setup.
Clients/
- PasteboardClient.swift
  - Manages pasteboard operations for copying transcriptions.
- SoundEffect.swift
  - Controls sound effects for user feedback.
- RecordingClient.swift
  - Manages audio recording and microphone access.
- KeyEventMonitorClient.swift
  - Monitors global key events for hotkey detection.
- TranscriptionClient.swift
  - Interfaces with WhisperKit for transcription services.
- OllamaClient.swift
  - Handles communication with local Ollama API for AI summarization.
Features/
- AppFeature.swift
  - The root feature that composes transcription, settings, and history.
- TranscriptionFeature.swift
  - Manages the core transcription logic and recording flow.
- SettingsFeature.swift
  - Handles app settings, including hotkey configuration, permissions, and Ollama integration.
- HistoryFeature.swift
  - Manages the transcription history view with multi-selection and AI summarization capabilities.
Models/
- HexSettings.swift
  - Stores user preferences like hotkey settings, sound preferences, and Ollama configuration.
- HotKey.swift
  - Represents the hotkey configuration.
Resources/
- Contains the app's assets, including the app icon and sound effects.
- changelog.md
  - A log of changes to the app.
- Data/languages.json
  - A list of supported languages for transcription.
- Audio/
  - Sound effects for user feedback.

Name		Name	Last commit message	Last commit date
Latest commit History 108 Commits
.cursor/rules		.cursor/rules
.github		.github
Hex.xcodeproj		Hex.xcodeproj
Hex		Hex
HexTests		HexTests
.gitignore		.gitignore
.swiftlint.yml		.swiftlint.yml
CLAUDE.md		CLAUDE.md
Localizable.xcstrings		Localizable.xcstrings
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hex — Voice → Text

Features

Instructions

Recording Modes

AI Summarization (Optional)

Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hex — Voice → Text

Features

Instructions

Recording Modes

AI Summarization (Optional)

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages