FluidVoice - Open source voice-to-text dictation app for macOS with local AI
FluidVoice is an open-source macOS dictation app that provides local AI-enhanced speech-to-text. It supports multiple speech models, command mode, write mode, and more, all running on-device for privacy. The 1.6.0 release introduces an ultra-low-latency Parakeet engine and Fluid Intelligence local AI enhancement.
Open source voice-to-text dictation app for macOS with on-device AI enhancement.
Install with Homebrew: brew install --cask fluidvoice
Manual download: latest release
Important
This project is free and open source under GPLv3. If FluidVoice is useful to you, please star the repository — it helps visibility and keeps development going.
Support FluidVoice
If FluidVoice helps you, you can support continued development and future platform work for iOS and Windows on GitHub Sponsors.
What's New in 1.6.0
Insanely fast Parakeet — rebuilt Parakeet implementation with pretty much zero delay between speaking and seeing words on screen
Fluid Intelligence — fully local AI model for on-device dictation enhancement. No cloud, no API keys, no data leaving your Mac
Better Theming — adaptive light/dark theme with a compact toolbar switcher
Refreshed Onboarding — language-first voice engine setup, real dictation tryout, and AI enhancement setup in one clean pass
Warning
Based on early feedback, Fluid Intelligence may cause you to unsubscribe from other dictation apps and save money. You've been warned.
Fluid Intelligence
FluidVoice is fully open source under GPLv3. Fluid Intelligence is a separate, privately maintained local AI runtime that powers advanced on-device dictation enhancement — smart formatting, context-aware capitalization, and post-processing — all running locally on your Mac.
The app works great on its own with any supported speech model and optional cloud AI providers. Fluid Intelligence adds a fully local, private AI layer for users who want on-device enhancement without sending data anywhere.
We're keeping Fluid Intelligence private for now so we can sustainably offer the core dictation experience for free. This may change in the future.
Fluid Intelligence Sneak Peek
Email Template: Email_template.mp4
Flowers: flowers.mp4
Change Time & Name: change_time_name.mp4
Emoji: emoji.mp4
Hyphens & Numbers: hyphen123.mp4
Demo
Command Mode — Take any action on your Mac using FluidVoice
FluidVoice_1.5_cmd_mode.mp4
Write Mode — Write or rewrite text in any text box in any app
writemode_FluidVoice_1.5.mp4
Screenshots
Command Mode
History & Stats
Features
Fluid Intelligence — on-device AI enhancement for smart formatting, context-aware capitalization, and post-processing, all running locally on your Mac with zero data leaving your machine
Command Mode — control your Mac by voice: launch apps, run shortcuts, trigger system actions, and automate workflows without touching the keyboard
Write Mode — write or rewrite text directly in any text field across any app. Select text and rewrite it, or dictate new content inline
Live Preview — real-time transcription overlay with notch support, so you see words appear as you speak
Multiple Speech Models — Nemotron Speech 3.5, Parakeet Flash, Parakeet TDT v3 & v2, Cohere Transcribe, Apple Speech, and Whisper. Pick the model that fits your language and latency needs
AI Enhancement — optional post-processing via OpenAI, Groq, custom providers, or local Fluid Intelligence for cleaner, more accurate transcripts
Audio History — optional local recording history with budget controls and ZIP export, so you can review past dictations without cloud storage
Today-Usage Stats — daily usage tracking at a glance with a stats header card and toolbar pill
Adaptive Theming — light/dark theme that follows your system, with a compact toolbar switcher
Global Hotkey — instant voice capture from anywhere, no app switching needed
Smart Typing — direct insertion into any app via accessibility APIs for reliable, app-independent text entry
Menu Bar Integration — quick access, status, and settings from the menu bar
Auto-Updates — seamless updates with an optional beta channel for early previews
Per-App Configuration — assign different prompt sets to different apps, so your dictation adapts to whatever you're working in. Fully optional
Notch-Aware Overlay — transcription overlay that fits cleanly around the MacBook notch, or use a standard overlay if your Mac doesn't have one
Local-First — your voice and text never leave your machine unless you opt in to a cloud AI provider
Fastest Parakeet on Mac — one of the fastest native implementations of Parakeet on macOS, with near-instant transcription and minimal latency
Configurable Overlay — choose from pill-shaped to large overlay sizes to show live preview, or keep it minimal. Everything is optional
Everything is Optional — AI enhancement, Fluid Intelligence, audio history, analytics, and beta builds are all opt-in. The core dictation experience works out of the box with zero configuration beyond permissions and a hotkey
Supported Models
| Model | Best for | Language support | Download size | Hardware |
| --- | --- | --- | --- | --- |
| Nemotron Speech 3.5 — Ultra Fast Low Latency | Streaming-capable multilingual dictation | ~40 languages | ~670 MB | Apple Silicon |
| Nemotron 3.5 Multilingual | Higher-accuracy multilingual dictation | ~40 languages | ~530 MB | Apple Silicon |
| Parakeet Flash (Beta) | Lowest-latency live English dictation | English | ~250 MB | Apple Silicon |
| Parakeet TDT v3 | Fast default multilingual dictation | 25 languages | ~500 MB | Apple Silicon |
| Parakeet TDT v2 | Fastest English-only dictation | English | ~500 MB | Apple Silicon |
| Cohere Transcribe | High-accuracy multilingual dictation | 14 languages | ~1.4 GB | Apple Silicon |
| Apple Speech | Zero-download native macOS speech | System languages | Built-in | Apple Silicon + Intel |
| Whisper Tiny / Base / Small / Medium / Large | Broad compatibility, including Intel Macs | 99 languages | ~75 MB to ~2.9 GB | Apple Silicon + Intel |
Parakeet TDT v3 Languages
Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, and Ukrainian.
Parakeet TDT v2 Languages
English.
Cohere Transcribe Languages
English, French, German, Italian, Spanish, Portuguese, Greek, Dutch, Polish, Mandarin, Japanese, Korean, Vietnamese, and Arabic.
Apple Speech Languages
System language support depends on the macOS speech recognition languages available on your machine.
Whisper Language Support
Whisper supports up to 99 languages, depending on the model size you choose.
Quick Start
Install with Homebrew:
brew install --cask fluidvoice
Or download the latest release.
Grant permissions — FluidVoice will ask for microphone and accessibility access. Both are required for dictation and typing into other apps.
Set your hotkey — pick a global hotkey in settings that triggers voice capture from anywhere.
Go through onboarding — choose your voice model based on your language and latency needs. Models range from zero-download Apple Speech to high-accuracy Nemotron and Whisper.
(Optional) Enable Fluid Intelligence — download the local AI model during onboarding for on-device dictation enhancement. Everything runs locally, no data leaves your Mac.
(Optional) Bring your own AI provider — add an OpenAI, Groq, or custom provider API key for cloud-based enhancement. Keys are stored securely in macOS Keychain. Select "Always allow" for key access.
(Optional) Opt in to beta builds — Settings → Automatic Updates → Beta Releases for early access to new features.
Requirements
macOS 15.0 (Sequoia) or later
Apple Silicon Mac for access to every model
Intel Macs supported via Whisper models (1.5.1 and later)
~1 GB disk space for a voice model
~3.5 GB disk space for the Fluid Intelligence model (optional)
Microphone access
Accessibility permissions for typing
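The version and hardware requirements above can be checked from a terminal before installing. The following is a minimal sketch, not part of the FluidVoice repository: the function names `meets_min_macos` and `models_for_arch` are hypothetical, and the Intel mapping follows the Hardware column of the Supported Models table.

```shell
#!/bin/sh
# Hypothetical preflight helper; not shipped with FluidVoice.

# Succeeds when the major component of a macOS version string is >= 15.
meets_min_macos() {
    major=${1%%.*}                      # "15.1.2" -> "15"
    [ "$major" -ge 15 ] 2>/dev/null     # non-numeric input counts as a failure
}

# Prints which models the README lists for a given `uname -m` value.
models_for_arch() {
    case "$1" in
        arm64)  echo "all models" ;;
        x86_64) echo "Whisper and Apple Speech" ;;
        *)      echo "unknown architecture" ;;
    esac
}

# Run the checks only where the macOS tooling exists.
if command -v sw_vers >/dev/null 2>&1; then
    version=$(sw_vers -productVersion)
    if meets_min_macos "$version"; then
        echo "macOS $version: OK"
    else
        echo "macOS $version: FluidVoice needs 15.0 or later"
    fi
    echo "$(uname -m): $(models_for_arch "$(uname -m)")"
fi
```

Disk space, microphone access, and accessibility permissions are not covered by this sketch; the app prompts for the permissions on first launch.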
Building from Source
git clone https://github.com/altic-dev/FluidVoice.git
cd FluidVoice
open Fluid.xcodeproj
Build and run in Xcode. All dependencies are managed via Swift Package Manager.
Build Only (No Signing)
xcodebuild -project Fluid.xcodeproj -scheme Fluid -destination 'platform=macOS' build CODE_SIGNING_ALLOWED=NO
Contributing
Contributions are welcome! Please create an issue first to discuss major changes before submitting a pull request.
Development Setup
Clone and open in Xcode as above.
Signing: FluidVoice → Signing & Capabilities → Automatically manage signing → pick your Team (Personal Team is fine). Stored in xcuserdata/ (gitignored).
Build and run — SPM handles dependencies.
(Optional) Pre-commit hook to prevent accidental team ID commits:
cp scripts/check-team-id.sh .git/hooks/pre-commit
chmod +x .git/hooks/pre-commit
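For readers curious what a hook like this might do, here is an illustrative sketch. It is not the contents of scripts/check-team-id.sh, which may work differently. It assumes a team ID shows up in a staged diff as a `DEVELOPMENT_TEAM = XXXXXXXXXX;` line with a 10-character uppercase alphanumeric value, and the function name `diff_adds_team_id` is hypothetical.

```shell
#!/bin/sh
# Illustrative sketch only; the real scripts/check-team-id.sh may differ.

# Reads a unified diff on stdin. Succeeds if an added line sets a
# non-empty DEVELOPMENT_TEAM (assumed: 10 uppercase letters or digits).
diff_adds_team_id() {
    grep -Eq '^\+.*DEVELOPMENT_TEAM = "?[A-Z0-9]{10}"?;'
}

# Inside a git work tree, inspect what is staged for commit.
if git rev-parse --is-inside-work-tree >/dev/null 2>&1; then
    if git diff --cached | diff_adds_team_id; then
        echo "Staged changes set DEVELOPMENT_TEAM; remove it before committing." >&2
        exit 1
    fi
fi
```

A pre-commit hook that exits with a non-zero status makes git abort the commit, which is what stops the team ID from being recorded.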
Pull Request Guidelines
One feature or fix per PR — keep changes focused and atomic
Create an issue first so work is trackable before review
Discuss non-trivial changes before opening a PR
Follow the PR template
Test thoroughly on your machine
Never commit personal team IDs or API keys
Check git diff before committing
Run Integration Tests
xcodebuild test -project Fluid.xcodeproj -scheme Fluid -destination 'platform=macOS'
CI uses unsigned builds:
xcodebuild test -project Fluid.xcodeproj -scheme Fluid -destination 'platform=macOS' CODE_SIGNING_REQUIRED=NO CODE_SIGNING_ALLOWED=NO
Privacy & Analytics
FluidVoice is local-first. Your voice, audio, and transcribed text never leave your machine unless you explicitly opt in to a cloud AI provider.
What's Collected (Opt-In)
Anonymous analytics are enabled by default to track app health and feature usage. You can disable them at any time from Settings → Share Anonymous Analytics.
Collected:
App version, build, macOS version
Low-cardinality feature/config flags (e.g. app mode, major settings)
Approximate usage ranges (not exact values)
High-level success/error outcomes
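To illustrate what "approximate usage ranges" means in practice, the sketch below maps an exact count to a coarse bucket so that only the bucket label would ever be reported. The boundaries and the function name `usage_bucket` are assumptions made for illustration; the ranges FluidVoice actually uses are not stated in this README.

```shell
#!/bin/sh
# Hypothetical bucketing; the real ranges are not documented here.

# Maps an exact non-negative count to a coarse range label.
usage_bucket() {
    count=$1
    if   [ "$count" -eq 0 ];   then echo "0"
    elif [ "$count" -lt 10 ];  then echo "1-9"
    elif [ "$count" -lt 100 ]; then echo "10-99"
    else                            echo "100+"
    fi
}

usage_bucket 37    # prints 10-99
```

Reporting a label such as `10-99` instead of `37` keeps the data low-cardinality, in line with the feature and config flags listed above.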
Not Collected:
Voice, raw audio, or transcribed text
Selected text, prompts, or AI responses
Terminal commands, window titles, file paths, clipboard, or typed content
Any personal or private information
Community
Join our Discord: https://discord.gg/VUPHaKSvYV
Follow development on X: @ALTIC_DEV
License
From 2026-02-23 onward, this project is licensed under the GNU General Public License, Version 3.0 (GPLv3).
Versions published before this date were licensed under Apache License 2.0.
About
Fastest and only macOS Dictation app with on-device STT and custom trained AI enhancement model - Local Wispr Flow alternative. One ⭐ takes us a long way :)) Windows, iOS and Linux coming soon.
altic.dev/fluid
Releases: 35 (latest: FluidVoice v1.6.1, Jun 28, 2026)
Sponsor this project: ko-fi.com/alticdev
Languages: Swift 99.7%, Shell 0.3%