v2.0.2 macOS arm64 Free · Open Source (MIT)

VoxOver

Transcribe video with Whisper, translate subtitles via Ollama, DeepSeek, OpenAI, or Anthropic, and export SRT, ASS, MKV, or hard-burned MP4 — all from a local Mac app. No account required. No subscription.

macOS · Apple Silicon (arm64)

Core Features

🎙

Local Whisper Transcription — No Cloud Upload

Transcribe speech from any video using OpenAI Whisper, running fully on your Mac. Choose from tiny, base, small, medium, or large models. Models are cached locally — no repeat downloads.

🌐

Ollama · DeepSeek · OpenAI · Anthropic Translation

Use a local Ollama model (100% private, no API key), DeepSeek for cloud translation, OpenAI (custom Base URL supported), or Anthropic Claude. Switch anytime in Settings.

📝

Global Translation Context for Better Accuracy

Optionally generate a topic, terminology, and style prompt from your video before translating. Improves consistency for technical, educational, or niche content.

⚡️

Parallel Subtitle Translation — Faster Processing

Translate multiple subtitle segments simultaneously. Adjust worker count to balance speed against API rate limits.

📦

Export SRT, ASS, MKV Soft Subtitles, or Hard-Burned MP4

Export a standalone .srt file, an .ass styled subtitle file, an .mkv with a soft subtitle track (switchable in any player), or a hard-burned .mp4 ready for YouTube or social media upload.

🖥

100% Local Mac App — Your Videos Never Leave Your Machine

Electron shell with a local Gradio UI. No cloud upload unless you use a cloud API backend. Works offline with Ollama for fully private subtitle translation.

How it works

  1. 01

    Install the app

    Download the DMG, drag VoxOver to Applications, and launch. The app starts a local Gradio server — your browser opens http://127.0.0.1:7860 automatically.

  2. 02

    Upload your video

    Drop any MP4 or MOV into the Gradio UI. Audio is extracted at 16 kHz mono.

  3. 03

    Pick a Whisper model

    Select tiny → large depending on your accuracy and speed needs. Model downloads once, then caches locally.

  4. 04

    Choose a translation backend

    Connect to a running Ollama instance or enter your DeepSeek API key.

  5. 05

    Run and export

    Transcription, translation, and muxing all happen in one click. Pick SRT, ASS, MKV, or MP4 output.

Output formats

FormatDescriptionBest for
.srt Plain subtitle file with timestamps Re-use in any editor or player
.ass Advanced SubStation Alpha — styled subtitles with fonts, colors, and positioning Styled captions for video platforms, fan-subs, karaoke
.mkv Original video + soft subtitle track Local playback, subtitle toggle in VLC/IINA
.mp4 Hard-burned subtitles baked into video Upload-ready for YouTube, social media

Requirements

System

  • macOS (Apple Silicon arm64 recommended)

For local translation (Ollama)

  • Ollama running locally
  • Any compatible model pulled, e.g. qwen3:latest
  • No internet required after setup

For cloud translation

  • DeepSeek, OpenAI, or Anthropic API key
  • OpenAI supports custom Base URL for compatible APIs
  • Internet connection required
  • Pay-per-use (no subscription)

FAQ

Does it upload my video to the cloud?
No. Video processing (audio extraction, Whisper transcription) all happens locally. Only the subtitle text is sent to DeepSeek if you choose that backend. Ollama is 100% local.
Which Whisper model should I use?
base is fast and accurate enough for most content. Use medium or large for technical or heavily accented speech. Models are cached after the first download.
Can I translate into any language?
Translation quality depends on your chosen backend model. Ollama models like qwen3 support Chinese, Japanese, Korean, and major European languages. DeepSeek covers 50+ languages.
Is Intel Mac supported?
The current DMG is arm64 only. Intel support may be added in a future release. You can run from source on Intel now.
Where are Whisper models stored?
Models are cached to ~/Library/Application Support/VoxOver/whisper-models. If you have existing models in ~/.cache/whisper, they are copied over automatically on first launch.
Is it free?
Yes. The app is free and open-source (MIT). DeepSeek API usage is billed by DeepSeek directly at their standard rates.

Download VoxOver

v1.3.1 · macOS arm64 · Free & open source (MIT)