GitHub - phuc-nt/my-translator: Real-time speech translation — macOS & Windows, free TTS, no server, your API keys only

My Translator is a real-time speech translation desktop app built with Tauri. It captures audio directly from your system or microphone, transcribes it, and displays translations in a minimal overlay — with no intermediary server involved.

📖 Installation guides: macOS (EN) · macOS (VI) · Windows (EN) · Windows (VI)

How It Works

System Audio / Mic → 16kHz PCM → Soniox API (STT + Translation) → Overlay UI
                                                                    ↓ (optional)
                                                            TTS (Edge/Google/ElevenLabs) → 🔊

Feature	Detail
Latency	~2–3s
Languages	70+ (source) → any target, one-way & two-way
Cost	~$0.12/hr (Soniox API)
TTS	3 providers (Edge free, Google, ElevenLabs)
Platform	macOS (ARM + Intel) · Windows
Signed	✅ macOS signed & notarized
Auto-Update	✅ Built-in, check & install from Settings

Features

📖 Dual Panel View

Two display modes:

Single (default) — Translation text only, clean and focused
Dual — Source | Translation side-by-side, each panel scrolls independently

Toggle with the panel button (bottom-right on hover).

🔄 Smart Scroll

Auto-scroll only when you're at the bottom. Scroll up to read old content without being yanked back down.

🔤 Quick Font Size

A- / A+ floating controls (bottom-right on hover). Font size adjustable up to 140px — great for presentations.

🔄 Two-Way Translation

Translate conversations between two languages simultaneously — ideal for bilingual meetings.

One-way: Source language → Target language (e.g., Japanese → Vietnamese)
Two-way: Language A ↔ Language B (e.g., Vietnamese ↔ Japanese) — the app detects who is speaking and translates to the other language automatically

Setup for video calls (Zoom, Google Meet, MS Teams):

Audio Source: Both (System + Mic)
Translation Type: Two-way
Set Language A and Language B

Note: TTS narration is automatically disabled in two-way mode to prevent audio feedback loops (TTS output → mic recapture → re-translation).

🎙️ TTS Narration

Read translations aloud in one-way mode — 3 providers:

	Edge TTS ⭐	Google Chirp 3 HD	ElevenLabs
Cost	Free	Free 1M chars/mo	~$5/mo+
Quality	★★★★☆ Neural	★★★★★ Near-human	★★★★★ Premium
Vietnamese	✅ 2 voices	✅ 6 voices	✅ Yes
Setup	None	Google Cloud API key	API key
Speed control	✅	✅ 0.5x–2.0x	❌

TTS is OFF by default — toggle with the TTS button or ⌘ T.

📖 TTS guide: English · Tiếng Việt

📖 Custom Translation Terms

Define how domain-specific words should be translated:

Original sin = Tội nguyên tổ
Christ = Kitô
Pneumonia = Viêm phổi

Add terms in Settings → Translation → Translation terms. Great for religious, medical, or technical content.

🖥️ Local Mode (Apple Silicon only)

Experimental offline mode using MLX + Whisper + Gemma — runs 100% on-device. JA/EN/ZH/KO → VI/EN.

Privacy

Your audio never touches our servers — because there are none.

App connects directly to APIs you configure — no relay, no middleman
You own your API keys — stored locally, never transmitted elsewhere
No account, no telemetry, no analytics — zero tracking
Transcripts saved as .md files locally, per session

Tech Stack

Tauri 2 — Rust backend + WebView frontend
ScreenCaptureKit — macOS system audio
WASAPI — Windows system audio
cpal — Cross-platform microphone
Soniox — Real-time STT + translation
Edge TTS — Free neural TTS (default)
Google Cloud TTS — Chirp 3 HD (near-human quality)
ElevenLabs — Premium TTS

Build from Source

git clone https://github.com/phuc-nt/my-translator.git
cd my-translator
npm install
npm run tauri build

Requires: Rust (stable), Node.js 18+, macOS 13+ or Windows 10+.

Star History

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 105 Commits
.github/workflows		.github/workflows
.vscode		.vscode
docs		docs
scripts		scripts
src-tauri		src-tauri
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
banner.png		banner.png
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How It Works

Features

📖 Dual Panel View

🔄 Smart Scroll

🔤 Quick Font Size

🔄 Two-Way Translation

🎙️ TTS Narration

📖 Custom Translation Terms

🖥️ Local Mode (Apple Silicon only)

Privacy

Tech Stack

Build from Source

Star History

License

About

Uh oh!

Releases 12

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

How It Works

Features

📖 Dual Panel View

🔄 Smart Scroll

🔤 Quick Font Size

🔄 Two-Way Translation

🎙️ TTS Narration

📖 Custom Translation Terms

🖥️ Local Mode (Apple Silicon only)

Privacy

Tech Stack

Build from Source

Star History

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 12

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages