Add KokoroTTS support for voice agent framework by tango4j · Pull Request #14910 · NVIDIA-NeMo/NeMo

tango4j · 2025-10-09T20:04:47Z

What does this PR do ?

This PR adds kokoro TTS support and configurations to NeMo Voice Agent framework.

Collection: [Note which collection this PR will affect]

voice agent

Changelog

See
nemo/agents/voice_agent/pipecat/services/nemo/tts.py
for core changes

Usage

You can potentially add a usage example below

python3 <NEMO>/examples/voice_agent/server/bot_websocket_server.py

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in NeMo ASR.

Additional Information

Related to # (issue)

Signed-off-by: taejinp <tango4j@gmail.com>

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

Signed-off-by: taejinp <tango4j@gmail.com>

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

nemo/agents/voice_agent/pipecat/services/nemo/tts.py

Signed-off-by: taejinp <tango4j@gmail.com>

…NeMo/NeMo into tango4j/add_kokoro_to_va

stevehuang52 · 2025-10-10T14:48:48Z

nemo/agents/voice_agent/pipecat/services/nemo/tts.py

+
+    Args:
+        lang_code: Language code for the model (default: 'a' for American English)
+        voice: Voice to use (default: 'af_heart')


can we list more voice options in the comments?

af_heart, af_bella, am_fenrir am_michael

are recommend

Check out
https://huggingface.co/hexgrad/Kokoro-82M/blob/main/VOICES.md

stevehuang52 · 2025-10-10T14:52:59Z

nemo/agents/voice_agent/pipecat/services/nemo/tts.py

+
+    def __init__(
+        self,
+        lang_code: str = "a",


It seems that the language code from Kokoro is every different from the ones used in other components (ASR/LLM/TTS), where "en-us" would be used for "American english". Maybe it's better to create a mapping from the ISO standard to the ones used by kokoro?

Just in case for the future, here is the mapping table:

emoji,letter,language,ietf_code 🇺🇸,a,American English,en-US 🇬🇧,b,British English,en-GB 🇪🇸,e,Spanish,es-ES 🇫🇷,f,French (France),fr-FR 🇮🇳,h,Hindi,hi-IN 🇮🇹,i,Italian,it-IT 🇯🇵,j,Japanese,ja-JP 🇧🇷,p,Brazilian Portuguese,pt-BR 🇨🇳,z,Mandarin Chinese,zh-CN

Signed-off-by: stevehuang52 <heh@nvidia.com>

github-actions · 2025-10-10T23:14:14Z

[🤖]: Hi @tango4j 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully.

So it might be time to merge this PR or get some approvals.

//cc @chtruong814 @ko3n1g @pablo-garay @thomasdhc

stevehuang52

LGTM, thanks~!

tango4j and others added 3 commits October 9, 2025 12:17

Adding Kokoro TTS to TTS options

c992814

Signed-off-by: taejinp <tango4j@gmail.com>

Adding environment and req

524be6f

Signed-off-by: taejinp <tango4j@gmail.com>

Apply isort and black reformatting

6479a43

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

tango4j requested a review from stevehuang52 October 9, 2025 20:06

tango4j added 2 commits October 9, 2025 13:11

Removed unused import

9ecf3d2

Signed-off-by: taejinp <tango4j@gmail.com>

Removed unused import

9ae3670

Signed-off-by: taejinp <tango4j@gmail.com>

tango4j added Run CICD and removed Run CICD labels Oct 9, 2025

Apply isort and black reformatting

aac2b4a

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

chtruong814 added Run CICD and removed Run CICD labels Oct 9, 2025

github-actions bot removed the Run CICD label Oct 9, 2025

github-advanced-security bot found potential problems Oct 9, 2025

View reviewed changes

nemo/agents/voice_agent/pipecat/services/nemo/tts.py Fixed Show fixed Hide fixed

tango4j added 2 commits October 9, 2025 15:06

Adding eSpeakNG and nvidia yaml

61f4537

Signed-off-by: taejinp <tango4j@gmail.com>

Merge branch 'tango4j/add_kokoro_to_va' of https://github.com/NVIDIA-…

d9c5c6a

…NeMo/NeMo into tango4j/add_kokoro_to_va

stevehuang52 reviewed Oct 10, 2025

View reviewed changes

stevehuang52 added 6 commits October 10, 2025 11:05

update default prompt to exclude emoji

979a5a2

Signed-off-by: stevehuang52 <heh@nvidia.com>

fix logger

504d854

Signed-off-by: stevehuang52 <heh@nvidia.com>

fix pylint

cc4aadd

Signed-off-by: stevehuang52 <heh@nvidia.com>

add vllm logging

524e476

Signed-off-by: stevehuang52 <heh@nvidia.com>

fix linting

ee15d19

Signed-off-by: stevehuang52 <heh@nvidia.com>

fix linting

2c0c768

Signed-off-by: stevehuang52 <heh@nvidia.com>

tango4j self-assigned this Oct 10, 2025

tango4j added Run CICD and removed Run CICD labels Oct 10, 2025

github-actions bot removed the Run CICD label Oct 10, 2025

stevehuang52 approved these changes Oct 10, 2025

View reviewed changes

stevehuang52 merged commit bd808fd into main Oct 10, 2025
64 of 70 checks passed

stevehuang52 deleted the tango4j/add_kokoro_to_va branch October 10, 2025 23:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add KokoroTTS support for voice agent framework#14910

Add KokoroTTS support for voice agent framework#14910
stevehuang52 merged 14 commits intomainfrom
tango4j/add_kokoro_to_va

tango4j commented Oct 9, 2025

Uh oh!

Uh oh!

stevehuang52 Oct 10, 2025

Uh oh!

tango4j Oct 10, 2025

Uh oh!

stevehuang52 Oct 10, 2025

Uh oh!

tango4j Oct 10, 2025

Uh oh!

github-actions bot commented Oct 10, 2025

Uh oh!

stevehuang52 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

tango4j commented Oct 9, 2025

What does this PR do ?

Changelog

Usage

GitHub Actions CI

Before your PR is "Ready for review"

Who can review?

Additional Information

Uh oh!

Uh oh!

stevehuang52 Oct 10, 2025

Choose a reason for hiding this comment

Uh oh!

tango4j Oct 10, 2025

Choose a reason for hiding this comment

Uh oh!

stevehuang52 Oct 10, 2025

Choose a reason for hiding this comment

Uh oh!

tango4j Oct 10, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Oct 10, 2025

Uh oh!

stevehuang52 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants