Add nemotron-nano-v2 support to voice agent by stevehuang52 · Pull Request #14704 · NVIDIA-NeMo/NeMo

stevehuang52 · 2025-09-10T00:58:02Z

Important

The Update branch button must only be pressed in very rare occassions.
An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.

What does this PR do ?

Add support for using NVIDIA-Nemotron-Nano-9B-v2 as the LLM in voice agent

Note that you need at least 21GB of GPU memory to use this setting

To launch the voice agent server with this config:

NEMO_PATH=???  # Use your local NeMo path for the latest version
export PYTHONPATH=$NEMO_PATH:$PYTHONPATH
python ./server/server.py

Signed-off-by: stevehuang52 <heh@nvidia.com>

nemo/agents/voice_agent/pipecat/services/nemo/llm.py

Signed-off-by: stevehuang52 <heh@nvidia.com>

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>

tango4j

Tested the vLLM based NemoTron nano v2.
Consider updating the points I mentioned.

nemo/agents/voice_agent/pipecat/services/nemo/llm.py

examples/voice_agent/README.md

Signed-off-by: stevehuang52 <heh@nvidia.com>

…into heh/voice_agent_vllm

Signed-off-by: stevehuang52 <heh@nvidia.com>

…into heh/voice_agent_vllm

Signed-off-by: stevehuang52 <heh@nvidia.com>

github-actions · 2025-09-30T21:46:58Z

[🤖]: Hi @stevehuang52 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully.

So it might be time to merge this PR or get some approvals.

//cc @chtruong814 @ko3n1g @pablo-garay @thomasdhc

Signed-off-by: taejinp <tango4j@gmail.com>

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

nemo/agents/voice_agent/utils/config_manager.py

Signed-off-by: stevehuang52 <heh@nvidia.com>

stevehuang52 · 2025-10-03T18:05:10Z

Tested with new environment and llama3.1 config

Signed-off-by: stevehuang52 <heh@nvidia.com>

Signed-off-by: taejinp <tango4j@gmail.com>

…/NeMo into heh/voice_agent_vllm

tango4j · 2025-10-03T22:27:33Z

unit test files were tested and also tested voice agent server check.
LGTM.

github-actions · 2025-10-03T22:29:04Z

[🤖]: Hi @stevehuang52 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully.

So it might be time to merge this PR or get some approvals.

//cc @chtruong814 @ko3n1g @pablo-garay @thomasdhc

stevehuang52 added 4 commits September 9, 2025 20:35

add vllm support

ad1775e

Signed-off-by: stevehuang52 <heh@nvidia.com>

refactor

616326b

Signed-off-by: stevehuang52 <heh@nvidia.com>

update cfg

26dfc43

Signed-off-by: stevehuang52 <heh@nvidia.com>

update cfg

f2e676f

Signed-off-by: stevehuang52 <heh@nvidia.com>

stevehuang52 requested review from tango4j and weiqingw4ng September 10, 2025 00:58

stevehuang52 self-assigned this Sep 10, 2025

stevehuang52 added skip-docs skip-linting labels Sep 10, 2025

github-advanced-security bot found potential problems Sep 10, 2025

View reviewed changes

nemo/agents/voice_agent/pipecat/services/nemo/llm.py Fixed Show fixed Hide fixed

stevehuang52 and others added 2 commits September 9, 2025 21:04

update for nano-v2

4d32ea6

Signed-off-by: stevehuang52 <heh@nvidia.com>

Potential fix for code scanning alert no. 16177: Unused import

9aa1f74

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>

tango4j reviewed Sep 10, 2025

View reviewed changes

nemo/agents/voice_agent/pipecat/services/nemo/llm.py Outdated Show resolved Hide resolved

examples/voice_agent/README.md Outdated Show resolved Hide resolved

tango4j and others added 7 commits September 10, 2025 13:01

Merge branch 'main' into heh/voice_agent_vllm

260c554

update and refactor

17b4192

Signed-off-by: stevehuang52 <heh@nvidia.com>

update readme

8fe143a

Signed-off-by: stevehuang52 <heh@nvidia.com>

Merge branch 'heh/voice_agent_vllm' of https://github.com/NVIDIA/NeMo …

89b9c37

…into heh/voice_agent_vllm

update to pipecat=0.0.84

034301f

Signed-off-by: stevehuang52 <heh@nvidia.com>

add auto start/stop vllm server

dc31217

Signed-off-by: stevehuang52 <heh@nvidia.com>

update readme

887182c

Signed-off-by: stevehuang52 <heh@nvidia.com>

stevehuang52 requested a review from tango4j September 11, 2025 19:29

stevehuang52 added 4 commits September 11, 2025 17:50

auto switch between vllm and hf

8ac8be3

Signed-off-by: stevehuang52 <heh@nvidia.com>

refactor

56d286d

Signed-off-by: stevehuang52 <heh@nvidia.com>

update default cfg

74412b9

Signed-off-by: stevehuang52 <heh@nvidia.com>

add qwen3 example, refactor

dca36f0

Signed-off-by: stevehuang52 <heh@nvidia.com>

stevehuang52 requested a review from KunalDhawan September 16, 2025 17:48

tango4j and others added 4 commits September 16, 2025 16:40

Merge branch 'main' into heh/voice_agent_vllm

8ac92c0

update readme according to feedback

bb2c1c3

Signed-off-by: stevehuang52 <heh@nvidia.com>

Merge branch 'heh/voice_agent_vllm' of https://github.com/NVIDIA/NeMo …

f4c00de

…into heh/voice_agent_vllm

pin package version

0d97406

Signed-off-by: stevehuang52 <heh@nvidia.com>

github-actions bot removed the Run CICD label Sep 30, 2025

tango4j and others added 4 commits October 2, 2025 03:30

Removed backup file

f910273

Signed-off-by: taejinp <tango4j@gmail.com>

Adding config manager and llm-specific yamls and fixed the bugs

358d90a

Signed-off-by: taejinp <tango4j@gmail.com>

Adding NeMoTron Nano-9B-v2 as a default

3130538

Signed-off-by: taejinp <tango4j@gmail.com>

Apply isort and black reformatting

152e633

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

stevehuang52 commented Oct 3, 2025

View reviewed changes

nemo/agents/voice_agent/utils/config_manager.py Show resolved Hide resolved

stevehuang52 commented Oct 3, 2025

View reviewed changes

nemo/agents/voice_agent/utils/config_manager.py Outdated Show resolved Hide resolved

stevehuang52 added 4 commits October 3, 2025 14:00

fix environment

6e5e242

Signed-off-by: stevehuang52 <heh@nvidia.com>

fix hf param resolve

9a3f113

Signed-off-by: stevehuang52 <heh@nvidia.com>

update readme

596084d

Signed-off-by: stevehuang52 <heh@nvidia.com>

update config manager, add llama3.1 example, refactor config style

4212612

Signed-off-by: stevehuang52 <heh@nvidia.com>

stevehuang52 and others added 7 commits October 3, 2025 14:40

update default yaml

c77d7b7

Signed-off-by: stevehuang52 <heh@nvidia.com>

update readme

e40f3f1

Signed-off-by: stevehuang52 <heh@nvidia.com>

update readme

695f55e

Signed-off-by: stevehuang52 <heh@nvidia.com>

pin nemo to 2.5

ff24f65

Signed-off-by: stevehuang52 <heh@nvidia.com>

fix env and cfg

891a027

Signed-off-by: stevehuang52 <heh@nvidia.com>

Removing Qwen from generic hf config

ea8bcae

Signed-off-by: taejinp <tango4j@gmail.com>

Merge branch 'heh/voice_agent_vllm' of https://github.com/NVIDIA-NeMo…

d8eccbb

…/NeMo into heh/voice_agent_vllm

tango4j added Run CICD and removed Run CICD labels Oct 3, 2025

tango4j approved these changes Oct 3, 2025

View reviewed changes

github-actions bot removed the Run CICD label Oct 3, 2025

stevehuang52 merged commit 56ddc45 into main Oct 3, 2025
64 of 70 checks passed

stevehuang52 deleted the heh/voice_agent_vllm branch October 3, 2025 23:13

coderabbitai bot mentioned this pull request Mar 23, 2026

Add exclusion check in bridge pub handler nanomq/nanomq#2255

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add nemotron-nano-v2 support to voice agent#14704

Add nemotron-nano-v2 support to voice agent#14704
stevehuang52 merged 50 commits intomainfrom
heh/voice_agent_vllm

stevehuang52 commented Sep 10, 2025 •

edited

Loading

Uh oh!

Uh oh!

tango4j left a comment

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Sep 30, 2025

Uh oh!

Uh oh!

Uh oh!

stevehuang52 commented Oct 3, 2025

Uh oh!

tango4j commented Oct 3, 2025

Uh oh!

github-actions bot commented Oct 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

stevehuang52 commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do ?

Uh oh!

Uh oh!

tango4j left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Sep 30, 2025

Uh oh!

Uh oh!

Uh oh!

stevehuang52 commented Oct 3, 2025

Uh oh!

tango4j commented Oct 3, 2025

Uh oh!

github-actions bot commented Oct 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

stevehuang52 commented Sep 10, 2025 •

edited

Loading