Skip to content

Add nemotron-nano-v2 support to voice agent#14704

Merged
stevehuang52 merged 50 commits intomainfrom
heh/voice_agent_vllm
Oct 3, 2025
Merged

Add nemotron-nano-v2 support to voice agent#14704
stevehuang52 merged 50 commits intomainfrom
heh/voice_agent_vllm

Conversation

@stevehuang52
Copy link
Copy Markdown
Collaborator

@stevehuang52 stevehuang52 commented Sep 10, 2025

Important

The Update branch button must only be pressed in very rare occassions.
An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.

What does this PR do ?

Add support for using NVIDIA-Nemotron-Nano-9B-v2 as the LLM in voice agent

Note that you need at least 21GB of GPU memory to use this setting

To launch the voice agent server with this config:

NEMO_PATH=???  # Use your local NeMo path for the latest version
export PYTHONPATH=$NEMO_PATH:$PYTHONPATH
python ./server/server.py

Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
stevehuang52 and others added 2 commits September 9, 2025 21:04
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Copy link
Copy Markdown
Collaborator

@tango4j tango4j left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tested the vLLM based NemoTron nano v2.
Consider updating the points I mentioned.

tango4j and others added 7 commits September 10, 2025 13:01
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
tango4j and others added 4 commits September 16, 2025 16:40
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
@github-actions github-actions bot removed the Run CICD label Sep 30, 2025
@github-actions
Copy link
Copy Markdown
Contributor

[🤖]: Hi @stevehuang52 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully.

So it might be time to merge this PR or get some approvals.

//cc @chtruong814 @ko3n1g @pablo-garay @thomasdhc

tango4j and others added 4 commits October 2, 2025 03:30
Signed-off-by: taejinp <tango4j@gmail.com>
Signed-off-by: taejinp <tango4j@gmail.com>
Signed-off-by: taejinp <tango4j@gmail.com>
Signed-off-by: tango4j <tango4j@users.noreply.github.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
@stevehuang52
Copy link
Copy Markdown
Collaborator Author

Tested with new environment and llama3.1 config

stevehuang52 and others added 7 commits October 3, 2025 14:40
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: taejinp <tango4j@gmail.com>
@tango4j tango4j added Run CICD and removed Run CICD labels Oct 3, 2025
@tango4j
Copy link
Copy Markdown
Collaborator

tango4j commented Oct 3, 2025

unit test files were tested and also tested voice agent server check.
LGTM.

@github-actions github-actions bot removed the Run CICD label Oct 3, 2025
@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Oct 3, 2025

[🤖]: Hi @stevehuang52 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully.

So it might be time to merge this PR or get some approvals.

//cc @chtruong814 @ko3n1g @pablo-garay @thomasdhc

@stevehuang52 stevehuang52 merged commit 56ddc45 into main Oct 3, 2025
64 of 70 checks passed
@stevehuang52 stevehuang52 deleted the heh/voice_agent_vllm branch October 3, 2025 23:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants