Nano V2 VLM Blog #111

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

chrisalexiuk-nvidia wants to merge 8 commits into vllm-project:main from chrisalexiuk-nvidia:nano-2-vlm-blog

+111 −0

Contributor

chrisalexiuk-nvidia commented Oct 28, 2025

Blog for Nano V2 VLM

vercel bot deployed to Preview

October 28, 2025 20:46

View deployment


          Nano V2 VLM Blog

392bbeb

Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com>

chrisalexiuk-nvidia force-pushed the nano-2-vlm-blog branch from feca096 to 392bbeb Compare

October 28, 2025 20:48

vercel bot deployed to Preview

October 28, 2025 20:48

View deployment


          Merge branch 'main' into nano-2-vlm-blog

02ad989

vercel bot deployed to Preview

October 28, 2025 20:48

View deployment

chrisalexiuk-nvidia added 2 commits

October 28, 2025 18:31


          Nano V2 VLM Blog

8d24821

Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com>


          Merge branch 'nano-2-vlm-blog' of github.com:chrisalexiuk-nvidia/vllm…

93ea815

…-project.github.io into nano-2-vlm-blog

vercel bot deployed to Preview

October 28, 2025 22:33

View deployment

hmellor requested changes

View reviewed changes

_posts/2025-10-28-run-multimodal-reasoning-agents-nvidia-nemotron.md Outdated Show resolved Hide resolved

_posts/2025-10-28-run-multimodal-reasoning-agents-nvidia-nemotron.md Outdated Show resolved Hide resolved

ywang96 reviewed

View reviewed changes

_posts/2025-10-28-run-multimodal-reasoning-agents-nvidia-nemotron.md Outdated Show resolved Hide resolved


          update

e0ef491

Signed-off-by: Roger Wang <hey@rogerw.io>

vercel bot deployed to Preview

October 30, 2025 02:06

View deployment


          update

e7626f9

Signed-off-by: Roger Wang <hey@rogerw.io>

vercel bot deployed to Preview

October 30, 2025 02:16

View deployment


          add activation

194b929

Signed-off-by: Roger Wang <hey@rogerw.io>

vercel bot deployed to Preview

October 30, 2025 02:41

View deployment


          clean up

Signed-off-by: Roger Wang <hey@rogerw.io>

vercel bot deployed to Preview

October 30, 2025 02:44

View deployment

hmellor reviewed

View reviewed changes

assets/figures/2025-multimodal-nvidia-nemotron/figure2.png

Member

hmellor Oct 30, 2025

nit:

... 4x Fewer Tokens

hmellor reviewed

View reviewed changes

_posts/2025-10-29-run-multimodal-reasoning-agents-nvidia-nemotron.md

+              * Architecture:
+                * [CRADIOH-V2](https://huggingface.co/nvidia/C-RADIOv2-H) based Vision Encoder
+                * Efficient video sampling as token compression module
+                * Hybrid Transformer-Mamba Architecture- [Nemotron Nano 2 LLM](https://huggingface.co/nvidia/NVIDIA-Nemotron-Nano-9B-v2) backbone with reasoning.

Member

hmellor Oct 30, 2025

Suggested change

      
              * Hybrid Transformer-Mamba Architecture- [Nemotron Nano 2 LLM](https://huggingface.co/nvidia/NVIDIA-Nemotron-Nano-9B-v2) backbone with reasoning.   
          
              * Hybrid Transformer-Mamba Architecture - [Nemotron Nano 2 LLM](https://huggingface.co/nvidia/NVIDIA-Nemotron-Nano-9B-v2) backbone with reasoning.

hmellor reviewed

View reviewed changes

_posts/2025-10-29-run-multimodal-reasoning-agents-nvidia-nemotron.md

+                * [CRADIOH-V2](https://huggingface.co/nvidia/C-RADIOv2-H) based Vision Encoder
+                * Efficient video sampling as token compression module
+                * Hybrid Transformer-Mamba Architecture- [Nemotron Nano 2 LLM](https://huggingface.co/nvidia/NVIDIA-Nemotron-Nano-9B-v2) backbone with reasoning.
+              * Accuracy

Member

hmellor Oct 30, 2025

Suggested change

      
            * Accuracy  
          
            * Accuracy:

hmellor reviewed

View reviewed changes

_posts/2025-10-29-run-multimodal-reasoning-agents-nvidia-nemotron.md

+              * Get started:
+                * Download model weights from Hugging Face \- [BF16](https://huggingface.co/nvidia/Nemotron-Nano-12B-v2-VL-BF16), [FP8](https://huggingface.co/nvidia/Nemotron-Nano-12B-v2-VL-FP8), [FP4-QAD](https://huggingface.co/nvidia/Nemotron-Nano-12B-v2-VL-FP4-QAD)
+                * Run with vLLM for inference
+                * [Technical report](https://www.overleaf.com/project/68d1d48c83696e11ba669f70) to build custom, optimized models with Nemotron techniques..

Member

hmellor Oct 30, 2025

This links to overleaf, not a published report

hmellor reviewed

View reviewed changes

_posts/2025-10-29-run-multimodal-reasoning-agents-nvidia-nemotron.md

+              )
+              print(resp.choices[0].message.content)
+              ```
+              For more examples, check out our vLLM cookbook: [Nemotron-Nano2-VL/vllm_cookbook.ipynb](https://github.com/NVIDIA-NeMo/Nemotron/blob/main/usage-cookbook/Nemotron-Nano2-VL/vllm_cookbook.ipynb)

Member

hmellor Oct 30, 2025

Suggested change

      
            For more examples, check out our vLLM cookbook: [Nemotron-Nano2-VL/vllm_cookbook.ipynb](https://github.com/NVIDIA-NeMo/Nemotron/blob/main/usage-cookbook/Nemotron-Nano2-VL/vllm_cookbook.ipynb)
          
            For more examples, check out our vLLM cookbook: [Nemotron-Nano2-VL/vllm_cookbook.ipynb](https://github.com/NVIDIA-NeMo/Nemotron/blob/main/usage-cookbook/Nemotron-Nano2-VL/vllm_cookbook.ipynb).

hmellor reviewed

View reviewed changes

_posts/2025-10-29-run-multimodal-reasoning-agents-nvidia-nemotron.md

Comment on lines +109 to +111

		[Share your ideas](http://nemotron.ideas.nvidia.com/?ncid=so-othe-692335) and vote on what matters to help shape the future of Nemotron.

		Stay up to date on [NVIDIA Nemotron](https://developer.nvidia.com/nemotron) by subscribing to NVIDIA news and following NVIDIA AI on [LinkedIn](https://www.linkedin.com/showcase/nvidia-ai/posts/?feedView=all), [X](https://x.com/NVIDIAAIDev), [YouTube](https://www.youtube.com/@NVIDIADeveloper), and the [Nemotron channel](https://discord.com/channels/1019361803752456192/1407781691698708682) on [Discord](https://discord.com/invite/nvidiadeveloper).

Member

hmellor Oct 30, 2025

Itallic text seems unusual here

Suggested change

      
            [*Share your ideas*](http://nemotron.ideas.nvidia.com/?ncid=so-othe-692335) *and vote on what matters to help shape the future of Nemotron.* 
          
            *Stay up to date on [NVIDIA Nemotron](https://developer.nvidia.com/nemotron) by subscribing to NVIDIA news and following NVIDIA AI on [LinkedIn](https://www.linkedin.com/showcase/nvidia-ai/posts/?feedView=all), [X](https://x.com/NVIDIAAIDev), [YouTube](https://www.youtube.com/@NVIDIADeveloper)*, *and the [Nemotron channel](https://discord.com/channels/1019361803752456192/1407781691698708682) on [Discord](https://discord.com/invite/nvidiadeveloper).*
          
            [Share your ideas](http://nemotron.ideas.nvidia.com/?ncid=so-othe-692335) and vote on what matters to help shape the future of Nemotron.
          
            Stay up to date on [NVIDIA Nemotron](https://developer.nvidia.com/nemotron) by subscribing to NVIDIA news and following NVIDIA AI on [LinkedIn](https://www.linkedin.com/showcase/nvidia-ai/posts/?feedView=all), [X](https://x.com/NVIDIAAIDev), [YouTube](https://www.youtube.com/@NVIDIADeveloper), and the [Nemotron channel](https://discord.com/channels/1019361803752456192/1407781691698708682) on [Discord](https://discord.com/invite/nvidiadeveloper).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet