Nemotron Nano VL 12B · NVIDIA
NVIDIA Nemotron Nano 2 VL is a 12B multimodal model with a hybrid Transformer-Mamba architecture, designed for image, document and video understanding. It's the first vision-capable model available on the free plan — upload a photo of a problem, a receipt or a screenshot and ask away. 128K context.
About Nemotron Nano VL 12B
NVIDIA Nemotron Nano 2 VL is a 12B multimodal model with a hybrid Transformer-Mamba architecture, designed for image, document and video understanding. It's the first vision-capable model available on the free plan — upload a photo of a problem, a receipt or a screenshot and ask away. 128K context.
Where it shines: Free vision (images and video) · Document reading · 128K context.
How to use Nemotron Nano VL 12B
-
1
Type or upload
Type what you want in the box above — or upload the file if the tool asks for one.
-
2
Generate
Click the main button. Wait 2-30 seconds depending on the model and input size.
-
3
Download or share
Download the result or share the direct link. No watermark, ready to use.
Frequently asked questions
How much does it cost to use Nemotron Nano VL 12B?
Nemotron Nano VL 12B is one of the free models in the catalog. Each use discounts 10 tokens from your pool, but open models like Nemotron Nano VL 12B don’t cost us, so the rate-limit is generous. A free account comes with 500 initial tokens and 25 more every day — you usually don’t get to touch the card.
Is there a limit on the use of Nemotron Nano VL 12B?
There is no fixed monthly fee for Nemotron Nano VL 12B on the free account — the actual limit is the rate per minute/hour, not per month. Anonymous are limited by IP; with account you can do much more volume.If you reach 500+25 tokens and need more, a Pro plan at $9/month covers it.
What makes Nemotron Nano VL 12B special?
NVIDIA fine-tunes its models for fast inference on its own optimized hardware — good at technical questions and reasoning. Its specific strengths are: free viewing (images and video), document reading, and 128k context. It accepts images as input (multimodal) as well as text — useful for describing captures, reading graphs, or solving problems from a photo.
How fast does Nemotron Nano VL 12B respond?
Nemotron Nano VL 12B has a balanced speed: 5-15 seconds per response — neither the fastest nor the slowest in the catalog.The actual time also depends on the length of the prompt and the load of the datacenter — models with huge context take longer when you enter very long texts.
How do I use Nemotron Nano VL 12B in ia.gratis?
You can use Nemotron Nano VL 12B from /chat/ by selecting Nemotron Nano VL 12B in the picker, or via the REST API with `model=nemotron-vl` in the body of the POST. Quick summary: free view: read images, documents and video.The internal model identifier is `nemotron-vl` — useful when integrating by API.