Meta: Llama 3.2 11B Vision Instruct

Meta-llama

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and tex

Ultra-low costMultimodal vision

TextVision

Visit official site →Compare →

Specifications

Vendor	Meta-llama
Family	meta-llama
Context	131.1K · 98.3k words（≈ a thesis）
Max output	131.1K · 98.3k words（≈ a thesis）
Released	2026-06-19
Open source	No
Tool use	—

Pricing across channels

per 1M tokens, USD

Channel	Type	Input	Output	Cache read	Batch	→
OpenRouter (auto)	Aggregator	$0.34	$0.34	—	—	visitAd

OpenRouter API (auto-fetched)· 2026-06-19

Related models

Similar tier, vendor or price range

Meta: Llama 3.1 70B Instruct

Intel. —$0.40/$0.40

Meta: Llama 3.3 70B Instruct

Intel. —$0.10/$0.32

Meta: Llama 3.2 3B Instruct

Intel. —$0.051/$0.34

Meta: Llama Guard 4 12B

Intel. —$0.18/$0.18