Meta: Llama 3.2 11B Vision Instruct

Meta-llama

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and tex

Ultra-low costMultimodal vision
TextVision

Specifications

VendorMeta-llama
Familymeta-llama
Context131.1K · 98.3k words(≈ a thesis)
Max output131.1K · 98.3k words(≈ a thesis)
Released2026-06-19
Open sourceNo
Tool use

Pricing across channels

per 1M tokens, USD

ChannelTypeInputOutput
OpenRouter (auto)Aggregator$0.34$0.34visitAd

Related models

Similar tier, vendor or price range