Google: Gemma 3 12B

Google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 12

Ultra-low cost · Multimodal vision
Text · Vision

Specifications

Vendor: Google
Family: google
Context: 131.1K · 98.3k words (≈ a thesis)
Max output: 131.1K · 98.3k words (≈ a thesis)
Released: 2026-06-19
Open source: No
Tool use

Pricing across channels

per 1M tokens, USD

Channel            Type        Input   Output
OpenRouter (auto)  Aggregator  $0.050  $0.15
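
As a rough illustration of how the per-1M-token prices above translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost_usd` and the example token counts are hypothetical. The two rates are copied from the OpenRouter row. It assumes simple linear billing with no minimums, caching discounts, or image-specific charges, which may not match actual invoicing.

```python
# Rates copied from the pricing row above (USD per 1M tokens, OpenRouter channel).
INPUT_USD_PER_MTOK = 0.050
OUTPUT_USD_PER_MTOK = 0.15


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: linear cost estimate for a single request.

    Assumes cost scales linearly with token counts; real billing may differ.
    """
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(100_000, 2_000):.4f}")
```

Under these assumptions, input tokens dominate the cost for long-context prompts, since output is priced three times higher per token but is usually far shorter.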
