ByteDance: UI-TARS 7B
ByteDance (Doubao)UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, we
Ultra-low costMultimodal vision
TextVision
Specifications
| Vendor | ByteDance (Doubao) |
| Family | doubao |
| Context | 128K · 96k words(≈ a thesis) |
| Max output | 128K · 96k words(≈ a thesis)Below average |
| Released | 2026-06-19 |
| Open source | No |
| Tool use | — |
Pricing across channels
per 1M tokens, USD
| Channel | Type | Input | Output | → |
|---|---|---|---|---|
| OpenRouter (auto) | Aggregator | $0.10 | $0.20 | visitAd |
OpenRouter API (auto-fetched)· 2026-06-19
Related models
Similar tier, vendor or price range