allenai

AllenAI: Molmo2 8B

Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) as part of the Molmo2 family, supporting image, video, and multi-image understanding and grounding. It is based on Qwen3-8B and uses SigLIP 2 as its vision backbone, outperforming other open-weight, open-data models on short videos, counting, and captioning, while remaining competitive on long-video tasks.

Input Cost
$0.20
per 1M tokens
Output Cost
$0.20
per 1M tokens
Context Window
36,864
tokens
Compare vs GPT-4o
Developer ID: allenai/molmo-2-8b

Related Models

allenai
$0.05/1M

AllenAI: Olmo 2 32B Instruct

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March ...

📝 128,000 ctx Compare →
allenai
$0.12/1M

AllenAI: Olmo 3 7B Think

Olmo 3 7B Think is a research-oriented language model in the Olmo family designed for adva...

📝 65,536 ctx Compare →
allenai
$0.10/1M

AllenAI: Olmo 3 7B Instruct

Olmo 3 7B Instruct is a supervised instruction-fine-tuned variant of the Olmo 3 7B base mo...

📝 65,536 ctx Compare →