Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
Cloud models · Ollama
Cloud models on Ollama.
  • nemotron-3-ultra

    NVIDIA Nemotron 3 Ultra is built for high-throughput reasoning and long-running agent workflows.

    tools thinking cloud

    5,255  Pulls 1  Tag Updated  5 days ago

  • minimax-m3

    MiniMax M3: Coding & Agentic Frontier. 1M context window. Native Multimodality.

    vision tools thinking cloud

    33.7K  Pulls 1  Tag Updated  1 week ago

  • gemma4

    Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

    vision tools thinking audio cloud e2b e4b 12b 26b 31b

    12.6M  Pulls 47  Tags Updated  3 days ago

  • qwen3.5

    Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

    vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

    13.3M  Pulls 64  Tags Updated  2 weeks ago

  • glm-5.1

    GLM-5.1 is our next-generation flagship model for agentic engineering, with significantly stronger coding capabilities than its predecessor. It achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5 by a wide margin.

    tools thinking cloud

    2.2M  Pulls 1  Tag Updated  2 months ago

  • minimax-m2.7

    MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

    tools thinking cloud

    2.2M  Pulls 1  Tag Updated  2 months ago

  • nemotron-3-super

    NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

    tools thinking cloud 120b

    2.4M  Pulls 7  Tags Updated  2 months ago

  • glm-5

    A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

    tools thinking cloud

    2.3M  Pulls 1  Tag Updated  3 months ago

  • minimax-m2.5

    MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

    tools thinking cloud

    2.2M  Pulls 1  Tag Updated  3 months ago

  • qwen3-coder-next

    Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

    tools cloud

    1.5M  Pulls 4  Tags Updated  4 months ago

  • glm-4.7

    Advancing the Coding Capability

    tools thinking cloud

    2.2M  Pulls 1  Tag Updated  5 months ago

  • gemini-3-flash-preview

    Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

    vision tools thinking cloud

    2.2M  Pulls 2  Tags Updated  5 months ago

  • minimax-m2.1

    Exceptional multilingual capabilities to elevate code engineering

    tools cloud

    2.1M  Pulls 1  Tag Updated  5 months ago

  • kimi-k2.6

    Kimi K2.6 is an open-source, native multimodal agentic model that advances practical capabilities in long-horizon coding, coding-driven design, proactive autonomous execution, and swarm-based task orchestration.

    vision tools thinking cloud

    290.8K  Pulls 1  Tag Updated  1 month ago

  • deepseek-v4-pro

    DeepSeek-V4-Pro is a frontier Mixture-of-Experts model with a 1M-token context window and three reasoning modes.

    tools thinking cloud

    111.1K  Pulls 1  Tag Updated  1 month ago

  • deepseek-v4-flash

    DeepSeek-V4-Flash is a preview of the DeepSeek-V4 series, a Mixture-of-Experts model with 284B total parameters and 13B activated, built for efficient reasoning across a 1M-token context window.

    tools thinking cloud

    110.4K  Pulls 1  Tag Updated  1 month ago

  • nemotron-3-nano

    Nemotron-3-Nano is a new Standard for Efficient, Open, and Intelligent Agentic Models, now updated with a 4B parameter count model.

    tools thinking cloud 4b 30b

    531.3K  Pulls 9  Tags Updated  2 months ago

  • kimi-k2.5

    Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

    vision tools thinking cloud

    314.3K  Pulls 1  Tag Updated  4 months ago

  • gpt-oss

    OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

    tools thinking cloud 20b 120b

    10.1M  Pulls 5  Tags Updated  8 months ago

  • qwen3-vl

    The most powerful vision-language model in the Qwen model family to date.

    vision tools thinking cloud 2b 4b 8b 30b 32b 235b

    4M  Pulls 59  Tags Updated  7 months ago

© 2026 Ollama
Blog Contact