AI Models

186 models · 0 new in 60d

Compare →
  • o3

    OpenAI · 200K tokens · $2/M → $8/M

    Best for: Hard math, science, multi-step planning, complex debugging

    How: Use reasoning_effort param: 'low'/'medium'/'high'. No system prompt — use developer message instead.

    Example: Debug a distributed system deadlock by feeding it the full trace + architecture.

    GPQA Diamond 79.7%AIME 2024 96.7%SWE-bench 69.1%
    reasoningmathscienceplanning

    API: api.openai.com — same SDK, just model='o3'

  • o4-mini

    OpenAI · 200K tokens · $1.10/M → $4.40/M

    Best for: Coding with reasoning, moderate-complexity math, budget reasoning

    How: Cheaper reasoning model. Use when o3 is overkill but you need chain-of-thought.

    Example: Generate a migration plan for a database schema change with safety checks.

    AIME 2024 93.4%SWE-bench 68.1%
    reasoningcodingcost-efficient reasoning

    API: api.openai.com — same SDK

  • Grok 3 mini

    xAI · 128K tokens · $0.30/M → $0.50/M

    Best for: Budget reasoning tasks, math, lightweight chain-of-thought

    How: Excellent cost-to-reasoning ratio. Use reasoning_effort param.

    Example: Validate Terraform plans with reasoning about dependency chains for pennies.

    fast reasoningvery cheapmath

    API: api.x.ai — same as Grok 3

  • NVIDIA BioNeMo

    NVIDIA · N/A · api

    Best for: Computational biology tasks

    How: Use NVIDIA BioNeMo recipes for fine-tuning

    Example: Fine-tuning ESM2 protein language models

    Fine-tuning biological foundation modelsPretrained on massive corpora of protein or genomic sequences

    Auto-discovered from news articles.

  • Nemotron 3 Ultra

    NVIDIA · api

    Best for: maintaining context and completing tasks across many turns

    How: deploy on Renesas RZ/V series for production

    Example: use in chatbots evolving into long-running agents

    faster reasoningmore efficientlong-running agents

    Auto-discovered from news articles.

  • NVIDIA Nemotron 3 Ultra

    NVIDIA · api

    Best for: Maintaining context and efficiency across many turns

    How: Integrate with existing chatbot frameworks to enhance long-running agent capabilities

    Example: Use Nemotron 3 Ultra to power a chatbot that can reason and maintain context over multiple interactions

    Faster reasoningMore efficient for long-running agents

    Auto-discovered from news articles.

  • NVIDIA Cosmos 3

    NVIDIA · N/A · api

    Best for: Developing Physical AI systems that need to understand and act within the real world

    How: Integrate NVIDIA Cosmos 3 into your Physical AI system to enable reasoning and action capabilities

    Example: Using NVIDIA Cosmos 3 to develop a robot that can understand and interact with its environment

    Physical AI reasoningAction modelsUnderstanding real world

    Auto-discovered from news articles.

  • Gordon

    Docker · api

    Best for: container workflow management

    How: Integrate Gordon with Docker Desktop

    Example: Gordon proposes fixes and takes action across your entire Docker workflow

    understands environmentproposes fixestakes action across Docker workflow

    Auto-discovered from news articles.

  • Mythos

    Cloudflare · N/A · api

    Best for: analyzing live code across critical parts of infrastructure

    How: Point Mythos at live code to observe its strengths and weaknesses

    Example: Mythos was used to analyze live code across critical parts of Cloudflare's infrastructure

    security-focusedcode analysis

    Auto-discovered from news articles.

  • GPT-Rosalind

    OpenAI · N/A · api

    Best for: life sciences research

    How: N/A

    Example: N/A

    accelerate drug discoverygenomics analysisprotein reasoningscientific research workflows

    Auto-discovered from news articles.