AI Models
186 models · 0 new in 60d
- ▾o3
OpenAI · 200K tokens · $2/M → $8/M
Best for: Hard math, science, multi-step planning, complex debugging
How: Use reasoning_effort param: 'low'/'medium'/'high'. No system prompt — use developer message instead.
Example: Debug a distributed system deadlock by feeding it the full trace + architecture.
GPQA Diamond 79.7%AIME 2024 96.7%SWE-bench 69.1%reasoningmathscienceplanningAPI: api.openai.com — same SDK, just model='o3'
- ▾o4-mini
OpenAI · 200K tokens · $1.10/M → $4.40/M
Best for: Coding with reasoning, moderate-complexity math, budget reasoning
How: Cheaper reasoning model. Use when o3 is overkill but you need chain-of-thought.
Example: Generate a migration plan for a database schema change with safety checks.
AIME 2024 93.4%SWE-bench 68.1%reasoningcodingcost-efficient reasoningAPI: api.openai.com — same SDK
- ▾Grok 3 mini
xAI · 128K tokens · $0.30/M → $0.50/M
Best for: Budget reasoning tasks, math, lightweight chain-of-thought
How: Excellent cost-to-reasoning ratio. Use reasoning_effort param.
Example: Validate Terraform plans with reasoning about dependency chains for pennies.
fast reasoningvery cheapmathAPI: api.x.ai — same as Grok 3
- ▾NVIDIA BioNeMo
NVIDIA · N/A · api
Best for: Computational biology tasks
How: Use NVIDIA BioNeMo recipes for fine-tuning
Example: Fine-tuning ESM2 protein language models
Fine-tuning biological foundation modelsPretrained on massive corpora of protein or genomic sequencesAuto-discovered from news articles.
- ▾Nemotron 3 Ultra
NVIDIA · api
Best for: maintaining context and completing tasks across many turns
How: deploy on Renesas RZ/V series for production
Example: use in chatbots evolving into long-running agents
faster reasoningmore efficientlong-running agentsAuto-discovered from news articles.
- ▾NVIDIA Nemotron 3 Ultra
NVIDIA · api
Best for: Maintaining context and efficiency across many turns
How: Integrate with existing chatbot frameworks to enhance long-running agent capabilities
Example: Use Nemotron 3 Ultra to power a chatbot that can reason and maintain context over multiple interactions
Faster reasoningMore efficient for long-running agentsAuto-discovered from news articles.
- ▾NVIDIA Cosmos 3
NVIDIA · N/A · api
Best for: Developing Physical AI systems that need to understand and act within the real world
How: Integrate NVIDIA Cosmos 3 into your Physical AI system to enable reasoning and action capabilities
Example: Using NVIDIA Cosmos 3 to develop a robot that can understand and interact with its environment
Physical AI reasoningAction modelsUnderstanding real worldAuto-discovered from news articles.
- ▾Gordon
Docker · api
Best for: container workflow management
How: Integrate Gordon with Docker Desktop
Example: Gordon proposes fixes and takes action across your entire Docker workflow
understands environmentproposes fixestakes action across Docker workflowAuto-discovered from news articles.
- ▾Mythos
Cloudflare · N/A · api
Best for: analyzing live code across critical parts of infrastructure
How: Point Mythos at live code to observe its strengths and weaknesses
Example: Mythos was used to analyze live code across critical parts of Cloudflare's infrastructure
security-focusedcode analysisAuto-discovered from news articles.
- ▾GPT-Rosalind
OpenAI · N/A · api
Best for: life sciences research
How: N/A
Example: N/A
accelerate drug discoverygenomics analysisprotein reasoningscientific research workflowsAuto-discovered from news articles.