AI Model Release Dashboard

Tracking releases from leading AI providers

Provider↕	Model↕	Release Date↓	Notes
Mistral	Mistral Small 4	Mar 17, 2026	Unified model combining text, vision, reasoning, and coding. MoE 119B total / 6B active params. 256K context. Apache 2.0.
OpenAI	GPT-5.4 mini	Mar 17, 2026	Most capable small OpenAI model. 2x faster than GPT-5.4. Strong on coding, reasoning, and computer use. Free and Go users.
OpenAI	GPT-5.4 nano	Mar 17, 2026	Smallest and cheapest OpenAI model. Optimized for classification, data extraction, ranking, and coding subagents.
OpenAI	GPT-5.4	Mar 5, 2026	Most capable frontier model for professional work. Native computer use, 1M context, consolidated coding+reasoning. 33% fewer hallucinations vs GPT-5.2.
Google	Gemini 3.1 Flash Lite	Mar 3, 2026	Ultra-efficient Gemini 3.1 model for developers. Fast and cost-effective for high-volume API workloads.
OpenAI	GPT-5.3 Instant	Mar 3, 2026	Smoother, more helpful everyday conversations. 400K context. 26.8% fewer hallucinations than GPT-5.2 with web search. Reduces unnecessary refusals and moralizing preambles.
Google	Gemini 3.1 Flash Image	Feb 26, 2026	Faster image generation with 4K resolution, improved text rendering, and multi-language support. #1 on Artificial Analysis Image Arena at launch. Default in Gemini app, Search, and Lens across 141 countries.
Google	Gemini 3.1 Pro	Feb 19, 2026	Advanced reasoning over complex problems. 77.1% on ARC-AGI-2. Available via Gemini API, Vertex AI, and NotebookLM.
Anthropic	Claude Sonnet 4.6	Feb 17, 2026	Default model for free and Pro users. 1M token context (beta). Improved computer use, coding, and large-scale data processing.
Anthropic	Claude Opus 4.6	Feb 5, 2026	1M token context window. 14.5-hour task completion horizon. Top Finance Agent benchmark. Stronger coding and planning.
OpenAI	GPT-5.3-Codex	Feb 5, 2026	Most capable agentic coding model to date. 25% faster than GPT-5.2-Codex. First OpenAI model instrumental in creating itself. State-of-the-art on SWE-Bench Pro and Terminal-Bench.
OpenAI	GPT-5.2-Codex	Dec 18, 2025	Agentic coding model optimized for long-horizon task completion. Context compaction for multi-window workflows. 56.4% on SWE-Bench Pro, 64.0% on Terminal-Bench 2.0.
Google	Gemini 3 Flash	Dec 17, 2025	Efficient Gemini 3 model. Includes Gemini 3 Pro reasoning in a faster, cheaper package. Default model in the Gemini app.
OpenAI	GPT-5.2	Dec 11, 2025	Next upgrade to the GPT-5 series. Smarter and more useful for work and learning. 400K context, 128K output. Available in Instant, Thinking, and Pro variants.
Google	Gemini 3 Deep Think	Dec 4, 2025	Extended-thinking version of Gemini 3 Pro. Rolled out to Ultra subscribers. Designed for the hardest reasoning tasks requiring longer deliberation.
Mistral	Mistral Large 3	Dec 2, 2025	MoE architecture: 41B active / 675B total parameters. Multimodal, 256K context. Apache 2.0. #2 on LMArena OSS leaderboard.
Mistral	Ministral 3 (3B/8B/14B)	Dec 2, 2025	Family of dense small models in 3 sizes × 3 variants (Base/Instruct/Reasoning). Runs on 4GB VRAM. Apache 2.0 license.
Anthropic	Claude Opus 4.5	Nov 24, 2025	Most robustly aligned Claude model. Reclaimed coding crown with 80.9% SWE-bench Verified. Significantly reduced pricing.
Google	Gemini 3 Pro	Nov 18, 2025	Best multimodal understanding model. Most powerful agentic and coding model from Google. Replaces Gemini 2.5 Pro.
OpenAI	GPT-5.1	Nov 12, 2025	Warmer personality, improved instruction-following. Adds shopping research and 8 personality options. Available in Instant, Thinking, and Pro variants. Retired March 2026.
Anthropic	Claude Haiku 4.5	Oct 15, 2025	Fast and affordable Claude 4 family model. Targets latency-sensitive applications. Available on Claude.ai and mobile.
Anthropic	Claude Sonnet 4.5	Sep 29, 2025	Best coding model at launch with 77.2% on SWE-bench Verified. 30+ hour autonomous task horizon. Launched alongside Claude Agent SDK and Claude Code 2.0.
Google	Gemini 2.5 Flash Image	Aug 26, 2025	Dedicated image generation model from Google. Better instruction following and text rendering than previous image models.
OpenAI	GPT-5	Aug 7, 2025	Unifies reasoning and standard model capabilities. Automatically switches between fast and deep thinking modes based on task needs.
Anthropic	Claude Opus 4.1	Aug 5, 2025	Drop-in replacement for Opus 4 with superior performance. Handles complex multi-step agentic tasks with more rigor. Improved precision for real-world coding workflows.
Anthropic	Claude Sonnet 4	Jul 9, 2025	Next generation Claude model. Enhanced coding, reasoning, and agentic task performance.
Anthropic	Claude Opus 4	Jul 9, 2025	Most intelligent Claude model. Designed for complex, long-horizon tasks and deep reasoning.
Mistral	Mistral Small 3.2	Jun 20, 2025	24B parameter open-weight model (Apache 2.0). Improved instruction following and function calling over v3.1. 84.78% on Mistral internal benchmark.
Google	Gemini 2.5 Flash	Jun 17, 2025	Efficient version of Gemini 2.5 (GA). Controllable thinking budget. Best price-performance in Gemini family.
Google	Gemini 2.5 Flash-Lite	Jun 17, 2025	Ultra-efficient Gemini 2.5 variant optimized for speed and cost. Lowest latency in the Gemini 2.5 family.
OpenAI	o3-pro	Jun 10, 2025	Extended-thinking version of o3. Thinks longer for more reliable responses on the hardest tasks. ChatGPT Pro and API.
Mistral	Magistral Small	Jun 10, 2025	Mistral's first reasoning model with chain-of-thought capabilities. 24B parameters, open-source under Apache 2.0. Released alongside Magistral Medium.
OpenAI	o3	Apr 16, 2025	Flagship reasoning model. Top ARC-AGI scores. Available in API with high/medium/low reasoning effort settings.
OpenAI	o4-mini	Apr 16, 2025	Cost-efficient reasoning model with vision. Faster than o3, strong coding performance. Tool use support.
OpenAI	GPT-4.1	Apr 14, 2025	Specialized for coding and instruction-following. Stronger than GPT-4o at precise web dev tasks. API-only release.
OpenAI	GPT-4.1 mini	Apr 14, 2025	Fast, capable small model. Replaces GPT-4o mini for paid users. Significant improvements in coding and instruction-following.
Meta	Llama 4	Apr 5, 2025	MoE architecture models: Scout (17Bx16E) and Maverick (17Bx128E). Native multimodal, 10M context window.
Google	Gemini 2.5 Pro (Preview)	Mar 25, 2025	State-of-the-art reasoning model. 1M context, native audio/video/image understanding. Top LMArena scores.
Mistral	Mistral Small 3.1	Mar 17, 2025	Updated small model with vision capabilities added. 24B parameters, 128K context. Apache 2.0 license.
OpenAI	GPT-4.5	Feb 27, 2025	Largest and most capable GPT model. Improved emotional intelligence and world knowledge. 128K context.
Anthropic	Claude 3.7 Sonnet	Feb 24, 2025	First hybrid reasoning model. Switchable extended thinking mode. Top scores on SWE-bench Verified (70.3%).
Mistral	Mistral Saba	Feb 17, 2025	Specialized for Arabic and South Asian languages. Strong Middle East/North Africa cultural context understanding.
Google	Gemini 2.0 Flash	Feb 5, 2025	Generally available release. Upgraded image generation, improved multilingual audio. 1M context. Free tier available.
Google	Gemini 2.0 Pro (Experimental)	Feb 5, 2025	Most capable Gemini 2.0 model for complex tasks. 2M context window. Code execution and Google Search grounding.
OpenAI	o3-mini	Jan 31, 2025	Cost-efficient small reasoning model. Three effort levels (low/medium/high). Outperforms o1 on AIME and SWE-bench.
Mistral	Mistral Small 3	Jan 15, 2025	24B parameter model with 81% MMLU. Apache 2.0 license. Best-in-class small model for latency-sensitive tasks.
Mistral	Codestral 25.01	Jan 13, 2025	Updated code generation model. 256K context window, supports 80+ programming languages. Available via API.
Google	Gemini 2.0 Flash (Experimental)	Dec 11, 2024	First Gemini 2.0 model. Multimodal I/O, native tool use, 1M context. Faster than 1.5 Pro. Agentic capabilities.
Meta	Llama 3.3 70B	Dec 6, 2024	Updated 70B model with performance matching Llama 3.1 405B. More compute-efficient. Open weights.
OpenAI	o1	Dec 5, 2024	Full reasoning model release. Includes vision support, 200K context. Available in ChatGPT and API.
Mistral	Pixtral Large	Nov 7, 2024	124B parameter multimodal model. Strong document understanding, charts, and scientific diagrams. Available via API.
Anthropic	Claude 3.5 Haiku	Nov 4, 2024	Fastest Claude model. Matches Claude 3 Opus on many benchmarks. 200K context. API and Bedrock availability.
Anthropic	Claude 3.5 Sonnet (v2)	Oct 22, 2024	Upgraded Sonnet with improved coding (SWE-bench 49%), vision, and computer use capability (beta).
Google	Gemini 1.5 Flash-8B	Oct 3, 2024	Smallest Gemini model for high-frequency, low-latency tasks. 1M token context. Generally available via API.
Meta	Llama 3.2	Sep 25, 2024	Multimodal models (11B, 90B) with vision. Lightweight text models (1B, 3B) for edge deployment. Open weights.
OpenAI	o1-preview	Sep 12, 2024	Early access reasoning model with extended chain-of-thought. Designed for complex science, coding, and math tasks.
OpenAI	o1-mini	Sep 12, 2024	Smaller, faster reasoning model. Cost-effective for coding and STEM tasks without broad world knowledge needs.
Mistral	Pixtral 12B	Sep 11, 2024	First multimodal model from Mistral. 12B parameters with vision capabilities. Apache 2.0 license.
Mistral	Mistral Large 2	Jul 24, 2024	128K context, strong coding and multilingual capabilities. 123B parameters. Apache 2.0 license.
Meta	Llama 3.1	Jul 23, 2024	Open-weights models: 8B, 70B, 405B. 128K context window. First open model competitive with frontier closed models.
OpenAI	GPT-4o mini	Jul 18, 2024	Affordable, fast frontier model. 128K context, supports vision. Lower cost alternative to GPT-4o.

Updated daily · Sources: official provider blogs & changelogs