Evertune

AI Model Release Dashboard

Tracking releases from leading AI providers

ProviderModelRelease DateNotes
Mistral
Mistral Small 4
Mar 17, 2026

Unified model combining text, vision, reasoning, and coding. MoE 119B total / 6B active params. 256K context. Apache 2.0.

OpenAI
GPT-5.4 mini
Mar 17, 2026

Most capable small OpenAI model. 2x faster than GPT-5.4. Strong on coding, reasoning, and computer use. Free and Go users.

OpenAI
GPT-5.4 nano
Mar 17, 2026

Smallest and cheapest OpenAI model. Optimized for classification, data extraction, ranking, and coding subagents.

OpenAI
GPT-5.4
Mar 5, 2026

Most capable frontier model for professional work. Native computer use, 1M context, consolidated coding+reasoning. 33% fewer hallucinations vs GPT-5.2.

Google
Gemini 3.1 Flash Lite
Mar 3, 2026

Ultra-efficient Gemini 3.1 model for developers. Fast and cost-effective for high-volume API workloads.

OpenAI
GPT-5.3 Instant
Mar 3, 2026

Smoother, more helpful everyday conversations. 400K context. 26.8% fewer hallucinations than GPT-5.2 with web search. Reduces unnecessary refusals and moralizing preambles.

Google
Gemini 3.1 Flash Image
Feb 26, 2026

Faster image generation with 4K resolution, improved text rendering, and multi-language support. #1 on Artificial Analysis Image Arena at launch. Default in Gemini app, Search, and Lens across 141 countries.

Google
Gemini 3.1 Pro
Feb 19, 2026

Advanced reasoning over complex problems. 77.1% on ARC-AGI-2. Available via Gemini API, Vertex AI, and NotebookLM.

Anthropic
Claude Sonnet 4.6
Feb 17, 2026

Default model for free and Pro users. 1M token context (beta). Improved computer use, coding, and large-scale data processing.

Anthropic
Claude Opus 4.6
Feb 5, 2026

1M token context window. 14.5-hour task completion horizon. Top Finance Agent benchmark. Stronger coding and planning.

OpenAI
GPT-5.3-Codex
Feb 5, 2026

Most capable agentic coding model to date. 25% faster than GPT-5.2-Codex. First OpenAI model instrumental in creating itself. State-of-the-art on SWE-Bench Pro and Terminal-Bench.

OpenAI
GPT-5.2-Codex
Dec 18, 2025

Agentic coding model optimized for long-horizon task completion. Context compaction for multi-window workflows. 56.4% on SWE-Bench Pro, 64.0% on Terminal-Bench 2.0.

Google
Gemini 3 Flash
Dec 17, 2025

Efficient Gemini 3 model. Includes Gemini 3 Pro reasoning in a faster, cheaper package. Default model in the Gemini app.

OpenAI
GPT-5.2
Dec 11, 2025

Next upgrade to the GPT-5 series. Smarter and more useful for work and learning. 400K context, 128K output. Available in Instant, Thinking, and Pro variants.

Google
Gemini 3 Deep Think
Dec 4, 2025

Extended-thinking version of Gemini 3 Pro. Rolled out to Ultra subscribers. Designed for the hardest reasoning tasks requiring longer deliberation.

Mistral
Mistral Large 3
Dec 2, 2025

MoE architecture: 41B active / 675B total parameters. Multimodal, 256K context. Apache 2.0. #2 on LMArena OSS leaderboard.

Mistral
Ministral 3 (3B/8B/14B)
Dec 2, 2025

Family of dense small models in 3 sizes × 3 variants (Base/Instruct/Reasoning). Runs on 4GB VRAM. Apache 2.0 license.

Anthropic
Claude Opus 4.5
Nov 24, 2025

Most robustly aligned Claude model. Reclaimed coding crown with 80.9% SWE-bench Verified. Significantly reduced pricing.

Google
Gemini 3 Pro
Nov 18, 2025

Best multimodal understanding model. Most powerful agentic and coding model from Google. Replaces Gemini 2.5 Pro.

OpenAI
GPT-5.1
Nov 12, 2025

Warmer personality, improved instruction-following. Adds shopping research and 8 personality options. Available in Instant, Thinking, and Pro variants. Retired March 2026.

Anthropic
Claude Haiku 4.5
Oct 15, 2025

Fast and affordable Claude 4 family model. Targets latency-sensitive applications. Available on Claude.ai and mobile.

Anthropic
Claude Sonnet 4.5
Sep 29, 2025

Best coding model at launch with 77.2% on SWE-bench Verified. 30+ hour autonomous task horizon. Launched alongside Claude Agent SDK and Claude Code 2.0.

Google
Gemini 2.5 Flash Image
Aug 26, 2025

Dedicated image generation model from Google. Better instruction following and text rendering than previous image models.

OpenAI
GPT-5
Aug 7, 2025

Unifies reasoning and standard model capabilities. Automatically switches between fast and deep thinking modes based on task needs.

Anthropic
Claude Opus 4.1
Aug 5, 2025

Drop-in replacement for Opus 4 with superior performance. Handles complex multi-step agentic tasks with more rigor. Improved precision for real-world coding workflows.

Anthropic
Claude Sonnet 4
Jul 9, 2025

Next generation Claude model. Enhanced coding, reasoning, and agentic task performance.

Anthropic
Claude Opus 4
Jul 9, 2025

Most intelligent Claude model. Designed for complex, long-horizon tasks and deep reasoning.

Mistral
Mistral Small 3.2
Jun 20, 2025

24B parameter open-weight model (Apache 2.0). Improved instruction following and function calling over v3.1. 84.78% on Mistral internal benchmark.

Google
Gemini 2.5 Flash
Jun 17, 2025

Efficient version of Gemini 2.5 (GA). Controllable thinking budget. Best price-performance in Gemini family.

Google
Gemini 2.5 Flash-Lite
Jun 17, 2025

Ultra-efficient Gemini 2.5 variant optimized for speed and cost. Lowest latency in the Gemini 2.5 family.

OpenAI
o3-pro
Jun 10, 2025

Extended-thinking version of o3. Thinks longer for more reliable responses on the hardest tasks. ChatGPT Pro and API.

Mistral
Magistral Small
Jun 10, 2025

Mistral's first reasoning model with chain-of-thought capabilities. 24B parameters, open-source under Apache 2.0. Released alongside Magistral Medium.

OpenAI
o3
Apr 16, 2025

Flagship reasoning model. Top ARC-AGI scores. Available in API with high/medium/low reasoning effort settings.

OpenAI
o4-mini
Apr 16, 2025

Cost-efficient reasoning model with vision. Faster than o3, strong coding performance. Tool use support.

OpenAI
GPT-4.1
Apr 14, 2025

Specialized for coding and instruction-following. Stronger than GPT-4o at precise web dev tasks. API-only release.

OpenAI
GPT-4.1 mini
Apr 14, 2025

Fast, capable small model. Replaces GPT-4o mini for paid users. Significant improvements in coding and instruction-following.

Meta
Llama 4
Apr 5, 2025

MoE architecture models: Scout (17Bx16E) and Maverick (17Bx128E). Native multimodal, 10M context window.

Google
Gemini 2.5 Pro (Preview)
Mar 25, 2025

State-of-the-art reasoning model. 1M context, native audio/video/image understanding. Top LMArena scores.

Mistral
Mistral Small 3.1
Mar 17, 2025

Updated small model with vision capabilities added. 24B parameters, 128K context. Apache 2.0 license.

OpenAI
GPT-4.5
Feb 27, 2025

Largest and most capable GPT model. Improved emotional intelligence and world knowledge. 128K context.

Anthropic
Claude 3.7 Sonnet
Feb 24, 2025

First hybrid reasoning model. Switchable extended thinking mode. Top scores on SWE-bench Verified (70.3%).

Mistral
Mistral Saba
Feb 17, 2025

Specialized for Arabic and South Asian languages. Strong Middle East/North Africa cultural context understanding.

Google
Gemini 2.0 Flash
Feb 5, 2025

Generally available release. Upgraded image generation, improved multilingual audio. 1M context. Free tier available.

Google
Gemini 2.0 Pro (Experimental)
Feb 5, 2025

Most capable Gemini 2.0 model for complex tasks. 2M context window. Code execution and Google Search grounding.

OpenAI
o3-mini
Jan 31, 2025

Cost-efficient small reasoning model. Three effort levels (low/medium/high). Outperforms o1 on AIME and SWE-bench.

Mistral
Mistral Small 3
Jan 15, 2025

24B parameter model with 81% MMLU. Apache 2.0 license. Best-in-class small model for latency-sensitive tasks.

Mistral
Codestral 25.01
Jan 13, 2025

Updated code generation model. 256K context window, supports 80+ programming languages. Available via API.

Google
Gemini 2.0 Flash (Experimental)
Dec 11, 2024

First Gemini 2.0 model. Multimodal I/O, native tool use, 1M context. Faster than 1.5 Pro. Agentic capabilities.

Meta
Llama 3.3 70B
Dec 6, 2024

Updated 70B model with performance matching Llama 3.1 405B. More compute-efficient. Open weights.

OpenAI
o1
Dec 5, 2024

Full reasoning model release. Includes vision support, 200K context. Available in ChatGPT and API.

Mistral
Pixtral Large
Nov 7, 2024

124B parameter multimodal model. Strong document understanding, charts, and scientific diagrams. Available via API.

Anthropic
Claude 3.5 Haiku
Nov 4, 2024

Fastest Claude model. Matches Claude 3 Opus on many benchmarks. 200K context. API and Bedrock availability.

Anthropic
Claude 3.5 Sonnet (v2)
Oct 22, 2024

Upgraded Sonnet with improved coding (SWE-bench 49%), vision, and computer use capability (beta).

Google
Gemini 1.5 Flash-8B
Oct 3, 2024

Smallest Gemini model for high-frequency, low-latency tasks. 1M token context. Generally available via API.

Meta
Llama 3.2
Sep 25, 2024

Multimodal models (11B, 90B) with vision. Lightweight text models (1B, 3B) for edge deployment. Open weights.

OpenAI
o1-preview
Sep 12, 2024

Early access reasoning model with extended chain-of-thought. Designed for complex science, coding, and math tasks.

OpenAI
o1-mini
Sep 12, 2024

Smaller, faster reasoning model. Cost-effective for coding and STEM tasks without broad world knowledge needs.

Mistral
Pixtral 12B
Sep 11, 2024

First multimodal model from Mistral. 12B parameters with vision capabilities. Apache 2.0 license.

Mistral
Mistral Large 2
Jul 24, 2024

128K context, strong coding and multilingual capabilities. 123B parameters. Apache 2.0 license.

Meta
Llama 3.1
Jul 23, 2024

Open-weights models: 8B, 70B, 405B. 128K context window. First open model competitive with frontier closed models.

OpenAI
GPT-4o mini
Jul 18, 2024

Affordable, fast frontier model. 128K context, supports vision. Lower cost alternative to GPT-4o.

Updated daily · Sources: official provider blogs & changelogs