Explore AI Models
Discover the best large language models (LLMs) from different providers.
GPT-5.1
Flagship model for coding, reasoning, and agentic tasks with adjustable reasoning depth and multimodal input/output.
GPT-5.1 Codex
Version of GPT-5.1 optimized for agentic coding inside Codex and similar environments, with strong reasoning and multimodal support.
Sora 2
Flagship video generation model that produces high-quality dynamic videos with synced audio from natural language or image prompts.
Sora 2 Pro
Most advanced video generation model with synced audio, producing highly detailed, dynamic clips from natural language or image inputs.
GPT-5
A high-reasoning model for coding and agentic tasks with configurable reasoning effort, supporting text + image input and large context windows.
GPT-5 Codex
Version of GPT-5 optimized for agentic coding tasks in Codex, offering strong reasoning, reliable code generation, and long-context project understanding.
GPT-5 Mini
A faster, cost-efficient version of GPT-5 designed for well-defined tasks, precise prompts, and high-speed execution with strong reasoning.
GPT-5 Nano
The fastest and cheapest GPT-5 variant, ideal for summarization, classification, and lightweight tasks requiring high speed and low cost.
GPT-5 Pro
A premium GPT-5 variant that uses more compute to deliver consistently smarter, more precise reasoning for the toughest problems.
GPT-4.1
A highly capable non-reasoning model that excels at instruction following, tool calling, and broad domain knowledge with a 1M-token context window.
GPT-4.1 Mini
Smaller, faster version of GPT-4.1 with low latency, strong instruction following, and a large 1M-token context window optimized for lightweight tasks.
GPT-4.1 Nano
Fastest and most cost-efficient GPT-4.1 model with strong instruction following, tool calling, and a 1M-token context window for lightweight, real-time tasks.
GPT-OSS 120B
OpenAI's most powerful open-weight model (117B params, 5.1B active), fitting on a single H100 GPU - fully customizable, licensed for unrestricted commercial use.
GPT-OSS 20B
A 21-billion-parameter open-weight model from OpenAI, designed for efficient reasoning and long-context usage (≈ 128K tokens).
GPT Image 1.5
State-of-the-art image generation model with improved instruction following and adherence to prompts.
GPT Image 1
State-of-the-art image generation model that accepts text and image inputs and produces high-quality images across multiple resolutions and quality levels.
GPT Image 1 Mini
A cost-efficient, multimodal image generation model that accepts text and image inputs and produces images across multiple resolutions and quality levels.
o4-mini
A fast, cost-efficient small reasoning model optimized for coding and visual tasks; succeeded by GPT-5 mini.
o3
A powerful reasoning model excelling at complex, multi-step tasks across math, science, coding, and visual reasoning; succeeded by GPT-5.
o3-mini
A small, cost-efficient reasoning model offering high intelligence at the same pricing and latency targets as o1-mini, with strong support for structured outputs and developer tooling.
o1
A full-size o-series reasoning model trained with RL to think before answering, producing strong multi-step reasoning across math, code, and analysis tasks.
o1-pro
A high-compute version of the o1 reasoning model, trained with reinforcement learning to think before answering and produce consistently stronger multi-step reasoning across math, science, coding, and analysis tasks.
GPT-4o
A versatile, high-intelligence flagship GPT model that handles text and image inputs and produces fast, high-quality text outputs for a wide range of tasks.
GPT-4o mini
A fast, affordable small model for focused tasks with multimodal input support and strong performance for classification, extraction, translation, and lightweight reasoning.
GPT-4o Audio
Preview multimodal model that accepts and outputs audio, optimized for natural voice interactions and real-time conversational experiences.
GPT-4o mini Audio
Fast, affordable audio-capable model for lightweight voice interactions, real-time responses, and low-cost speech-based applications.
GPT-4 Turbo
Older high-intelligence GPT-4 generation model offering strong reasoning and image input support, now superseded by newer 4o-based models.
GPT-3.5 Turbo
Legacy lightweight GPT model for cheap text generation and chat tasks; now replaced by faster, smarter, and cheaper 4o-mini models.
Gemini 3 Pro
Google's most intelligent multimodal model designed for advanced reasoning, coding, and agentic tasks.
Nano Banana Pro
High-fidelity image model with precise controls, advanced text rendering, and world-knowledge grounding.
Gemini 2.5 Pro Experimental
Google's most advanced thinking model, leading benchmarks in reasoning, science, math, and coding with a massive multimodal context window.
Gemini 2.5 Flash
A fast, cost-efficient multimodal model optimized for everyday tasks with strong speed, long context, and native audio capabilities.
Nano Banana
High-quality, low-latency image model for generation, editing, fusion, and character consistency.
Gemini 1.5 Pro
A next-generation multimodal model with breakthrough long-context capability up to 1M tokens and strong reasoning across text, code, audio, video, and images.
Gemini 1.5 Flash
A fast, lightweight model optimized for low-latency, high-volume multimodal tasks with long-context support.
Gemini 1.0 Pro
A versatile multimodal model optimized for balanced performance across reasoning, language, and code tasks.
Claude 4.5 Sonnet
A frontier-level hybrid-reasoning model excelling at coding, long-horizon tasks, computer use, and domain reasoning with top-tier alignment and reliability.
Claude 4.5 Haiku
A fast, small model delivering near-frontier coding and computer-use performance at ultra-low cost with exceptional speed and strong safety.
Claude 4.1 Opus
A refined flagship model with improved coding, reasoning, research depth, and agentic task performance over Opus 4.
Claude 4 Sonnet
A balanced-hybrid reasoning model tuned for everyday assistant and high-volume tasks.
Claude 4 Opus
The flagship model, focused on deep reasoning, large-scale coding and sustained multi-step agentic workflows.
Claude 3.5 Sonnet
A fast, mid-tier model offering top-tier intelligence, strong reasoning, and advanced coding/vision capabilities at low cost.
Claude 3.5 Haiku
A fast, affordable model matching Claude 3 Opus on many tasks while delivering major improvements in coding, accuracy, and tool use.
Claude 3 Opus
The most intelligent Claude 3 model, built for highly complex reasoning, analysis, and open-ended problem solving across any domain.
Claude 3 Sonnet
Balanced model offering high intelligence with fast performance, excellent for scalable enterprise workloads and real-time responses.
Claude 3 Haiku
Ultra-fast, cost-efficient model built for real-time interactions, instant responses, and high-volume workloads.
Grok 4
A flagship multimodal model excelling in natural language, math, and deep reasoning with unmatched all-around performance.
Grok 3
A high-performance enterprise model for coding, extraction, reasoning, and domain-expert tasks across finance, healthcare, law, and science.
Grok 3 Mini
A lightweight reasoning model that is fast, efficient, and ideal for logic-heavy tasks without deep domain requirements.
Qwen3-Omni-Flash-Realtime
Real-time multimodal model with streaming audio input and VAD for live use.