Explore AI Models

Discover the best large language models (LLMs) from different providers.

GPT-5.1

Flagship model for coding, reasoning, and agentic tasks with adjustable reasoning depth and multimodal input/output.

GPT-5.1 Codex

Version of GPT-5.1 optimized for agentic coding inside Codex and similar environments, with strong reasoning and multimodal support.

Sora 2

Flagship video generation model that produces high-quality dynamic videos with synced audio from natural language or image prompts.

Sora 2 Pro

Most advanced video generation model with synced audio, producing highly detailed, dynamic clips from natural language or image inputs.

GPT-5

A high-reasoning model for coding and agentic tasks with configurable reasoning effort, supporting text + image input and large context windows.

GPT-5 Codex

Version of GPT-5 optimized for agentic coding tasks in Codex, offering strong reasoning, reliable code generation, and long-context project understanding.

GPT-5 Mini

A faster, cost-efficient version of GPT-5 designed for well-defined tasks, precise prompts, and high-speed execution with strong reasoning.

GPT-5 Nano

The fastest and cheapest GPT-5 variant, ideal for summarization, classification, and lightweight tasks requiring high speed and low cost.

GPT-5 Pro

A premium GPT-5 variant that uses more compute to deliver consistently smarter, more precise reasoning for the toughest problems.

GPT-4.1

A highly capable non-reasoning model that excels at instruction following, tool calling, and broad domain knowledge with a 1M-token context window.

GPT-4.1 Mini

Smaller, faster version of GPT-4.1 with low latency, strong instruction following, and a large 1M-token context window optimized for lightweight tasks.

GPT-4.1 Nano

Fastest and most cost-efficient GPT-4.1 model with strong instruction following, tool calling, and a 1M-token context window for lightweight, real-time tasks.

GPT-OSS 120B

OpenAI's most powerful open-weight model (117B params, 5.1B active), fitting on a single H100 GPU - fully customizable, licensed for unrestricted commercial use.

GPT-OSS 20B

A 21-billion-parameter open-weight model from OpenAI, designed for efficient reasoning and long-context usage (≈ 128K tokens).

GPT Image 1

State-of-the-art image generation model that accepts text and image inputs and produces high-quality images across multiple resolutions and quality levels.

GPT Image 1 Mini

A cost-efficient, multimodal image generation model that accepts text and image inputs and produces images across multiple resolutions and quality levels.

o4-mini

A fast, cost-efficient small reasoning model optimized for coding and visual tasks; succeeded by GPT-5 mini.

o3

A powerful reasoning model excelling at complex, multi-step tasks across math, science, coding, and visual reasoning; succeeded by GPT-5.

o3-mini

A small, cost-efficient reasoning model offering high intelligence at the same pricing and latency targets as o1-mini, with strong support for structured outputs and developer tooling.

o1

A full-size o-series reasoning model trained with RL to think before answering, producing strong multi-step reasoning across math, code, and analysis tasks.

o1-pro

A high-compute version of the o1 reasoning model, trained with reinforcement learning to think before answering and produce consistently stronger multi-step reasoning across math, science, coding, and analysis tasks.

GPT-4o

A versatile, high-intelligence flagship GPT model that handles text and image inputs and produces fast, high-quality text outputs for a wide range of tasks.

GPT-4o mini

A fast, affordable small model for focused tasks with multimodal input support and strong performance for classification, extraction, translation, and lightweight reasoning.

GPT-4o Audio

Preview multimodal model that accepts and outputs audio, optimized for natural voice interactions and real-time conversational experiences.

GPT-4o mini Audio

Fast, affordable audio-capable model for lightweight voice interactions, real-time responses, and low-cost speech-based applications.

GPT-4 Turbo

Older high-intelligence GPT-4 generation model offering strong reasoning and image input support, now superseded by newer 4o-based models.

GPT-3.5 Turbo

Legacy lightweight GPT model for cheap text generation and chat tasks; now replaced by faster, smarter, and cheaper 4o-mini models.

Gemini 3 Pro

Google's most intelligent multimodal model designed for advanced reasoning, coding, and agentic tasks.

Nano Banana Pro

High-fidelity image model with precise controls, advanced text rendering, and world-knowledge grounding.

Gemini 2.5 Pro Experimental

Google's most advanced thinking model, leading benchmarks in reasoning, science, math, and coding with a massive multimodal context window.

Gemini 2.5 Flash

A fast, cost-efficient multimodal model optimized for everyday tasks with strong speed, long context, and native audio capabilities.

Nano Banana

High-quality, low-latency image model for generation, editing, fusion, and character consistency.

Gemini 1.5 Pro

A next-generation multimodal model with breakthrough long-context capability up to 1M tokens and strong reasoning across text, code, audio, video, and images.

Gemini 1.5 Flash

A fast, lightweight model optimized for low-latency, high-volume multimodal tasks with long-context support.

Gemini 1.0 Pro

A versatile multimodal model optimized for balanced performance across reasoning, language, and code tasks.

Claude 4.5 Sonnet

A frontier-level hybrid-reasoning model excelling at coding, long-horizon tasks, computer use, and domain reasoning with top-tier alignment and reliability.

Claude 4.5 Haiku

A fast, small model delivering near-frontier coding and computer-use performance at ultra-low cost with exceptional speed and strong safety.

Claude 4.1 Opus

A refined flagship model with improved coding, reasoning, research depth, and agentic task performance over Opus 4.

Claude 4 Sonnet

A balanced-hybrid reasoning model tuned for everyday assistant and high-volume tasks.

Claude 4 Opus

The flagship model, focused on deep reasoning, large-scale coding and sustained multi-step agentic workflows.

Claude 3.5 Sonnet

A fast, mid-tier model offering top-tier intelligence, strong reasoning, and advanced coding/vision capabilities at low cost.

Claude 3.5 Haiku

A fast, affordable model matching Claude 3 Opus on many tasks while delivering major improvements in coding, accuracy, and tool use.

Claude 3 Opus

The most intelligent Claude 3 model, built for highly complex reasoning, analysis, and open-ended problem solving across any domain.

Claude 3 Sonnet

Balanced model offering high intelligence with fast performance, excellent for scalable enterprise workloads and real-time responses.

Claude 3 Haiku

Ultra-fast, cost-efficient model built for real-time interactions, instant responses, and high-volume workloads.

Grok 4

A flagship multimodal model excelling in natural language, math, and deep reasoning with unmatched all-around performance.

Grok 3

A high-performance enterprise model for coding, extraction, reasoning, and domain-expert tasks across finance, healthcare, law, and science.

Grok 3 Mini

A lightweight reasoning model that is fast, efficient, and ideal for logic-heavy tasks without deep domain requirements.

Qwen3-Max

Top-tier Qwen3 model for complex, multi-step reasoning and agent workflows.

Qwen-Max

High-performance general-purpose Qwen model with strong coding and reasoning abilities.

Qwen-Plus

Balanced Qwen model with strong speed, cost efficiency, and optional reasoning mode.

Qwen3-Plus

Improved Qwen3 generation of Plus model with better reasoning, tool use, and alignment.

Qwen-Flash

The fastest and cheapest Qwen model, ideal for high-volume workloads.

Qwen3-Flash

Upgraded Flash model with improved capabilities and hybrid reasoning support.

Qwen-Turbo

Fast, low-cost model for general tasks; being phased out in favor of Flash.

QwQ-Plus

A reasoning-optimized model built on Qwen2.5 with strong math and code performance.

Qwen-Long

Long-context model with 10M tokens for huge document analysis and summarization.

Qwen-Omni-Turbo

Multimodal turbo model supporting text, image, audio, and video with fast output.

Qwen3-Omni-Flash

Hybrid thinking multimodal model with upgraded vision, audio, and agent abilities.

Qwen3-Omni-Flash-Realtime

Real-time multimodal model with streaming audio input and VAD for live use.

QVQ-Max

High-end visual reasoning model with strong math, coding, and diagram understanding.

Qwen3-VL-Plus

Text-generation model with strong vision understanding, OCR, reasoning, and summaries.

LLaMA 3 70B

Meta's large-sized, open-source model for a wide range of tasks.

LLaMA 3 8B

Meta's small-sized, open-source model, suitable for simpler tasks.

DeepSeek V3

A data-analysis powerhouse built for large-scale pattern recognition, prediction, and working with massive datasets.

DeepSeek R1

A fast, real-time decision-making model optimized for rapid analysis, dynamic adjustments, and responsive AI behavior.