Discover the best large language models (LLMs) from different providers.
Flagship model for coding, reasoning, and agentic tasks with adjustable reasoning depth and multimodal input/output.
Version of GPT-5.1 optimized for agentic coding inside Codex and similar environments, with strong reasoning and multimodal support.
Flagship video generation model that produces high-quality dynamic videos with synced audio from natural language or image prompts.
Most advanced video generation model with synced audio, producing highly detailed, dynamic clips from natural language or image inputs.
A high-reasoning model for coding and agentic tasks with configurable reasoning effort, supporting text + image input and large context windows.
Version of GPT-5 optimized for agentic coding tasks in Codex, offering strong reasoning, reliable code generation, and long-context project understanding.
A faster, cost-efficient version of GPT-5 designed for well-defined tasks, precise prompts, and high-speed execution with strong reasoning.
The fastest and cheapest GPT-5 variant, ideal for summarization, classification, and lightweight tasks requiring high speed and low cost.
A premium GPT-5 variant that uses more compute to deliver consistently smarter, more precise reasoning for the toughest problems.
A highly capable non-reasoning model that excels at instruction following, tool calling, and broad domain knowledge with a 1M-token context window.
Smaller, faster version of GPT-4.1 with low latency, strong instruction following, and a large 1M-token context window optimized for lightweight tasks.
Fastest and most cost-efficient GPT-4.1 model with strong instruction following, tool calling, and a 1M-token context window for lightweight, real-time tasks.
OpenAI's most powerful open-weight model (117B params, 5.1B active), fitting on a single H100 GPU - fully customizable, licensed for unrestricted commercial use.
A 21-billion-parameter open-weight model from OpenAI, designed for efficient reasoning and long-context usage (≈ 128K tokens).
State-of-the-art image generation model that accepts text and image inputs and produces high-quality images across multiple resolutions and quality levels.
A cost-efficient, multimodal image generation model that accepts text and image inputs and produces images across multiple resolutions and quality levels.
A fast, cost-efficient small reasoning model optimized for coding and visual tasks; succeeded by GPT-5 mini.
A powerful reasoning model excelling at complex, multi-step tasks across math, science, coding, and visual reasoning; succeeded by GPT-5.
A small, cost-efficient reasoning model offering high intelligence at the same pricing and latency targets as o1-mini, with strong support for structured outputs and developer tooling.
A full-size o-series reasoning model trained with RL to think before answering, producing strong multi-step reasoning across math, code, and analysis tasks.
A high-compute version of the o1 reasoning model, trained with reinforcement learning to think before answering and produce consistently stronger multi-step reasoning across math, science, coding, and analysis tasks.
A versatile, high-intelligence flagship GPT model that handles text and image inputs and produces fast, high-quality text outputs for a wide range of tasks.
A fast, affordable small model for focused tasks with multimodal input support and strong performance for classification, extraction, translation, and lightweight reasoning.
Preview multimodal model that accepts and outputs audio, optimized for natural voice interactions and real-time conversational experiences.
Fast, affordable audio-capable model for lightweight voice interactions, real-time responses, and low-cost speech-based applications.
Older high-intelligence GPT-4 generation model offering strong reasoning and image input support, now superseded by newer 4o-based models.
Legacy lightweight GPT model for cheap text generation and chat tasks; now replaced by faster, smarter, and cheaper 4o-mini models.
Google's most intelligent multimodal model designed for advanced reasoning, coding, and agentic tasks.
High-fidelity image model with precise controls, advanced text rendering, and world-knowledge grounding.
Google's most advanced thinking model, leading benchmarks in reasoning, science, math, and coding with a massive multimodal context window.
A fast, cost-efficient multimodal model optimized for everyday tasks with strong speed, long context, and native audio capabilities.
High-quality, low-latency image model for generation, editing, fusion, and character consistency.
A next-generation multimodal model with breakthrough long-context capability up to 1M tokens and strong reasoning across text, code, audio, video, and images.
A fast, lightweight model optimized for low-latency, high-volume multimodal tasks with long-context support.
A versatile multimodal model optimized for balanced performance across reasoning, language, and code tasks.
A frontier-level hybrid-reasoning model excelling at coding, long-horizon tasks, computer use, and domain reasoning with top-tier alignment and reliability.
A fast, small model delivering near-frontier coding and computer-use performance at ultra-low cost with exceptional speed and strong safety.
A refined flagship model with improved coding, reasoning, research depth, and agentic task performance over Opus 4.
A balanced-hybrid reasoning model tuned for everyday assistant and high-volume tasks.
The flagship model, focused on deep reasoning, large-scale coding and sustained multi-step agentic workflows.
A fast, mid-tier model offering top-tier intelligence, strong reasoning, and advanced coding/vision capabilities at low cost.
A fast, affordable model matching Claude 3 Opus on many tasks while delivering major improvements in coding, accuracy, and tool use.
The most intelligent Claude 3 model, built for highly complex reasoning, analysis, and open-ended problem solving across any domain.
Balanced model offering high intelligence with fast performance, excellent for scalable enterprise workloads and real-time responses.
Ultra-fast, cost-efficient model built for real-time interactions, instant responses, and high-volume workloads.
A flagship multimodal model excelling in natural language, math, and deep reasoning with unmatched all-around performance.
A high-performance enterprise model for coding, extraction, reasoning, and domain-expert tasks across finance, healthcare, law, and science.
A lightweight reasoning model that is fast, efficient, and ideal for logic-heavy tasks without deep domain requirements.
Real-time multimodal model with streaming audio input and VAD for live use.
Use Appaca to build and launch your AI products in minutes.