AI Models Directory

Compare pricing, context windows, and strengths for 78 AI models from 7 providers - and see how teams put each one to work in Appaca, the no-code AI workspace.

OpenAI models

Frontier GPT models for reasoning, coding, and knowledge work, plus image generation and speech models - the most widely used AI model family for business workflows.

text 1M context

GPT-5.5

OpenAI's smartest and most capable model yet for agentic coding, knowledge work, and computer use, delivering a new class of intelligence at GPT-5.4 latency.

From $5 / 1M tokens Learn more
text 1.1M context

GPT-5.4

OpenAI's frontier model for complex professional work with best intelligence at scale for agentic, coding, and professional workflows.

From $2.5 / 1M tokens Learn more
text 400K context

GPT-5.2

Previous frontier model for complex professional work with configurable reasoning effort.

From $1.75 / 1M tokens Learn more
text 400K context

GPT-5.1

Flagship model for coding, reasoning, and agentic tasks with adjustable reasoning depth and multimodal input/output.

From $1.25 / 1M tokens Learn more
text 400K context

GPT-5.3 Codex

Most capable agentic coding model to date, optimized for long-horizon software engineering tasks with configurable reasoning and multimodal input.

From $1.75 / 1M tokens Learn more
text 400K context

GPT-5.2 Codex

Highly capable coding model optimized for long-horizon, agentic coding tasks with configurable reasoning and strong codebase awareness.

From $1.75 / 1M tokens Learn more
text 400K context

GPT-5.1 Codex

Version of GPT-5.1 optimized for agentic coding inside Codex and similar environments, with strong reasoning and multimodal support.

From $1.25 / 1M tokens Learn more
video 400K context

Sora 2

Flagship video generation model that produces high-quality dynamic videos with synced audio from natural language or image prompts.

$0.1 / sec Learn more
video 400K context

Sora 2 Pro

Most advanced video generation model with synced audio, producing highly detailed, dynamic clips from natural language or image inputs.

$0.5 / sec Learn more
text 400K context

GPT-5

A high-reasoning model for coding and agentic tasks with configurable reasoning effort, supporting text + image input and large context windows.

From $1.25 / 1M tokens Learn more
text 400K context

GPT-5 Codex

Version of GPT-5 optimized for agentic coding tasks in Codex, offering strong reasoning, reliable code generation, and long-context project understanding.

From $1.25 / 1M tokens Learn more
text 400K context

GPT-5 Mini

A faster, cost-efficient version of GPT-5 designed for well-defined tasks, precise prompts, and high-speed execution with strong reasoning.

From $0.25 / 1M tokens Learn more
text 400K context

GPT-5 Nano

The fastest and cheapest GPT-5 variant, ideal for summarization, classification, and lightweight tasks requiring high speed and low cost.

From $0.05 / 1M tokens Learn more
text 400K context

GPT-5 Pro

A premium GPT-5 variant that uses more compute to deliver consistently smarter, more precise reasoning for the toughest problems.

From $15 / 1M tokens Learn more
text 1.0M context

GPT-4.1

A highly capable non-reasoning model that excels at instruction following, tool calling, and broad domain knowledge with a 1M-token context window.

From $2 / 1M tokens Learn more
text 1.0M context

GPT-4.1 Mini

Smaller, faster version of GPT-4.1 with low latency, strong instruction following, and a large 1M-token context window optimized for lightweight tasks.

From $0.4 / 1M tokens Learn more
text 1.0M context

GPT-4.1 Nano

Fastest and most cost-efficient GPT-4.1 model with strong instruction following, tool calling, and a 1M-token context window for lightweight, real-time tasks.

From $0.1 / 1M tokens Learn more
text 131.1K context

GPT-OSS 120B

OpenAI's most powerful open-weight model (117B params, 5.1B active), fitting on a single H100 GPU - fully customizable, licensed for unrestricted commercial use.

Open weight Learn more
text 128K context

GPT-OSS 20B

A 21-billion-parameter open-weight model from OpenAI, designed for efficient reasoning and long-context usage (≈ 128K tokens).

Open weight Learn more
image

GPT Image 1.5

State-of-the-art image generation model with improved instruction following and adherence to prompts.

From $5 / 1M tokens Learn more
image

GPT Image 1

State-of-the-art image generation model that accepts text and image inputs and produces high-quality images across multiple resolutions and quality levels.

From $5 / 1M tokens Learn more
image

GPT Image 1 Mini

A cost-efficient, multimodal image generation model that accepts text and image inputs and produces images across multiple resolutions and quality levels.

From $2 / 1M tokens Learn more
text 200K context

o4-mini

A fast, cost-efficient small reasoning model optimized for coding and visual tasks; succeeded by GPT-5 mini.

From $1.1 / 1M tokens Learn more
text 200K context

o3

A powerful reasoning model excelling at complex, multi-step tasks across math, science, coding, and visual reasoning; succeeded by GPT-5.

From $2 / 1M tokens Learn more
text 200K context

o3-mini

A small, cost-efficient reasoning model offering high intelligence at the same pricing and latency targets as o1-mini, with strong support for structured outputs and developer tooling.

From $1.1 / 1M tokens Learn more
text 200K context

o1

A full-size o-series reasoning model trained with RL to think before answering, producing strong multi-step reasoning across math, code, and analysis tasks.

From $15 / 1M tokens Learn more
text 200K context

o1-pro

A high-compute version of the o1 reasoning model, trained with reinforcement learning to think before answering and produce consistently stronger multi-step reasoning across math, science, coding, and analysis tasks.

From $150 / 1M tokens Learn more
text 128K context

GPT-4o

A versatile, high-intelligence flagship GPT model that handles text and image inputs and produces fast, high-quality text outputs for a wide range of tasks.

From $2.5 / 1M tokens Learn more
text 128K context

GPT-4o mini

A fast, affordable small model for focused tasks with multimodal input support and strong performance for classification, extraction, translation, and lightweight reasoning.

From $0.15 / 1M tokens Learn more
audio 128K context

GPT-4o Audio

Preview multimodal model that accepts and outputs audio, optimized for natural voice interactions and real-time conversational experiences.

From $2.5 / 1M tokens Learn more
audio 128K context

GPT-4o mini Audio

Fast, affordable audio-capable model for lightweight voice interactions, real-time responses, and low-cost speech-based applications.

From $0.15 / 1M tokens Learn more
text 128K context

GPT-4 Turbo

Older high-intelligence GPT-4 generation model offering strong reasoning and image input support, now superseded by newer 4o-based models.

From $10 / 1M tokens Learn more
text 16.4K context

GPT-3.5 Turbo

Legacy lightweight GPT model for cheap text generation and chat tasks; now replaced by faster, smarter, and cheaper 4o-mini models.

From $0.5 / 1M tokens Learn more

Google models

The Gemini family pairs strong reasoning with million-token context windows, alongside the Nano Banana models for fast, high-quality image generation.

text 1.0M context

Gemini 3.1 Pro

Google's most advanced reasoning Gemini model, built for complex multimodal problem-solving, software engineering, and long-horizon agentic workflows.

From $4 / 1M tokens Learn more
image

Nano Banana 2

High-efficiency native image model optimized for fast generation, editing, and conversational image workflows at high throughput.

See pricing Learn more
text 1M context

Gemini 3 Pro

Google's most intelligent multimodal model designed for advanced reasoning, coding, and agentic tasks.

From $4 / 1M tokens Learn more
image

Nano Banana Pro

High-fidelity image model with precise controls, advanced text rendering, and world-knowledge grounding.

See pricing Learn more
text 1.0M context

Gemini 2.5 Pro Experimental

Google's most advanced thinking model, leading benchmarks in reasoning, science, math, and coding with a massive multimodal context window.

From $1.5 / 1M tokens Learn more
text 1M context

Gemini 2.5 Flash

A fast, cost-efficient multimodal model optimized for everyday tasks with strong speed, long context, and native audio capabilities.

From $0.3 / 1M tokens Learn more
image

Nano Banana

High-quality, low-latency image model for generation, editing, fusion, and character consistency.

See pricing Learn more
text 1M context

Gemini 1.5 Pro

A next-generation multimodal model with breakthrough long-context capability up to 1M tokens and strong reasoning across text, code, audio, video, and images.

From $3.5 / 1M tokens Learn more
text 1M context

Gemini 1.5 Flash

A fast, lightweight model optimized for low-latency, high-volume multimodal tasks with long-context support.

From $0.075 / 1M tokens Learn more
text 128K context

Gemini 1.0 Pro

A versatile multimodal model optimized for balanced performance across reasoning, language, and code tasks.

From $0.5 / 1M tokens Learn more

Anthropic models

Claude models are known for careful reasoning, long-document analysis, and dependable writing - a favourite for teams drafting reports, proposals, and policies.

text 1M context

Claude 4.7 Opus

Anthropic's latest frontier Opus model, purpose-built for advanced software engineering, long-horizon agent work, and high-resolution multimodal reasoning.

From $5 / 1M tokens Learn more
text 1M context

Claude 4.6 Sonnet

Anthropic's most capable Sonnet model, delivering a full upgrade across coding, computer use, long-context reasoning, agent planning, and professional knowledge work.

From $3 / 1M tokens Learn more
text 1M context

Claude 4.5 Sonnet

A frontier-level hybrid-reasoning model excelling at coding, long-horizon tasks, computer use, and domain reasoning with top-tier alignment and reliability.

From $3 / 1M tokens Learn more
text 200K context

Claude 4.5 Haiku

A fast, small model delivering near-frontier coding and computer-use performance at ultra-low cost with exceptional speed and strong safety.

From $1 / 1M tokens Learn more
text 1M context

Claude 4.6 Opus

Anthropic's most intelligent model for building agents and coding, with stronger reliability and precision for long-horizon engineering and enterprise workflows.

From $5 / 1M tokens Learn more
text 200K context

Claude 4.5 Opus

Anthropic's November 2025 flagship model, combining maximum capability with practical performance for coding, agents, computer use, and enterprise workflows.

From $5 / 1M tokens Learn more
text 1M context

Claude 4.1 Opus

A refined flagship model with improved coding, reasoning, research depth, and agentic task performance over Opus 4.

From $15 / 1M tokens Learn more
text 1M context

Claude 4 Sonnet

A balanced-hybrid reasoning model tuned for everyday assistant and high-volume tasks.

From $3 / 1M tokens Learn more
text 200K context

Claude 4 Opus

The flagship model, focused on deep reasoning, large-scale coding and sustained multi-step agentic workflows.

From $15 / 1M tokens Learn more
text 200K context

Claude 3.5 Sonnet

A fast, mid-tier model offering top-tier intelligence, strong reasoning, and advanced coding/vision capabilities at low cost.

From $3 / 1M tokens Learn more
text 200K context

Claude 3.5 Haiku

A fast, affordable model matching Claude 3 Opus on many tasks while delivering major improvements in coding, accuracy, and tool use.

From $0.8 / 1M tokens Learn more
text 200K context

Claude 3 Opus

The most intelligent Claude 3 model, built for highly complex reasoning, analysis, and open-ended problem solving across any domain.

From $15 / 1M tokens Learn more
text 200K context

Claude 3 Sonnet

Balanced model offering high intelligence with fast performance, excellent for scalable enterprise workloads and real-time responses.

From $3 / 1M tokens Learn more
text 200K context

Claude 3 Haiku

Ultra-fast, cost-efficient model built for real-time interactions, instant responses, and high-volume workloads.

From $0.25 / 1M tokens Learn more

Alibaba Cloud models

The Qwen family spans flagship reasoning models, fast and affordable workhorses, and omni models that handle text, image, and audio together.

text 262.1K context

Qwen3-Max

Top-tier Qwen3 model for complex, multi-step reasoning and agent workflows.

From $0.861 / 1M tokens Learn more
text 32.8K context

Qwen-Max

High-performance general-purpose Qwen model with strong coding and reasoning abilities.

From $1.6 / 1M tokens Learn more
text 1M context

Qwen-Plus

Balanced Qwen model with strong speed, cost efficiency, and optional reasoning mode.

From $0.115 / 1M tokens Learn more
text 1M context

Qwen3-Plus

Improved Qwen3 generation of Plus model with better reasoning, tool use, and alignment.

From $0.115 / 1M tokens Learn more
text 1M context

Qwen-Flash

The fastest and cheapest Qwen model, ideal for high-volume workloads.

From $0.022 / 1M tokens Learn more
text 1M context

Qwen3-Flash

Upgraded Flash model with improved capabilities and hybrid reasoning support.

From $0.022 / 1M tokens Learn more
text 1M context

Qwen-Turbo

Fast, low-cost model for general tasks; being phased out in favor of Flash.

From $0.044 / 1M tokens Learn more
text 131.1K context

QwQ-Plus

A reasoning-optimized model built on Qwen2.5 with strong math and code performance.

From $0.23 / 1M tokens Learn more
text 10M context

Qwen-Long

Long-context model with 10M tokens for huge document analysis and summarization.

From $0.072 / 1M tokens Learn more
multimodal 32.8K context

Qwen-Omni-Turbo

Multimodal turbo model supporting text, image, audio, and video with fast output.

From $0.058 / 1M tokens Learn more
multimodal 65.5K context

Qwen3-Omni-Flash

Hybrid thinking multimodal model with upgraded vision, audio, and agent abilities.

From $0.43 / 1M tokens Learn more
multimodal 65.5K context

Qwen3-Omni-Flash-Realtime

Real-time multimodal model with streaming audio input and VAD for live use.

From $0.52 / 1M tokens Learn more
vision 131.1K context

QVQ-Max

High-end visual reasoning model with strong math, coding, and diagram understanding.

From $1.147 / 1M tokens Learn more
vision 262.1K context

Qwen3-VL-Plus

Text-generation model with strong vision understanding, OCR, reasoning, and summaries.

From $0.4 / 1M tokens Learn more
Appaca

Put any of these models to work

Appaca is the AI workspace for operators. Build internal tools and AI co-workers powered by any model in this directory - connected to your real data and ready for your whole team.

Describe it, and it's built

Tell the Appaca agent the internal tool you need - a lead scorer on Claude, a report generator on GPT-5.5, an image tool on Nano Banana - and it builds a working app. No code, no API keys, no deployment.

Connected to your real data

Connect Slack, Notion, Google Sheets, Airtable, and more, plus a built-in database - so every model works with your team's real context instead of generic answers.

Automated for the whole team

Schedule tools to run on autopilot - daily digests, weekly reports, real-time triggers - and share them with your whole team from one workspace.

Describe it, and it's built

Tell the Appaca agent what your team needs and it builds a working app powered by the model you choose - connected to the tools you already use.

SlackGoogle SheetsGoogle DriveGoogle CalendarAirtableNotionWhatsappHubspot
Chat to app Appaca app builder

FAQs

Which AI model should my team use?

It depends on the job. Frontier models like GPT-5.5, Gemini 3.1 Pro, and Claude 4.7 Opus handle complex reasoning and long documents. Fast, low-cost models like Gemini Flash, Claude Haiku, and DeepSeek V3 suit high-volume tasks. Image models like Nano Banana generate visuals. In Appaca, you can pick a different model for each tool - and switch later without rebuilding anything.

How does AI model pricing work?

Most text models charge per token - separate rates per million input and output tokens. Image models charge per generated image, usually varying by resolution and quality. Open-weight models like Llama and DeepSeek can be run without per-token licensing fees. Each model page in this directory lists its current pricing and context window.

Can I use these AI models without writing code?

Yes. Appaca is a no-code AI workspace: describe the internal tool your team needs and the Appaca agent builds it as a working app powered by the model you choose - with a built-in database, team access, and integrations. No API keys to wire up and nothing to deploy.

What is a context window?

The context window is how much text a model can consider at once, measured in tokens. A larger context window - like Gemini's 1M tokens - means the model can work across entire codebases, long contracts, or months of records in a single pass without losing track.

What is Appaca?

Appaca is an AI workspace for operators. Describe what your team needs and get a working app with a built-in database, team access, and no coding or deployment required. It supports the AI models in this directory, so your tools always run on the model that fits the task.

Build AI agents and tools that use any AI model

Describe the tool your team needs and get a working app powered by the right model - with a built-in database, team access, and integrations. No code, no deployment.