Explore AI Models

Discover the best large language models (LLMs) from different providers.

GPT-5.2

OpenAI's flagship model for coding and agentic tasks across industries.

GPT-5.1

Flagship model for coding, reasoning, and agentic tasks with adjustable reasoning depth and multimodal input/output.

GPT-5.1 Codex

Version of GPT-5.1 optimized for agentic coding inside Codex and similar environments, with strong reasoning and multimodal support.

Sora 2

Flagship video generation model that produces high-quality dynamic videos with synced audio from natural language or image prompts.

Sora 2 Pro

Most advanced video generation model with synced audio, producing highly detailed, dynamic clips from natural language or image inputs.

GPT-5

A high-reasoning model for coding and agentic tasks with configurable reasoning effort, supporting text + image input and large context windows.

GPT-5 Codex

Version of GPT-5 optimized for agentic coding tasks in Codex, offering strong reasoning, reliable code generation, and long-context project understanding.

GPT-5 Mini

A faster, cost-efficient version of GPT-5 designed for well-defined tasks, precise prompts, and high-speed execution with strong reasoning.

GPT-5 Nano

The fastest and cheapest GPT-5 variant, ideal for summarization, classification, and lightweight tasks requiring high speed and low cost.

GPT-5 Pro

A premium GPT-5 variant that uses more compute to deliver consistently smarter, more precise reasoning for the toughest problems.

GPT-4.1

A highly capable non-reasoning model that excels at instruction following, tool calling, and broad domain knowledge with a 1M-token context window.

GPT-4.1 Mini

Smaller, faster version of GPT-4.1 with low latency, strong instruction following, and a large 1M-token context window optimized for lightweight tasks.

GPT-4.1 Nano

Fastest and most cost-efficient GPT-4.1 model with strong instruction following, tool calling, and a 1M-token context window for lightweight, real-time tasks.

GPT-OSS 120B

OpenAI's most powerful open-weight model (117B params, 5.1B active), fitting on a single H100 GPU - fully customizable, licensed for unrestricted commercial use.

GPT-OSS 20B

A 21-billion-parameter open-weight model from OpenAI, designed for efficient reasoning and long-context usage (≈ 128K tokens).

GPT Image 1.5

State-of-the-art image generation model with improved instruction following and adherence to prompts.

GPT Image 1

State-of-the-art image generation model that accepts text and image inputs and produces high-quality images across multiple resolutions and quality levels.

GPT Image 1 Mini

A cost-efficient, multimodal image generation model that accepts text and image inputs and produces images across multiple resolutions and quality levels.

o4-mini

A fast, cost-efficient small reasoning model optimized for coding and visual tasks; succeeded by GPT-5 mini.

o3

A powerful reasoning model excelling at complex, multi-step tasks across math, science, coding, and visual reasoning; succeeded by GPT-5.

o3-mini

A small, cost-efficient reasoning model offering high intelligence at the same pricing and latency targets as o1-mini, with strong support for structured outputs and developer tooling.

o1

A full-size o-series reasoning model trained with RL to think before answering, producing strong multi-step reasoning across math, code, and analysis tasks.

o1-pro

A high-compute version of the o1 reasoning model, trained with reinforcement learning to think before answering and produce consistently stronger multi-step reasoning across math, science, coding, and analysis tasks.

GPT-4o

A versatile, high-intelligence flagship GPT model that handles text and image inputs and produces fast, high-quality text outputs for a wide range of tasks.

GPT-4o mini

A fast, affordable small model for focused tasks with multimodal input support and strong performance for classification, extraction, translation, and lightweight reasoning.

GPT-4o Audio

Preview multimodal model that accepts and outputs audio, optimized for natural voice interactions and real-time conversational experiences.

GPT-4o mini Audio

Fast, affordable audio-capable model for lightweight voice interactions, real-time responses, and low-cost speech-based applications.

GPT-4 Turbo

Older high-intelligence GPT-4 generation model offering strong reasoning and image input support, now superseded by newer 4o-based models.

GPT-3.5 Turbo

Legacy lightweight GPT model for cheap text generation and chat tasks; now replaced by faster, smarter, and cheaper 4o-mini models.

Gemini 3 Pro

Google's most intelligent multimodal model designed for advanced reasoning, coding, and agentic tasks.

Nano Banana Pro

High-fidelity image model with precise controls, advanced text rendering, and world-knowledge grounding.

Gemini 2.5 Pro Experimental

Google's most advanced thinking model, leading benchmarks in reasoning, science, math, and coding with a massive multimodal context window.

Gemini 2.5 Flash

A fast, cost-efficient multimodal model optimized for everyday tasks with strong speed, long context, and native audio capabilities.

Nano Banana

High-quality, low-latency image model for generation, editing, fusion, and character consistency.

Gemini 1.5 Pro

A next-generation multimodal model with breakthrough long-context capability up to 1M tokens and strong reasoning across text, code, audio, video, and images.

Gemini 1.5 Flash

A fast, lightweight model optimized for low-latency, high-volume multimodal tasks with long-context support.

Gemini 1.0 Pro

A versatile multimodal model optimized for balanced performance across reasoning, language, and code tasks.

Claude 4.5 Sonnet

A frontier-level hybrid-reasoning model excelling at coding, long-horizon tasks, computer use, and domain reasoning with top-tier alignment and reliability.

Claude 4.5 Haiku

A fast, small model delivering near-frontier coding and computer-use performance at ultra-low cost with exceptional speed and strong safety.

Claude 4.1 Opus

A refined flagship model with improved coding, reasoning, research depth, and agentic task performance over Opus 4.

Claude 4 Sonnet

A balanced-hybrid reasoning model tuned for everyday assistant and high-volume tasks.

Claude 4 Opus

The flagship model, focused on deep reasoning, large-scale coding and sustained multi-step agentic workflows.

Claude 3.5 Sonnet

A fast, mid-tier model offering top-tier intelligence, strong reasoning, and advanced coding/vision capabilities at low cost.

Claude 3.5 Haiku

A fast, affordable model matching Claude 3 Opus on many tasks while delivering major improvements in coding, accuracy, and tool use.

Claude 3 Opus

The most intelligent Claude 3 model, built for highly complex reasoning, analysis, and open-ended problem solving across any domain.

Claude 3 Sonnet

Balanced model offering high intelligence with fast performance, excellent for scalable enterprise workloads and real-time responses.

Claude 3 Haiku

Ultra-fast, cost-efficient model built for real-time interactions, instant responses, and high-volume workloads.

Grok 4

A flagship multimodal model excelling in natural language, math, and deep reasoning with unmatched all-around performance.

Grok 3

A high-performance enterprise model for coding, extraction, reasoning, and domain-expert tasks across finance, healthcare, law, and science.

Grok 3 Mini

A lightweight reasoning model that is fast, efficient, and ideal for logic-heavy tasks without deep domain requirements.

Qwen3-Max

Top-tier Qwen3 model for complex, multi-step reasoning and agent workflows.

Qwen-Max

High-performance general-purpose Qwen model with strong coding and reasoning abilities.

Qwen-Plus

Balanced Qwen model with strong speed, cost efficiency, and optional reasoning mode.

Qwen3-Plus

Improved Qwen3 generation of Plus model with better reasoning, tool use, and alignment.

Qwen-Flash

The fastest and cheapest Qwen model, ideal for high-volume workloads.

Qwen3-Flash

Upgraded Flash model with improved capabilities and hybrid reasoning support.

Qwen-Turbo

Fast, low-cost model for general tasks; being phased out in favor of Flash.

QwQ-Plus

A reasoning-optimized model built on Qwen2.5 with strong math and code performance.

Qwen-Long

Long-context model with 10M tokens for huge document analysis and summarization.

Qwen-Omni-Turbo

Multimodal turbo model supporting text, image, audio, and video with fast output.

Qwen3-Omni-Flash

Hybrid thinking multimodal model with upgraded vision, audio, and agent abilities.

Qwen3-Omni-Flash-Realtime

Real-time multimodal model with streaming audio input and VAD for live use.

QVQ-Max

High-end visual reasoning model with strong math, coding, and diagram understanding.

Qwen3-VL-Plus

Text-generation model with strong vision understanding, OCR, reasoning, and summaries.

LLaMA 3 70B

Meta's large-sized, open-source model for a wide range of tasks.

LLaMA 3 8B

Meta's small-sized, open-source model, suitable for simpler tasks.

DeepSeek V3

A data-analysis powerhouse built for large-scale pattern recognition, prediction, and working with massive datasets.

DeepSeek R1

A fast, real-time decision-making model optimized for rapid analysis, dynamic adjustments, and responsive AI behavior.

The platform for your ideal software

Use Appaca to to do the most with any software you need, just for your use case.