GPT 5.4 Mini
NEWHOTopenai/gpt-5.4-miniGPT-5.4 Mini — latest gen fast and affordable
openai/gpt-5.4-miniGPT-5.4 Mini — latest gen fast and affordable
openai/o4-miniNewest compact reasoning model with tool use support
openai/gpt-5.1GPT-5.1 — improved reasoning and longer context window
openai/gpt-5GPT-5 flagship model with advanced reasoning and longer context
openai/gpt-5.4-nanoGPT-5.4 Nano — ultra-fast, low-latency for classification, extraction, and sub-agents
openai/gpt-5.3-chatGPT-5.3 Chat — previous generation flagship chat model
openai/gpt-5.1-chatGPT-5.1 chat-optimized variant
openai/gpt-5.1-codexGPT-5.1 Codex — advanced code generation and analysis
openai/gpt-5.1-codex-miniGPT-5.1 Codex Mini — fast and affordable code generation
openai/gpt-5-chatGPT-5 chat-optimized variant
openai/gpt-5-miniGPT-5 Mini — affordable with strong reasoning
openai/gpt-5-nanoGPT-5 Nano — fastest and cheapest in the GPT-5 family
openai/gpt-5-proGPT-5 Pro — maximum capability, best for complex tasks
openai/gpt-5-codexGPT-5 Codex — advanced code generation and analysis
openai/o3Latest reasoning model with improved speed and accuracy
openai/gpt-4.1Flagship model with 200K context, best for complex reasoning and coding
openai/gpt-4.1-miniFast and affordable, great balance of speed and intelligence
openai/gpt-4.1-nanoFastest and cheapest, ideal for simple tasks and classification
openai/gpt-4oPrevious flagship with vision, strong all-around performance
openai/gpt-4o-miniCompact model optimized for speed and cost efficiency
openai/gpt-4o-audio-previewMultimodal model supporting audio input and output
openai/gpt-4o-mini-ttsGPT-4o Mini TTS — expressive, natural speech with emotion and tone control
openai/gpt-4o-transcribeGPT-4o powered transcription — more accurate than Whisper, supports structured output
openai/gpt-4o-mini-transcribeFast and affordable GPT-4o Mini transcription
openai/gpt-4o-transcribe-diarizeGPT-4o transcription with speaker diarization — identifies who said what
openai/text-embedding-3-smallCompact embedding model, 1536 dimensions
openai/text-embedding-3-largeLarge embedding model, up to 3072 dimensions
openai/text-embedding-ada-002Legacy embedding model, 1536 dimensions
openai/tts-1Standard text-to-speech, fast and natural sounding
openai/tts-1-hdHigh-definition text-to-speech with premium voice quality
openai/whisper-1Industry-leading speech-to-text transcription
openai/gpt-oss-120bGPT-OSS 120B — open-source large language model
openai/gpt-oss-20bGPT-OSS 20B — compact open-source model
anthropic/claude-opus-4-6Latest Opus with improved coding and reduced cost
anthropic/claude-sonnet-4-6Latest Sonnet, top choice for Claude Code and Cursor
anthropic/claude-haiku-4-5Fast and capable, great for real-time applications
anthropic/claude-sonnet-4-thinkingSonnet 4 with extended thinking for deeper reasoning
anthropic/claude-opus-4.5Claude Opus 4.5 via Bedrock, top-tier reasoning and creativity
anthropic/claude-sonnet-4.5Claude Sonnet 4.5 via Bedrock, strong balanced performance
anthropic/claude-opus-4-7Most capable Claude yet (2026-04-16): 1M context, 128K output, agentic workflows
anthropic/claude-opus-4.1Claude Opus 4.1 via Bedrock, premium reasoning model
anthropic/claude-sonnet-4Balanced Claude with strong coding and reasoning abilities
anthropic/claude-haiku-3.5Previous Haiku generation, compact and efficient
google/gemini-2.5-proGoogle most capable model with 1M context window
google/gemini-2.5-flashFast Gemini with strong reasoning at low cost
google/gemini-3-pro-previewGemini 3 Pro Preview — next-gen reasoning
google/gemini-3-pro-image-previewGemini 3 Pro Image — highest quality image generation & editing, up to 4K
google/gemini-3-flash-previewGemini 3 Flash Preview — fast next-gen model
google/gemini-2.5-flash-liteGemini 2.5 Flash Lite — lightweight and fast
google/gemini-2.5-flash-imageGemini 2.5 Flash Image — vision and image generation
google/gemini-3.1-pro-previewGemini 3.1 Pro Preview — latest capabilities
google/gemini-3.1-flash-image-previewGemini 3.1 Flash Image — vision and image generation
google/gemini-3.1-flash-lite-previewGemini 3.1 Flash Lite — ultra-low cost for high-throughput tasks
google/gemini-embedding-001Gemini embedding model for text retrieval
deepseek/deepseek-v3.2Latest DeepSeek V3.2 with hybrid thinking mode
deepseek/deepseek-r1-0528DeepSeek R1 snapshot from May 2025
deepseek/deepseek-reasonerChain-of-thought reasoning model rivaling o1
deepseek/deepseek-v3.1DeepSeek V3.1 with hybrid thinking mode
deepseek/deepseek-v3DeepSeek V3 text generation model
deepseek/deepseek-r1DeepSeek R1 reasoning-only model
deepseek/deepseek-chatOpen-source powerhouse, strong coding and math skills
qwen/qwen3.6-plusQwen 3.6 Plus — latest flagship, rivals Claude Opus 4.5 on benchmarks
qwen/qwen3-coder-plusSpecialized coding model by Qwen
qwen/qwen3-coder-nextNext-gen Qwen coding model, top tier for code tasks
qwen/qwen3-coder-30bQwen3 Coder 30B, compact coding specialist via Bedrock
qwen/qwen3-coder-flashQwen3 Coder Flash — fastest, cheapest coding model
qwen/qwen3-coder-480bQwen3 Coder 480B — flagship coding model, largest in the series
qwen/qwq-plusQwen reasoning model, rivals DeepSeek-R1
qwen/qwen3-maxQwen 3 flagship, top reasoning and coding
qwen/qwen3.5-plusQwen 3.5 Plus with 1M context
qwen/qwen3.5-flashUltra-fast Qwen 3.5 with 1M context
qwen/qwen3.5-omniQwen 3.5 Omni native multimodal — text, image, audio, video
qwen/qwen3.5-397bQwen 3.5 397B MoE flagship, top-tier reasoning
qwen/qwen3-vl-plusQwen vision-language model
qwen/qwen3-next-80bQwen3 Next 80B via Bedrock, efficient MoE architecture
qwen/qwen3-vl-235bQwen3 VL 235B, large vision-language model via Bedrock
qwen/qwen3-32bQwen3 32B dense model via Bedrock, strong all-rounder
qwen/qwen3-tts-flashQwen3 TTS Flash — fast text-to-speech with natural voice
qwen/qwen3-tts-instruct-flashQwen3 TTS Instruct — instruction-controlled speech synthesis
qwen/qwen3.5-omni-flashQwen 3.5 Omni Flash — fast multimodal (text+image+audio)
qwen/qwen3.5-omni-plusQwen 3.5 Omni Plus — premium multimodal understanding
qwen/qwen3-vl-flashQwen3 VL Flash — fast and cheap visual understanding
qwen/qwen-plusBalanced Qwen model via direct API
qwen/qwen-plus-latestAlways-latest Qwen Plus snapshot
qwen/qwen-turboFast Qwen model, deprecated in favor of Qwen Flash
qwen/qwen-long10M ultra-long context for massive documents
qwen/qwen-maxQwen flagship via direct API
qwen/qwen2.5-coder-32bSpecialized coding model with 32B parameters
qwen/qwen-vl-maxQwen vision-language model via direct API
qwen/qwen-flashUltra-fast Qwen Flash, upgraded to Qwen3.5
qwen/qwen-max-latestAlways-latest Qwen Max snapshot
qwen/qwen-vl-plusQwen VL Plus for vision-language tasks
qwen/wan2.7-imageWan 2.7 Image — affordable image generation
qwen/wan2.7-image-proWan 2.7 Image Pro — high-quality image generation
qwen/cosyvoice-v2CosyVoice v2 text-to-speech via Bailian
qwen/sensevoice-v1SenseVoice speech-to-text via Bailian
qwen/paraformer-v2Paraformer v2 speech recognition via Bailian
qwen/wan2.7-t2vWan 2.7 Text-to-Video
qwen/wan2.6-t2vWan 2.6 Text-to-Video
qwen/wan2.7-i2vWan 2.7 Image-to-Video
qwen/wan2.6-i2vWan 2.6 Image-to-Video
qwen/wan2.6-i2v-flashWan 2.6 Image-to-Video Flash — fast generation
qwen/qvq-maxQVQ-Max — flagship visual reasoning model, deep visual understanding
qwen/qvq-plusQVQ-Plus — balanced visual reasoning, cost-effective
qwen/qwen-vl-ocrQwen VL OCR — specialized document OCR, table/form extraction
qwen/qwen-image-2.0Qwen Image 2.0 — text-to-image generation
qwen/qwen-image-2.0-proQwen Image 2.0 Pro — high-quality text-to-image
qwen/qwen-image-maxQwen Image Max — best quality text-to-image
qwen/qwen-image-edit-plusQwen Image Edit Plus — conversational image editing, Chinese & English prompts
qwen/qwen-image-edit-maxQwen Image Edit Max — premium conversational image editing
qwen/qwen-math-plusQwen Math Plus — specialized mathematical reasoning
qwen/qwen-math-turboQwen Math Turbo — fast math reasoning at lower cost
qwen/qwen-omni-turboQwen Omni Turbo — text, image, and audio understanding
qwen/z-image-turboZ-Image Turbo — lightweight fast image generation
qwen/qwen-mt-plusQwen MT Plus — professional machine translation
qwen/qwen-mt-turboQwen MT Turbo — fast translation at lower cost
zhipu/glm-5.1GLM-5.1 latest — improved coding and reasoning, 94% of Claude Opus 4.6 coding
zhipu/glm-5GLM-5 744B open-source flagship with thinking mode
zhipu/glm-5-turboGLM-5 Turbo — fast and cost-effective coding model
zhipu/glm-5v-turboGLM-5V Turbo — vision-language model for design-to-code and image understanding
zhipu/glm-4.7GLM-4.7 with hybrid thinking mode
zhipu/glm-4.6GLM-4.6 with hybrid thinking mode
zhipu/glm-4.5GLM-4.5 hybrid thinking model
zhipu/glm-4.5-airLightweight GLM-4.5 for fast inference
zhipu/glm-4.7-flashGLM-4.7 Flash, fast and affordable via Bedrock
xai/grok-4.1-fastGrok 4.1 Fast — latest xAI reasoning model
xai/grok-4.1-fast-non-reasoningGrok 4.1 Fast — latest model, quick responses
xai/grok-4-fastGrok 4 Fast — high-speed reasoning
xai/grok-4-fast-non-reasoningGrok 4 Fast — quick responses without deep reasoning
xai/grok-3xAI flagship with deep reasoning and real-time knowledge
xai/grok-3-miniFast and affordable Grok for everyday tasks
xai/grok-2Previous generation Grok model
moonshot/kimi-k2.5Kimi K2.5, multimodal flagship with 262K context
moonshot/kimi-k2-thinkingKimi K2 thinking model with deep reasoning
doubao/doubao-1.5-pro-256kByteDance Doubao with 256K context
doubao/doubao-1.5-pro-32kDoubao Pro with standard 32K context
doubao/doubao-1.5-lite-32kUltra-affordable Doubao for basic tasks
meta/llama-4-maverickLatest Llama with 1M context and multimodal support
meta/llama-4-scoutEfficient Llama 4 variant with 512K context
meta/llama-3.3-70bStrong open-source model for general tasks
meta/llama-3.1-405bLargest open-source model, near-frontier performance
meta/llama-3.1-70bVersatile 70B model with good cost-performance ratio
meta/llama-3.1-8bLightweight and fast, ideal for simple tasks
meta/llama-3.2-90bLlama 3.2 vision model with 90B parameters
meta/llama-3.2-11bLlama 3.2 vision model with 11B parameters
meta/llama-3.2-3bCompact Llama 3.2 for lightweight tasks
meta/llama-3.2-1bSmallest Llama 3.2, ultra-fast and ultra-cheap
mistral/mistral-largeMistral flagship, strong multilingual and reasoning
mistral/pixtral-largeMultimodal model with vision capabilities
mistral/mistral-large-3Mistral Large 3 675B, flagship model for complex tasks
mistral/devstral-2Devstral 2 123B, purpose-built for software engineering
mistral/magistral-smallMagistral Small with strong reasoning at low cost
mistral/ministral-14bMinistral 14B, balanced small model
mistral/codestralSpecialized code generation model by Mistral
mistral/voxtral-smallVoxtral Small — speech-to-text
mistral/voxtral-miniVoxtral Mini — compact speech-to-text
minimax/minimax-m2.5MiniMax M2.5, fast output with reasoning
minimax/minimax-m2.1MiniMax M2.1 with web search support
minimax/minimax-m2MiniMax M2, solid general-purpose model via Bedrock
minimax/minimax-m2.7MiniMax M2.7 — latest reasoning and code capabilities
minimax/speech-2.8-hdMiniMax Speech 2.8 HD — high-quality TTS
minimax/speech-2.8-turboMiniMax Speech 2.8 Turbo — fast TTS
amazon/nova-microAmazon fastest text-only model, ultra-low cost
amazon/nova-liteMultimodal model for image, video and text at low cost
amazon/nova-proAmazon most capable Nova for accuracy and complex tasks
amazon/nova-premierAmazon flagship model for complex reasoning with 1M context
amazon/nova-2-liteAmazon Nova 2 Lite — fast and affordable
amazon/nova-2-proAmazon Nova 2 Pro — advanced reasoning and vision
amazon/nova-embed-multimodalAmazon Nova Embed — text and image embedding
amazon/nova-sonicAmazon Nova Sonic — speech and audio model
amazon/nova-reel-1.1Amazon Nova Reel 1.1 — video generation
amazon/nova-reel-1.0Amazon Nova Reel 1.0 — video generation
cohere/command-r-plusEnterprise-grade RAG and tool use specialist
cohere/command-rEfficient model optimized for retrieval tasks
cohere/command-aLatest Command model with improved reasoning
cohere/embed-v4Cohere Embed v4 — state-of-the-art embedding model
cohere/embed-multilingual-v3Cohere multilingual embedding, 100+ languages
cohere/rerank-3.5Cohere Rerank 3.5 — search result reranking
nvidia/nemotron-super-3-120bNVIDIA Nemotron Super 3 120B, top-tier open model
nvidia/nemotron-nano-3-30bNVIDIA Nemotron Nano 3 30B, efficient and fast
google/gemma-3-27bGoogle Gemma 3 27B, capable open model with vision
google/gemma-3-12bGoogle Gemma 3 12B, balanced open model
google/gemma-3-4bGoogle Gemma 3 4B, ultra-compact and fast
ai21/jamba-1.5-largeAI21 hybrid SSM-Transformer with 256K context window
ai21/jamba-1.5-miniCompact Jamba model, fast and affordable with 256K context
microsoft/phi-4Phi-4 — efficient small language model
microsoft/phi-4-reasoningPhi-4 Reasoning — enhanced chain-of-thought
microsoft/phi-4-miniPhi-4 Mini — compact and efficient
microsoft/phi-4-multimodalPhi-4 Multimodal — vision-capable small model
writer/palmyra-x5Palmyra X5 — enterprise AI writing and analysis
writer/palmyra-x4Palmyra X4 — versatile enterprise model
writer/palmyra-visionPalmyra Vision — multimodal document understanding
twelvelabs/marengo-embed-3.0Marengo Embed 3.0 — multimodal video embedding
kling/kling-v3-videoKling V3 — high-quality AI video generation by Kuaishou
kling/kling-v3-omni-videoKling V3 Omni — multi-modal video generation with enhanced quality
pixverse/pixverse-v6PixVerse V6 — general-purpose AI video generation with multi-shot support
pixverse/pixverse-c1PixVerse C1 — dynamic scenes, fighting and magic effects
vidu/viduq3-proVidu Q3 Pro — professional text-to-video generation
vidu/viduq3-turboVidu Q3 Turbo — fast text-to-video at lower cost
azure/mai-image-2Microsoft MAI-Image-2 — photorealistic image generation
azure/flux-kontext-proFLUX.1 Kontext Pro — context-aware image editing via Azure
google/lyria-3-pro-previewLyria 3 Pro — full-length AI music generation by Google DeepMind
google/lyria-3-clip-previewLyria 3 Clip — 30-second AI music clips by Google DeepMind
google/imagen-4.0Imagen 4.0 — high-quality image generation
google/imagen-4.0-fastImagen 4.0 Fast — quick affordable image generation
amazon/titan-embed-text-v2Amazon Titan Text Embed v2
azure/flux-2-proFLUX.2 Pro — professional image generation
google/imagen-4.0-ultraImagen 4.0 Ultra — highest quality generation
google/veo-3.1Veo 3.1 — high-quality video generation
google/veo-3.1-fastVeo 3.1 Fast — quick video generation
google/veo-3.1-liteVeo 3.1 Lite — affordable video generation