Models
27+ models from 7+ providers. Many are free.
DeepSeek R1 0528
DeepSeek
Latest R1 update, near o3-level reasoning performance. Free on Waterfall.
Llama 4 Maverick
Meta
Latest Llama with 1M context and excellent tool calling. Free on Waterfall.
Sonar Reasoning
Perplexity
DeepSeek R1 + web search. Reasoning with real-time data.
Claude Sonnet 4
Anthropic
Anthropic's latest balanced model. Excellent coding and reasoning.
Kimi K2
Moonshot
Excellent tool calling model on Groq free tier. 1000 req/day free.
DeepSeek R1
DeepSeek
State-of-the-art reasoning model competitive with o1. Completely free through Waterfall.
Gemini 2.5 Pro
Google's most capable model with 1M context. Free experimental access through Waterfall.
Llama 4 Scout
Meta
Fast Llama 4 variant with 512K context. Free with tool calling support.
QwQ 32B
Qwen
Qwen's reasoning model. Competitive with DeepSeek R1 at 32B parameters. Free.
Gemma 3 27B
Open-weights model from Google. Great general purpose performance. Free.
DeepSeek V3 Chat
DeepSeek
Excellent all-around chat model. 128K context. Free on Waterfall.
Mistral Small 3.1
Mistral
Mistral's best small model. Tool calling, 128K context. Free.
Gemini 2.0 Flash
Fast multimodal model with vision and tool use. 1M context. Free.
Nemotron Nano 8B
NVIDIA
NVIDIA-optimized Llama 3.1. Very fast, 128K context. Free.
Llama 4 Scout
Meta
Llama 4 Scout on Groq. Ultra-fast inference at $0.11/M input tokens.
DeepSeek V3
DeepSeek
Full DeepSeek V3. Excellent quality at extremely low cost.
Gemini 2.5 Flash
Fast and capable. Great for high-volume use cases.
GPT-4o Mini
OpenAI
OpenAI's small model. Fast, cheap, reliable tool calling.
Sonar
Perplexity
Quick Q&A with web search and citations. Real-time information.
Sonar Pro
Perplexity
Deeper analysis with web search. Multi-step research capability.
GPT-4o
OpenAI
OpenAI's flagship multimodal model. Excellent all-around performance.
Gemini 2.5 Pro
Google's best model. 1M context with multimodal capabilities.
DeepSeek R1
DeepSeek
Full R1 reasoning model. o1-level performance at a fraction of the cost.
o1
OpenAI
OpenAI's reasoning model. Excels at math, science, and complex logic.
Claude Opus 4
Anthropic
Anthropic's most capable model. Best-in-class for complex tasks.
Qwen3 32B
Qwen
Qwen3 on Groq free tier. Tool calling support, 1000 req/day.
Llama 3.3 70B
Meta
Proven reliable Llama model. 128K context with tool calling. Free.