AI Models

Choose the AI model that powers your bot. Each model has different capabilities, speeds, and costs.

Available Models

GPT-4oRecommended

Best overall performance. Fast, accurate, and excellent at following instructions. Best for most production use cases.

GPT-4 TurboHigh-end

128K context window. Great for very long documents or complex reasoning. More expensive than GPT-4o.

GPT-3.5 TurboBudget

Faster and cheaper than GPT-4. Good for simple Q&A where cost is a concern.

Claude 3 OpusPremium

Most capable Claude model. Excellent reasoning, nuanced responses. Best for complex tasks.

Claude 3 SonnetBalanced

Great balance of performance and cost. Recommended for production workloads.

Claude 3 HaikuFast

Fastest Claude model. Best for high-volume, simple queries where speed matters.

Gemini ProGeneral

Google's multimodal model. Good general-purpose performance with competitive pricing.

Model	Speed	Quality	Cost	Context
GPT-4o	⚡⚡⚡	⭐⭐⭐⭐⭐	$$	128K
GPT-4 Turbo	⚡⚡	⭐⭐⭐⭐⭐	$$$	128K
GPT-3.5 Turbo	⚡⚡⚡⚡	⭐⭐⭐	$	16K
Claude 3 Opus	⚡⚡	⭐⭐⭐⭐⭐	$$$$	200K
Claude 3 Sonnet	⚡⚡⚡	⭐⭐⭐⭐	$$	200K
Claude 3 Haiku	⚡⚡⚡⚡⚡	⭐⭐⭐	$	200K
Gemini Pro	⚡⚡⚡	⭐⭐⭐⭐	$$	32K

temperaturenumberdefault: 0.7

Controls randomness in responses. Range: 0-2

For RAG Applications

Lower temperatures (0.3-0.5) often work better for RAG bots where accuracy matters more than creativity.

maxTokensintegerdefault: 1000

Maximum length of generated response in tokens

Model Switching

Different models may respond differently to the same prompt. Test thoroughly when switching models, especially your system prompt.