What is a Context Window?
The maximum number of tokens an LLM can process in a single request, including both input and output.
The context window defines the maximum amount of text (measured in tokens) that a language model can consider at once. It includes the system prompt, conversation history, user input, and the model's output. Exceeding the context window causes the model to truncate or reject the request.
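To make the budget concrete, here is a minimal sketch of a pre-flight window check. It uses a rough 4-characters-per-token heuristic rather than a real tokenizer, and the window and output-reserve constants, `estimate_tokens`, and `fits_in_window` are illustrative names, not part of any specific API.

```python
# Rough illustration of a context-window budget check.
# The ~4 chars/token heuristic is an approximation for English text;
# accurate counts require the model's own tokenizer.

CONTEXT_WINDOW = 128_000    # assumed window size (e.g. a 128K-token model)
MAX_OUTPUT_TOKENS = 4_096   # tokens reserved for the model's reply

def estimate_tokens(text: str) -> int:
    """Crude estimate: roughly 4 characters per token."""
    return max(1, len(text) // 4)

def fits_in_window(system_prompt: str, history: list[str], user_input: str) -> bool:
    """True if prompt + history + input + reserved output fit the window."""
    used = (estimate_tokens(system_prompt)
            + sum(estimate_tokens(turn) for turn in history)
            + estimate_tokens(user_input)
            + MAX_OUTPUT_TOKENS)
    return used <= CONTEXT_WINDOW
```

A check like this lets an application refuse or trim a request before the API rejects it, instead of discovering the overflow from an error response.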
Context windows vary significantly across models: GPT-4o supports 128K tokens, Claude 3.5 Sonnet supports 200K, and Gemini 1.5 Pro supports up to 2M tokens. Larger context windows enable longer conversations and whole-document analysis, but processing more input tokens also increases cost and latency.
Efficient context management — keeping only relevant history, summarizing old turns — is a key part of token optimization. GateCtr's Context Optimizer automatically trims and compresses context to stay within efficient token ranges.
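The "keep only relevant history" idea can be sketched as a simple trimming pass that drops the oldest turns first. This is a generic illustration of the technique, not GateCtr's actual implementation; `estimate_tokens` is the same approximate 4-chars-per-token heuristic as a stand-in for a real tokenizer, and `trim_history` is a hypothetical helper.

```python
# Sketch: keep the most recent conversation turns that fit a token budget,
# dropping the oldest turns first. Token counts are approximated.

def estimate_tokens(text: str) -> int:
    """Crude estimate: roughly 4 characters per token."""
    return max(1, len(text) // 4)

def trim_history(history: list[str], budget_tokens: int) -> list[str]:
    """Walk the history newest-first, keeping turns until the budget is spent."""
    kept: list[str] = []
    used = 0
    for turn in reversed(history):
        cost = estimate_tokens(turn)
        if used + cost > budget_tokens:
            break
        kept.append(turn)
        used += cost
    return list(reversed(kept))  # restore chronological order
```

Dropping oldest-first preserves the turns most likely to matter for the next reply; a production system might instead summarize the dropped turns so no information is lost outright.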
GateCtr manages the context window automatically on every API call, with no configuration required. The results are visible in real time in the GateCtr dashboard, with per-request breakdowns of tokens, cost, and savings.