Build with the latest DeepSeek models
DeepSeek currently offers two LLM access points via its API: deepseek-chat and deepseek-reasoner. According to the official documentation, both currently correspond to DeepSeek-V3.2 with a 128K context window; deepseek-chat stands for Non-Thinking mode, deepseek-reasoner for Thinking mode. The API is OpenAI-compatible and supports, among other things, JSON Output, Tool Calls, Chat Prefix Completion, and, in the case of deepseek-chat, additionally FIM Completion.DeepSeek API
LLM "Build with the latest DeepSeek models"
deepseek-v4-flash Faster, more efficient model with Thinking and Non-Thinking mode, 1M context, JSON output, tool calls, and chat prefix completion.
deepseek-v4-pro More powerful model for more complex reasoning, coding, agents, and long contexts; also supports 1M context, JSON output, and tool calls.
OpenAI-/Anthropic-compatible API Usage via an OpenAI-compatible base URL or an Anthropic-compatible endpoint; suitable for existing SDKs and agent tools.
Open Weights / Self-hosting path DeepSeek-V4 was released as an open-weights family; self-hosting requires your own infrastructure and should be considered separately from the official API.