Models & Pricing

The prices listed below are in units of per 1M tokens. A token, the smallest unit of text that the model recognizes, can be a word, a number, or even a punctuation mark. We will bill based on the total number of input and output tokens by the model.

Model Details

NOTE: The deepseek-chat and deepseek-reasoner correspond to the model version DeepSeek-V3.2 (128K context limit), which differs from the APP/WEB version.

MODEL deepseek-chat deepseek-reasoner
BASE URL https://api.deepseek.com
MODEL VERSION DeepSeek-V3.2
(Non-thinking Mode) DeepSeek-V3.2
(Thinking Mode)
CONTEXT LENGTH 128K
MAX OUTPUT DEFAULT: 4K
MAXIMUM: 8K DEFAULT: 32K
MAXIMUM: 64K
FEATURES Json Output ✓ ✓
Tool Calls ✓ ✓
Chat Prefix Completion（Beta） ✓ ✓
FIM Completion（Beta） ✓ ✗
PRICING 1M INPUT TOKENS (CACHE HIT) $0.028
1M INPUT TOKENS (CACHE MISS) $0.28
1M OUTPUT TOKENS $0.42

Deduction Rules

The expense = number of tokens × price. The corresponding fees will be directly deducted from your topped-up balance or granted balance, with a preference for using the granted balance first when both balances are available.

Product prices may vary and DeepSeek reserves the right to adjust them. We recommend topping up based on your actual usage and regularly checking this page for the most recent pricing information.

MODEL		deepseek-chat	deepseek-reasoner
BASE URL		https://api.deepseek.com
MODEL VERSION		DeepSeek-V3.2 (Non-thinking Mode)	DeepSeek-V3.2 (Thinking Mode)
CONTEXT LENGTH		128K
MAX OUTPUT		DEFAULT: 4K MAXIMUM: 8K	DEFAULT: 32K MAXIMUM: 64K
FEATURES	Json Output	✓	✓
	Tool Calls	✓	✓
	Chat Prefix Completion（Beta）	✓	✓
	FIM Completion（Beta）	✓	✗
PRICING	1M INPUT TOKENS (CACHE HIT)	$0.028
	1M INPUT TOKENS (CACHE MISS)	$0.28
	1M OUTPUT TOKENS	$0.42

Models & Pricing

Model Details​

Deduction Rules​

Model Details

Deduction Rules