DeepSeek: DeepSeek V3 Base

deepseek/deepseek-v3-base

Note that this is a base model mostly meant for testing, you need to provide detailed prompts for the model to return useful responses.

DeepSeek-V3 Base is a 671B parameter open Mixture-of-Experts (MoE) language model with 37B active parameters per forward pass and a context length of 128K tokens. Trained on 14.8T tokens using FP8 mixed precision, it achieves high training efficiency and stability, with strong performance across language, reasoning, math, and coding tasks.

DeepSeek-V3 Base is the pre-trained model behind DeepSeek V3

Model weights

Modalities

Context

Avg

131K

Released

Mar 29, 2025

Knowledge Cutoff

Jul 2024

Activity

Token volume and request traffic to this model over time.

OpenRouter

Product

Chat
Rankings
Apps
Models
Providers
Pricing
Enterprise
Labs

Company

About
Blog
CareersHiring
Privacy
Terms of Service
Support
State of AI
Works With OR
Data

Developer

Documentation
API Reference
SDK
Status

Connect

Discord
GitHub
LinkedIn
X
YouTube

DeepSeek: DeepSeek V3 Base