Skip to content
No models found
OpenRouter
© 2026 OpenRouter, Inc

Product

  • Chat
  • Rankings
  • Apps
  • Models
  • Providers
  • Pricing
  • Enterprise
  • Labs

Company

  • About
  • Blog
  • CareersHiring
  • Privacy
  • Terms of Service
  • Support
  • State of AI
  • Works With OR
  • Data

Developer

  • Documentation
  • API Reference
  • SDK
  • Status

Connect

  • Discord
  • GitHub
  • LinkedIn
  • X
  • YouTube
Favicon for deepseek

DeepSeek: DeepSeek V3 Base

deepseek/deepseek-v3-base

Note that this is a base model mostly meant for testing, you need to provide detailed prompts for the model to return useful responses.

DeepSeek-V3 Base is a 671B parameter open Mixture-of-Experts (MoE) language model with 37B active parameters per forward pass and a context length of 128K tokens. Trained on 14.8T tokens using FP8 mixed precision, it achieves high training efficiency and stability, with strong performance across language, reasoning, math, and coding tasks.

DeepSeek-V3 Base is the pre-trained model behind DeepSeek V3

Model weights

Modalities

Context

Avg

131K

Released

Mar 29, 2025

Knowledge Cutoff

Jul 2024

Activity

Activity

Token volume and request traffic to this model over time.