DeepSeek V4 Launches with 1 Trillion Parameters
DeepSeek V4 arrives with 1 trillion parameters and 32 billion active, featuring native multimodal support and 1M+ token context. The MODEL1 architecture with tiered KV cache storage cuts memory by 40%, while sparse FP8 decoding achieves 1.8x inference speedup.