# Deepseek Company and family of Open source [[Large Language Models (LLMs)]]. Models in the family: - [[DeepSeek v4]]: V4-Pro (1.6T total / 49B active) and V4-Flash (284B / 13B), released April 2026. Folds the R reasoning line into the base model. - [[DeepSeek V3]]: 671B total / 37B active MoE (Dec 2024 → V3.2 in 2025). The release that put DeepSeek on the global frontier-model map and the efficiency baseline for V4. - R1: 671B - R1-Zero - An R1 prototype fine-tuned using only unsupervised reinforcement learning (RL) - VL: Vision-Language understanding - Coder - Coder V2 - Math - VL - ... There are variants of R1, but those are distilled & fine-tuned models based on [[Qwant]], [[Llama]], etc. For example: DeepSeek-R1-Distill-Qwen-7B. That's really not R1. The only R1 is the 671B one. Deepseek also has an API platform. ## Notable researchers - [[Deli Chen]] — NLP researcher, ex-Peking University LANCO Lab ## References - Official website: https://www.deepseek.com/ - Chat: https://chat.deepseek.com/sign_in - Mobile app: https://cdn.deepseek.com/download-app/index.html - Coder: https://coder.deepseek.com/ - API documentation: https://api-docs.deepseek.com/ - API platform: https://platform.deepseek.com/ - LLM source code: https://github.com/deepseek-ai/DeepSeek-LLM - VL source code: https://github.com/deepseek-ai/DeepSeek-VL - Coder source code: https://github.com/deepseek-ai/DeepSeek-Coder - Coder v2 source code: https://github.com/deepseek-ai/DeepSeek-Coder-V2 - Math source code: https://github.com/deepseek-ai/DeepSeek-Math - Deepseek @ AWS: https://aws.amazon.com/blogs/aws/deepseek-r1-models-now-available-on-aws - Deepseek @ Azure AI Foundry: https://azure.microsoft.com/en-us/blog/deepseek-r1-is-now-available-on-azure-ai-foundry-and-github - Videos - Running R1 (671B) on a $2000 local server: https://www.youtube.com/watch?v=Tq_cmN4j2yY