An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, "distillation" from other models, impact on Nvidia, AGI, and more (Ben Thompson/Stratechery)

Ben Thompson / Stratechery: An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, “distillation” from other models, impact on Nvidia, AGI, and more  —  It's Monday, January 27.  Why haven't you written about DeepSeek yet?  —  I did!  I wrote about R1 last Tuesday.

Jan 27, 2025 - 16:39
 0
An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, "distillation" from other models, impact on Nvidia, AGI, and more (Ben Thompson/Stratechery)

Ben Thompson / Stratechery:
An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, “distillation” from other models, impact on Nvidia, AGI, and more  —  It's Monday, January 27.  Why haven't you written about DeepSeek yet?  —  I did!  I wrote about R1 last Tuesday.