➀ Cerebras Systems' wafer-scale AI chip (WSE-3) outperforms the fastest GPUs by 57 times in executing DeepSeek-R1 models with 70 billion parameters.
➁ Cerebras CEO Andrew Feldman states that enterprise clients are highly enthusiastic about DeepSeek's new R1 inference model, with a surge in demand within ten days of its launch.
➂ The WSE-3 chip, made on a 12-inch wafer with TSMC's 5nm process, has 4 trillion transistors, 900,000 AI cores, 44GB on-chip SRAM, and a total memory bandwidth of 21PB/s, with a peak performance of 125 FP16 PetaFLOPS.
➃ DeepSeek-R1 offers performance comparable to advanced inference models from OpenAI at a low training cost and has been open-sourced, allowing tech firms to build AI applications and chip manufacturers to optimize for the model.
➄ Andrew Feldman emphasizes that while DeepSeek poses some risks, users should exercise basic judgment, as seen with the use of electric saws.