Recent Cerebras news in the semiconductor industry

4 months ago

➀ Cerebras Systems' wafer-scale AI chip (WSE-3) runs the 70-billion-parameter DeepSeek-R1 model 57 times faster than the fastest GPUs.

➁ Cerebras CEO Andrew Feldman says enterprise customers are highly enthusiastic about DeepSeek's new R1 reasoning model, with demand surging within ten days of its launch.

➂ The WSE-3 chip, fabricated on a 12-inch wafer with TSMC's 5nm process, packs 4 trillion transistors, 900,000 AI cores, 44 GB of on-chip SRAM, and 21 PB/s of aggregate memory bandwidth, with peak performance of 125 FP16 petaFLOPS.

➃ DeepSeek-R1 offers performance comparable to OpenAI's advanced reasoning models at low training cost; its open-source release lets tech firms build AI applications on it and chip makers optimize for it.

➄ Andrew Feldman emphasizes that while DeepSeek poses some risks, users should exercise basic judgment, much as they would when operating a power saw.

AI Chip · Cerebras
9 months ago
➀ Cerebras Systems introduces the WSE-3 AI chip, designed for training the largest AI models, built on 5nm technology with 4 trillion transistors.

➁ The chip features 900,000 AI-optimized cores, delivering 125 petaFLOPS of peak AI performance.

➂ Cerebras targets the inference market with a new service, claiming generation of 1,800 tokens per second, significantly outperforming Nvidia's H100.

➃ The company relies on on-chip SRAM for bandwidth, achieving 21 PB/s, versus 4.8 TB/s for Nvidia's HBM3e.

➄ Cerebras plans to support more models and aims for competitive pricing, starting at 10 cents per million tokens.
AI Chip · Cerebras · Inference Service
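A quick sanity check of the arithmetic behind these figures (a sketch only; the bandwidth, throughput, and pricing numbers are the vendor's own claims as quoted above, not independent measurements):

```python
# Bandwidth comparison quoted above: 21 PB/s on-chip SRAM (WSE-3)
# vs 4.8 TB/s for Nvidia's HBM3e.
SRAM_BW = 21e15      # bytes/s
HBM3E_BW = 4.8e12    # bytes/s
ratio = SRAM_BW / HBM3E_BW
print(f"bandwidth ratio: {ratio:.0f}x")  # → 4375x

# Implied cost of sustained generation at the claimed rate and
# starting price: 1,800 tokens/s at $0.10 per million tokens.
TOKENS_PER_SEC = 1_800
PRICE_PER_MTOK = 0.10  # dollars per million tokens
cost_per_hour = TOKENS_PER_SEC * 3600 / 1e6 * PRICE_PER_MTOK
print(f"cost at full rate: ${cost_per_hour:.3f}/hour")  # → $0.648/hour
```

So the quoted SRAM figure is roughly three and a half orders of magnitude above the HBM3e figure, and an hour of generation at the claimed peak rate would bill well under a dollar at the quoted starting price.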
9 months ago