Recent #GPU news in the semiconductor industry

about 1 year ago

➀ Nvidia's Ian Buck discusses the shift from focusing on single chips to integrated systems;

➁ The transformation of data centers into AI factories due to the rise of AI;

➂ The increasing power and cooling requirements of GPUs and Nvidia's involvement in the Coolerchips project.

AIData centerGPU
about 1 year ago
➀ Dell has launched the new PowerEdge XE9712 with NVIDIA GB200 NVL72 AI servers, offering 30x faster real-time LLM performance over the H100 AI GPU; ➁ The system features 72 x B200 AI GPUs connected with NVLink technology, providing lightning-fast connectivity; ➂ Dell highlights the liquid-cooled system for maximizing datacenter power utilization and rapid deployment of AI clusters.
AIData centerDellGPULLMNVIDIAPerformanceTraininginference
about 1 year ago
➀ Arm is exploring the feasibility of running LLMs on mobile devices; ➁ Arm's optimization techniques for LLMs on mobile; ➂ The importance of practical use cases for LLMs in mobile devices
2nm3D IC3nmAIAI PCAI chipAMDASUSArmCPUChipletDRAMDellEDAEMIBEUVGDDRGPUGaNHBMHPCInfineonLinuxMobileNPUNVIDIAPCIePrivacyRaspberry PiSEMICONDUCTORSK hynixSSDSwitchTIautomotivecoolingcybersecuritygamingiOSlaptopmemorymicrochipmonitorsoftware
about 1 year ago

➀ AMD has detailed the Instinct MI300X at Hot Chips 2024, with MI325X expected to be released soon.

➁ MI300X is a significant revenue source for AMD with over $4 billion in sales in the AI industry.

➂ AMD has acquired ZT Systems, the manufacturer of the Microsoft Azure MI300X platform.

➃ MI300X features a 192MB HBM3, a multi-chiplet chip for computing applications.

➄ AMD's CDNA 3 architecture has evolved with 8-stack HBM3 memory arrays, reaching 192GB in capacity.

➅ MI300X can operate as a single partition or across different memory and compute partitions.

➆ AMD's current major platform is the 8-way MI300X OAM platform.

➇ AMD discusses ROCm, which is improving.

➈ AMD's MI300X can compete with NVIDIA H100 in some cases.

➉ AMD is expected to release MI325X this year and Instinct MI350 288GB GPU in 2025.

AMDGPUNVIDIA
about 1 year ago

➀ Computing power is an important indicator of a computer's information processing capability, with AI computing power focusing on AI applications, commonly measured in TOPS and TFLOPS, and provided by dedicated chips such as GPU, ASIC, and FPGA for algorithm model training and inference.

➁ AI chip accuracy is a way to measure computing power level, with FP16 and FP32 used in model training, and FP16 and INT8 used in model inference.

➂ AI chips typically use GPU and ASIC architectures. GPUs are the key components in AI computing due to their advantages in computation and parallel task processing.

➃ Tensor Core, an enhanced AI computing core compared to the parallel computation performance of Cuda Core, is more focused on the deep learning field and accelerates AI deep learning training and inference tasks through optimized matrix operations.

➄ TPUs, a type of ASIC designed for machine learning, stand out in high energy efficiency in machine learning tasks compared to CPUs and GPUs.

AI ChipASICComputing PowerGPUTPU
about 1 year ago
1. Nvidia's upcoming Blackwell GPU is expected to drive significant revenue growth due to high demand from Data Centers and hyperscalers; 2. CEO Jensen Huang highlighted 'insane demand' for Blackwell GPUs, indicating strong pricing power and revenue potential; 3. The author's conservative estimate suggests Nvidia could generate $80-120B in incremental revenue from Blackwell in FY 2025, significantly higher than consensus forecasts.
GPUrevenue growth
about 1 year ago
➀ The research investigates the effects of adding TiC nanoparticles to aluminum alloy 7075, aiming to enhance casting performance and improve fluidity and surface quality; ➁ TiC nanoparticles were introduced in two concentrations, and their impact on fluidity and microstructure was analyzed; ➂ The results showed a significant improvement in fluidity and surface quality, with finer grain sizes and smoother surfaces.
2nm3D IC3nmAIAI ChipAMDArmAsusChipletCoolingDellEDAEMIBEUVGDDRGPUGaNHBMHPCInfineonLaptopLinuxMicrochipNPUNVIDIAPCIePrivacyRaspberry PiSSDSoftwareSwitchTIautomotivecpucybersecurityiosmemorymonitorsemiconductor