Recent #NVIDIA news in the semiconductor industry

3 months ago

➀ NVIDIA collaborates with Texas Instruments, Infineon, STMicroelectronics, and others to advance 800Vdc power architecture for AI data centers;

➁ 800Vdc distribution aims to improve power density by 85% and efficiency via dc-dc conversion, supporting future 1MW AI server racks;

➂ The project involves power component leaders like Delta, Schneider Electric, and Navitas to optimize energy infrastructure.

NVIDIATexas Instrumentsdata centres
4 months ago

➀ The article compares the performance and cost efficiency of AMD and NVIDIA GPUs for various AI tasks such as chat, translation, reasoning, and summarization.

➁ It highlights the MI325X and MI300X as cost-effective options for Llama3 70B chat and translation tasks.

➂ The analysis reveals that AMD GPUs are less cost-effective in rental scenarios due to limited availability and higher prices.

➃ The article discusses the need for better inference benchmarks and explores the features and capabilities of NVIDIA's Dynamo framework.

AMDGPUNVIDIAbenchmarkperformance
4 months ago

➀ NVIDIA plans to launch the Blackwell-based RTX PRO 6000D (B40) in China as a cheaper alternative to the banned H20 GPU, priced between $6,500–$8,000;

➁ The B40 uses GDDR7 memory, avoids TSMC’s CoWoS packaging, and lacks NVLink support, limiting multi-GPU scalability;

➂ Deliveries could begin by Q3 2025, relying on PCIe 6.0 switches and Spectrum-X networking for server integration.

GPUHBMNVIDIA
4 months ago

➀ Intel Xeon 6 processors with High-Priority Cores (P-cores) are positioned as a key feature in NVIDIA DGX B300 AI servers, using the 64-core Xeon 6776P CPU for enhanced workload distribution;

➁ Intel markets its Xeon 6700P's 2DPC memory speed advantage (5,200 MT/s) over AMD EPYC 9005 (4,000 MT/s), though AMD offers more memory channels and capacity;

➂ NVIDIA's preference for Intel CPUs in GPU servers stems from reduced competitive overlap in AI accelerators, benefiting from Intel's canceled Rialto Bridge and Falcon Shores projects.

AIIntelNVIDIA
4 months ago

➀ Intel launched three new Xeon 6 P-Core CPUs (6732P, 6774P, 6776P) optimized for GPU-powered AI workloads, featuring dynamic core prioritization and up to 8TB memory support;

➁ The processors debut in Nvidia's DGX B300 AI systems with 30% faster memory speeds vs. AMD EPYC and 2.3x higher bandwidth over prior-gen Xeon;

➂ The 350W TDP chips include Intel AMX with FP16 support for AI tasks, alongside a lower-power 235W 40-core B-variant model.

AIIntelNVIDIA
4 months ago

➀ Nvidia announced the Grace Blackwell Ultra AI chip (GB300) shipping in Q3 and a 4x performance boost in its RTX Pro Server for AI models;

➁ Nvidia and Infineon collaborated on centralized 800V HVDC power architecture for AI data centers to optimize space and efficiency;

➂ Wolfspeed faces bankruptcy due to $6.5B debt and loss of key clients like Tesla and Volkswagen, reducing its SiC market share from 32% to 19%.

InfineonNVIDIAWolfspeed
4 months ago

➀ Nvidia and Infineon collaborate to develop a centralized 800V HVDC power architecture for AI data centers, aiming to replace decentralized AC/DC systems and reduce energy losses;

➀ The new approach minimizes power conversion stages, improves efficiency, and supports higher power densities (projected over 1MW per rack by 2030), critical for large-scale AI workloads involving 100,000+ GPUs;

➂ Centralized HVDC enhances scalability, reliability, and sustainability while meeting the performance demands of next-gen AI infrastructure.

AIInfineonNVIDIA
4 months ago

➀ AMD announces Radeon RX 9060 XT with 8GB ($299) and 16GB ($349) variants, targeting competition against NVIDIA's RTX 5060 Ti.

➁ Featuring Navi 44 silicon on TSMC’s 4nm process, the GPU upgrades to PCIe 5.0 x16, offering 14% higher clock speeds and 11% more memory bandwidth than its predecessor.

➂ AMD highlights 6% performance lead over NVIDIA’s RTX 5060 Ti 8GB at 1440p, but 8GB VRAM model faces criticism for potential obsolescence.

AMDGPUNVIDIA
4 months ago

➀ AMD's Computex 2025 keynote focuses on client and workstation products, led by Senior VP Jack Huynh;

➁ Anticipated announcements include Zen 5-based Threadripper CPUs, mainstream RDNA 4 Radeon GPUs (response to NVIDIA's RTX 5060), and promotion of Ryzen AI MAX laptops;

➂ Emphasis on gaming, workstation hardware, and competition in the mainstream GPU market.

AMDGPUNVIDIA
4 months ago

➀ China aims to capture 40% of its domestic AI chip market in 2024, led by local players like Huawei's Ascend series, Cambricon, Alibaba's Hanguang 800, and Baidu's Kunlun chips;

➁ Global competitors such as Intel, AMD, Google (Trillium TPU), AWS (Trainium), and Meta (MTIA) are advancing their own AI processors, intensifying pressure on Nvidia;

➂ Imec's proposal for reconfigurable AI processors could disrupt current development paradigms, potentially challenging the strategies of major AI chip designers.

AlibabaHuaweiNVIDIA
4 months ago

➀ Razer unveils the thinnest Blade 14 laptop at Computex 2025, featuring Nvidia RTX 50 series GPUs (5060/5070) and AMD Ryzen AI 9 365 processor.

➀ The laptop boasts a redesigned thermal system with a 'Thermal Hood' and vapor chamber, a 14-inch 3K OLED 120Hz display, and up to 11-hour battery life.

➂ Priced from $2,299, it aims to challenge Asus TUF Gaming A14 as the top 14-inch gaming laptop.

AMDNVIDIAgaming
4 months ago

➀ Intel unveiled its Arc Pro B50 and B60 GPUs at Computex 2025, targeting the workstation and AI market with 16GB (B50) and 24GB (B60) memory capacities, emphasizing cost efficiency and AI inference capabilities;

➁ The B50 features a 70W bus-powered design, while the B60 supports up to 200W, with a focus on Linux-based virtualization (SR-IOV) and scalable configurations like dual-GPU systems for AI workloads;

➂ Intel also showcased its Panther Lake 18A processors, aiming to challenge TSMC's process leadership, with GPUs expected to launch in Q3 2025 to address the underserved low-end AI inference and server market.

AI ChipIntelNVIDIA
4 months ago
➀ Nvidia's AI PC software stack now supports over 750 games and applications, leveraging RTX GPUs' acceleration. TensorRT offers top AI performance with up to 2x gains over DirectML. The new NIM for RTX container aids development with pre-packaged generative AI models optimized for TensorRT. ➁ Project G-Assist expands with new plugins for popular apps and professional tools. Developer tools for building plugins are now available. ➂ Nvidia's focus on AI is evident, with gaming becoming a secondary focus.
NVIDIA
4 months ago

➀ NVIDIA announced NVLink Fusion at Computex 2025, enabling third-party CPUs and accelerators to integrate with its NVLink ecosystem through semi-custom designs;

➁ The technology includes two approaches: NVLink C2C for connecting custom CPUs to NVIDIA GPUs, and a proprietary NVLink 5 chiplet for third-party accelerators to join NVLink networks;

➂ While offering flexibility, all nodes must include NVIDIA hardware, reflecting a strategic balance between openness and maintaining control over its ecosystem.

ChipletNVIDIA
4 months ago

➀ NVIDIA announced the upgraded Grace Blackwell Ultra GPU (GB300) shipping in Q3 2025, alongside new AI infrastructure advancements;

➁ The RTX Pro Server system delivers 4x H100 performance for DeepSeek and 1.7x gains for Llama models, with NVLink Fusion enabling cross-company hardware compatibility;

➂ Future plans include the Feynman architecture (2028) and immediate production of the DGX Spark workstation, offering 1,000 AI TOPS for researchers.

AIGPUNVIDIA
4 months ago

➀ NVIDIA introduces NVLink Fusion, enabling external chipmakers to integrate custom CPUs and AI accelerators with its ecosystem, breaking past proprietary limitations;

➀ Key partners include Qualcomm, Fujitsu, Marvell, and Mediatek, with chiplet-based design and software support from Synopsys and Cadence;

➂ NVIDIA's move aims to counter the open-standard UALink consortium led by AMD, Intel, and Broadcom, consolidating its dominance in AI infrastructure.

AIHPCNVIDIA
4 months ago

➀ NVIDIA CEO Jensen Huang headlines the Computex 2025 keynote, focusing on AI advancements and data center innovations;

➁ The event emphasizes software updates and partnerships with Taiwanese manufacturers, following major hardware announcements at GTC 2025;

➂ The 90-minute keynote highlights NVIDIA's AI ecosystem and live-streaming accessibility via YouTube.

AIBlackwellNVIDIA