Recent #LLM news in the semiconductor industry

5 months ago

➀ Retrieval Augmented Generation (RAG) is being developed by Fraunhofer IWU to streamline the process of finding crucial information in extensive technical and legal texts.

➁ The technology is designed to complement Large Language Models (LLMs) by providing precise and exhaustive information.

➂ The team at IWU is using the EU Machinery Regulation (2023/1230) as a demonstration of the technology's capabilities.

AI, Data Processing, LLM, Large Language Models
5 months ago

Retrieval Augmented Generation (RAG) is making it easier to find information in extensive documents by using large language models (LLMs) and a retrieval system. This method ensures precise and comprehensive information retrieval, which is particularly useful for legal texts and user manuals. The Fraunhofer IWU is developing this technology, which can be used on standard PCs and in the cloud, ensuring data security and privacy.
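The retrieve-then-generate pattern described above can be sketched in a few lines. This is a minimal illustration only: the document snippets are invented, and the word-overlap scorer stands in for the vector-embedding retrieval and LLM call a production system such as Fraunhofer IWU's would use.

```python
# Minimal sketch of the RAG pattern: retrieve the most relevant passages,
# then prepend them as context to the prompt sent to a language model.
# Scoring here is naive word overlap; real systems use vector embeddings.
def retrieve(query, documents, top_k=2):
    q_words = set(query.lower().split())
    scored = sorted(
        documents,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return scored[:top_k]

def build_prompt(query, documents):
    context = "\n".join(retrieve(query, documents))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

# Illustrative snippets (not quotes from the actual regulation text):
docs = [
    "Annex III of the Machinery Regulation lists essential safety requirements.",
    "The Machinery Regulation applies from 20 January 2027.",
    "Unrelated note about office supplies.",
]
prompt = build_prompt("When does the Machinery Regulation apply?", docs)
```

Because only the retrieved passages enter the prompt, the LLM's answer stays grounded in the source documents rather than in its training data.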

AI, LLM, data security, machine learning
6 months ago

➀ Apple's AI-powered Siri update, initially planned for iOS 19, has been delayed, highlighting Apple's struggles in AI development.

➁ The integration of advanced AI features is behind schedule, with a modernized Siri not expected until iOS 20 in 2027.

➂ Competitors have surpassed Apple, and internal challenges, including leadership and resource issues, hinder progress.

AI, AI Development, Alexa, Apple, ChatGPT, Grok, Innovation, LLM, Siri, iOS 19, iOS 20, technology
6 months ago

➀ AI software modeling represents a significant shift from traditional programming, enabling systems to learn from data.

➁ The complexity of AI systems lies in their model parameters, which can number in the billions or trillions.

➂ GPUs have become essential for AI processing, but they face efficiency challenges, particularly during inference with large language models.

AI, AI Accelerators, ASIC, Computational Efficiency, GPU, Hardware, LLM, Memory Bandwidth
8 months ago
➀ Researchers found that even 0.001% misinformation in AI training data can compromise the entire system.

➁ The study injected AI-generated medical misinformation into a commonly used LLM training dataset, leading to a significant increase in harmful content.

➂ The researchers emphasized the need for better safeguards and security research in the development of medical LLMs.
AI, AI Corruption, AI Ethics, AI Security, Data Misinformation, Healthcare, LLM
11 months ago
➀ Dell has launched the new PowerEdge XE9712 with NVIDIA GB200 NVL72 AI servers, offering 30x faster real-time LLM performance than the H100 AI GPU.

➁ The system features 72 B200 AI GPUs connected with NVLink technology for high-bandwidth interconnectivity.

➂ Dell highlights the liquid-cooled design for maximizing datacenter power utilization and rapid deployment of AI clusters.
AI, Data center, Dell, GPU, LLM, NVIDIA, Performance, Training, inference
11 months ago
➀ SK hynix has begun mass production of the world's first 12-layer HBM3E memory, with a capacity of up to 36 GB and a bandwidth of 9.6 Gbps.

➁ The new memory is designed for AI GPUs and is set to be supplied to NVIDIA within 12 months.

➂ SK hynix aims to maintain its leadership in AI memory with the introduction of this new technology.
AI, AI GPUs, Bandwidth, Blackwell, H200, HBM3E, Hopper H100, LLM, Llama 3 70B, NVIDIA, SK hynix, memory
about 1 year ago
➀ OpenAI introduces 'GPT-4o mini', a cost-effective language model priced at $0.15 per 1 million input tokens and $0.60 per 1 million output tokens, significantly cheaper than previous models and 60% cheaper than GPT-3.5 Turbo.

➁ The model is designed for applications requiring low cost and low latency, such as chaining or parallelizing model calls, handling large amounts of context, and real-time text responses for customer interactions.

➂ The current API supports text and vision, with video and audio input/output planned for the future. It offers a 128K-token context window, up to 16K output tokens per request, and knowledge up to October 2023. A shared, upgraded tokenizer with GPT-4o improves cost-efficiency for non-English text, and the model scores highly on benchmarks, outperforming competitors such as 'Gemini Flash' and 'Claude Haiku'.
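At the quoted rates, the per-request cost is simple arithmetic. The token counts in the example are hypothetical usage figures, not numbers from the announcement:

```python
# GPT-4o mini rates as quoted: $0.15 per 1M input tokens,
# $0.60 per 1M output tokens.
INPUT_RATE = 0.15 / 1_000_000   # USD per input token
OUTPUT_RATE = 0.60 / 1_000_000  # USD per output token

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of a single API request."""
    return input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE

# Hypothetical example: a 10,000-token prompt with a 1,000-token reply
# costs roughly $0.0015 + $0.0006, i.e. about a fifth of a cent.
cost = request_cost(10_000, 1_000)
```

At that price point, even context-heavy workloads that repeatedly resend large prompts stay in the sub-cent range per call, which is what makes the chaining and parallelizing use cases in ➁ economical.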
Cost-Effective, GPT-4o mini, LLM
11 months ago
➀ A 30-billion-parameter LLM is demonstrated on a prototype inference device equipped with 16 IBM AIU NorthPole processors, achieving a system throughput of 28,356 tokens/second at a latency below 1 ms/token.

➁ NorthPole offers 72.7x better energy efficiency than GPUs, with lower latency than the lowest-latency GPU configuration.

➂ The NorthPole architecture is inspired by the brain, optimized for AI inference, and demonstrates superior performance in LLM inference.
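The reported figures can be put in perspective with some back-of-the-envelope arithmetic. This is illustrative only and not IBM's measurement methodology; note that aggregate throughput and per-token latency are independent metrics, since many requests are processed concurrently:

```python
# Reported figures for the 16-processor NorthPole prototype.
system_throughput = 28_356   # tokens per second, whole system
num_processors = 16

# Implied aggregate rate per processor card.
tokens_per_card = system_throughput / num_processors   # about 1,772 tokens/s

# System-level time per token. The separately reported <1 ms/token
# latency is a per-request figure and is a stricter, distinct claim.
seconds_per_token = 1 / system_throughput              # about 0.035 ms
```

The gap between the ~0.035 ms system-level figure and the <1 ms per-request latency reflects how many token streams the device services in parallel.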
GPU, LLM, energy efficiency
about 1 year ago
➀ The author reflects on the decision to start BosonAI, named after the particle in quantum physics, and the challenges in naming and branding.

➁ Details the financial rollercoaster, including a lead investor backing out at the last minute, and the subsequent successful completion of the funding round.

➂ Discusses the procurement of GPUs, highlighting the difficulties in obtaining H100s and the unexpected support from Nvidia's CEO.

➃ Shares the business achievements, including a balanced budget in the first year and the potential for LLM applications in various industries.

➄ Explores the technical evolution of LLM understanding, from initial excitement to practical applications and the pursuit of specialized models.

➅ Outlines the vision for human companionship through AI, acknowledging current limitations while expressing optimism for future developments.

➆ Emphasizes the importance of teamwork in entrepreneurship, contrasting the experience of working in a large corporation with the dynamics of a startup.

➇ Reflects on personal motivations for entrepreneurship, moving from a focus on fame and fortune to a deeper quest for creating meaningful value.
LLM, Startup, technology