China's LineShine supercomputer has topped the TOP500 list for the first time since 2017, achieving 2.198 exaflops using an all-CPU design built entirely without Nvidia, AMD, or Intel chips. Housed at the National Supercomputing Center in Shenzhen, it uses custom LX2 processors based on Armv9, runs KylinOS (a Linux variant), and relies on a homegrown interconnect called LingQi. The machine is linked to Huawei. While it leads in high-precision scientific computing, it ranks only fourth on AI-style mixed-precision benchmarks, and major US hyperscaler clusters are absent from the rankings altogether. The achievement is framed as a direct response to US export controls, which critics say have inadvertently accelerated China's push for semiconductor self-sufficiency and exposed a loophole, since CPU exports face far looser restrictions than GPUs.
Source: https://thenextweb.com/news/china-lineshine-supercomputer-top500-no-us-chips. 8sync News only summarizes and links; copyright in the content belongs to the author and the original source.
OpenAI and Broadcom are jointly developing Jalapeño, a custom AI chip meant to compete with Nvidia Blackwell and Google TPU, targeting inference workloads. The chip has been tested with the GPT-5.3-Codex-Spark model and is expected to be deployed in late 2025, while the HBM shortage is weighing on Broadcom's profit margins.
Developers should read this piece to understand how major companies such as OpenAI and Broadcom collaborate on dedicated AI chips that optimize performance for large models like GPT-5.3, which directly affects the performance and cost of future AI applications.
NVIDIA has launched the NVIDIA Agent Toolkit, an open-source, modular platform that helps enterprises build trustworthy, specialized AI agents. The toolkit integrates Nemotron models (customizable reasoning), NemoClaw (safe-behavior guarantees), and OpenShell (secure execution), and is being deployed in fields such as healthcare, cybersecurity, and chip design.
Developers who specialize in AI should read this piece to understand how to build specialized, safe, and controllable agent systems, helping them apply knowledge of open-source models, security, and integration to real-world enterprise projects.
An Arm-sponsored piece argues that CPUs play a critical but underappreciated role in agentic AI infrastructure. While accelerators handle model performance, CPUs act as the control plane, managing data movement, workload scheduling, and secure isolation. Arm's Neoverse platform underpins custom silicon from AWS (Graviton), Google (Axion), Microsoft (Azure Cobalt), and NVIDIA (Grace Hopper/Blackwell), all reflecting a shift toward purpose-built Arm-based processors in cloud and AI datacenters. The piece introduces the Arm AGI CPU, built with Meta, targeting rack-level density for agentic AI deployments.
TensorX and Solstice have announced a $1bn financing facility to fund AI hardware and data-centre capacity across the EU, targeting the growing demand for sovereign compute that stays on European soil. Alongside this, Solstice is launching aiUSX, a yield-bearing asset that lets companies put idle AI-earmarked capital to work as infrastructure lending. The product is capped at $5m at launch and is designed to generate yield that can later offset inference costs. Both companies operate within the Deus X Capital ecosystem, which positions itself as the connective tissue enabling the partnership.
NVIDIA's GeForce NOW is running summer membership discounts alongside the Steam Summer Sale, offering $70 off a 12-month Ultimate membership and $35 off a Performance membership. The Ultimate tier delivers RTX 4080/5080-class cloud performance at up to 4K/120fps with DLSS and ray tracing. Six new games join the GeForce NOW library this week, headlined by Devolver Digital's Dark Scrolls and Square Enix's The Adventures of Elliot: The Millennium Tales.
The RTX 50 series launched with headline features like Multi Frame Generation, Ray Reconstruction, and Neural Texture Compression that were either unfinished or lacked broad software adoption. Months after launch, major fixes and updates are still arriving, and the most compelling exclusive features primarily benefit 4K gaming — a niche most PC gamers don't occupy. RTX 40-series owners already receive the biggest DLSS 4.5 image quality improvements, leaving the 50 series in an awkward middle ground. The author argues the generation feels like a transitional stepping stone, with the upcoming RTX 60 series (Rubin) positioned to be the hardware that fully realizes Nvidia's long-term rendering ambitions.
AMD's Ryzen AI Halo Developer Platform is a $3,999 mini PC powered by the Ryzen AI Max+ 395 APU with 128GB of unified memory, targeting local AI professionals who need to run massive LLMs without discrete GPU constraints. It can handle 200B parameter models, outpacing even the RTX 5090 in raw model capacity, while undercutting Nvidia's competing DGX Spark (now $4,699) on price. The machine ships with AMD's Ryzen AI Developer Center pre-configured, reducing the historically painful ROCm setup. However, ROCm still lags behind CUDA in maturity — Ollama can still require manual GPU path configuration, and quantization library support arrives later than on CUDA. AMD's upcoming Gorgon Halo platform promises 192GB of unified memory and 300B parameter model support, but closing the software gap with Nvidia remains the key challenge.
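The capacity figures above come down to simple arithmetic on weight size. The sketch below is a rough illustration, not AMD's or Nvidia's own sizing method: it assumes 4-bit quantized weights (the summary does not state a precision), counts weights only (no KV cache, activations, or OS overhead), and uses 32 GB for the RTX 5090's VRAM, a figure not given in the summary. The function names are hypothetical.

```python
def weight_footprint_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (1 GB = 1e9 bytes).

    params_billion * 1e9 weights * bits_per_weight / 8 bits per byte / 1e9
    simplifies to the expression below.
    """
    return params_billion * bits_per_weight / 8


def weights_fit(params_billion: float, bits_per_weight: float, memory_gb: float) -> bool:
    """True if the weights alone fit in memory_gb (ignores runtime overhead)."""
    return weight_footprint_gb(params_billion, bits_per_weight) <= memory_gb


if __name__ == "__main__":
    # (label, parameters in billions, memory in GB); 4-bit weights are an assumption.
    cases = [
        ("Ryzen AI Halo, 200B model", 200, 128),
        ("RTX 5090 (32 GB VRAM), 200B model", 200, 32),
        ("Gorgon Halo, 300B model", 300, 192),
    ]
    for label, params, memory in cases:
        size = weight_footprint_gb(params, 4)
        print(f"{label}: {size:.0f} GB of weights, fits={weights_fit(params, 4, memory)}")
```

Under these assumptions a 200B model needs about 100 GB for weights, which is why it fits in 128 GB of unified memory but not on a single 32 GB card. At 16-bit precision the same model would need roughly 400 GB and would not fit on either machine.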
Orange Pi 6 is a new compact SBC (90x90mm) powered by the CIX P1 (CD8180) 12-core Arm Cortex-A720/A520 SoC with up to 32GB LPDDR5 RAM. Compared to the larger Orange Pi 6 Plus, it features 2.5GbE instead of 5GbE networking, drops LiPo battery support, and comes in a smaller form factor. Key specs include dual M.2 PCIe Gen4 x4 slots, multiple display outputs, a 28.85 TOPS NPU, and support for Debian, Ubuntu, Android, Windows 11, and OpenHarmony. Pricing starts at $239 for the 8GB model, reflecting the high cost of LPDDR5 RAM, making it significantly pricier than typical Orange Pi boards.