The New Stack · 6 min read · 2 hours ago

The infrastructure lock-in costing AI companies hundreds of millions

AI workloads have evolved faster than the hardware built to support them, creating costly infrastructure lock-in for companies that optimized for today's models. Reasoning models, agents, and multimodal systems stress hardware differently than early LLMs did, forcing a rethink of GPU-centric strategies. Nvidia's Vera Rubin platform, AMD's Helios, Google's TPUs, Amazon's Trainium/Inferentia, and Microsoft's Maia 200 all reflect a shift toward system-level co-design over raw accelerator performance. Broadcom's custom ASIC revenue crossed $10B in a single quarter, with custom silicon growing at triple the rate of merchant GPUs. Tenstorrent's Jim Keller argues that adaptability, not peak FLOPS, is the key metric, citing Blackhole's use of standard Ethernet to slot into existing deployments. The central challenge: infrastructure refresh cycles span years, while AI models reinvent themselves every few months.

#nvidia #gpu #ai-infrastructure

Source: https://thenewstack.io/future-proof-ai-infrastructure. 8sync News only summarizes and links; copyright in the content belongs to the author and the original source.
