NVIDIA, 0 Hot, 0 comments, 3 min read, 2 hours ago

NVIDIA Unlocks AI Compute at Scale, Inviting Partners to Power the AI Infrastructure Buildout

NVIDIA is launching a new business model to expand AI compute access for startups, model builders, and enterprises. The model allows AI cloud partners to procure NVIDIA infrastructure through a revenue-sharing and credit-support arrangement, enabling faster deployment of large-scale, multi-tenant AI factories without the typical delays of site selection, power procurement, and hardware setup. Sharon AI and Firmus are among the first partners, with Sharon AI deploying up to 40,000 Grace Blackwell GB300 GPUs and Firmus building a 360-megawatt campus in Indonesia with up to 170,000 GPUs.


#cloud #nvidia #gpu #ai-infrastructure

Source: https://blogs.nvidia.com/blog/nvidia-unlocks-ai-compute-at-scale-capital-partners-to-power-ai-infrastructure-buildout. 8sync News only summarizes and links to the article; copyright in the content belongs to the author and the original source.

Recommended for you

sean goedecke, 14 Hot, 6 min read, 5 days ago, AI

AI inference is obviously profitable

A rough cost analysis suggests that AI inference is in fact profitable: the estimated cost is about $1 per million output tokens, well below the $4.50 or more charged by providers such as OpenAI, which implies gross margins of 70–80%. Inference itself makes money, but AI labs such as OpenAI and Anthropic use those profits to offset the high cost of training models.
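As a quick check on the arithmetic, the margin implied by the two quoted figures can be worked out directly. The sketch below is illustrative only: the function name `gross_margin` is hypothetical, and it assumes the standard definition of gross margin as (price - cost) / price, applied to the roughly $1 cost and $4.50 price per million output tokens quoted in the summary.

```python
def gross_margin(price_per_million: float, cost_per_million: float) -> float:
    """Return gross margin as a fraction of price: (price - cost) / price."""
    if price_per_million <= 0:
        raise ValueError("price must be positive")
    return (price_per_million - cost_per_million) / price_per_million


# Figures quoted in the summary, in USD per million output tokens.
estimated_cost = 1.00
list_price = 4.50

margin = gross_margin(list_price, estimated_cost)
print(f"Gross margin: {margin:.1%}")  # prints "Gross margin: 77.8%"
```

At the $4.50 price point the result falls inside the 70–80% range cited in the summary; a higher price with the same estimated cost would push the margin up further.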

For developers who want to optimize the cost of their AI applications, this article clarifies how profitable AI inference actually is. That understanding can help in building an efficient business model and in spotting cost savings without relying on support from large companies.

#llm

NVIDIA BioNeMo Agent Toolkit Brings Accelerated AI to Life Sciences Researchers in Claude Science

Anthropic integration with Modal brings scalable compute to Claude Science

OpenAI and Broadcom build a chip to rival Nvidia’s Blackwell

Oracle is slashing its workforce as it automates with AI

Claude Meets Blackwell Ultra: Anthropic’s Models Now Run on NVIDIA GB300 in Azure

The AI memory crisis just hit DDR2, a standard from 2003, with 60% price hikes

AI won't be powered by better models alone, says Oxylabs CEO Vytautas Savickas