Kubernetes00 bình luận17 phút đọc1 ngày trước

Spotlight on WG Device Management

The Kubernetes Device Management Working Group (WG Device Management) is driving a fundamental shift in how Kubernetes handles specialized hardware like GPUs, TPUs, and FPGAs. Their primary deliverable, Dynamic Resource Allocation (DRA), recently graduated to GA in Kubernetes 1.34. DRA replaces the legacy Device Plugin API — which treated devices as opaque integers — with a structured, declarative framework covering four stages: modeling (ResourceSlice API), requesting (ResourceClaim API), scheduling, and actuation. The working group chairs from NVIDIA, Intel, and Google discuss the NP-hard scheduling challenges, cross-SIG coordination across sig-node, sig-scheduling, sig-autoscaling, sig-network, and sig-architecture, and upcoming work on device health monitoring, topology-aware multi-node scheduling, and consumable capacity sharing. NVIDIA has donated its DRA GPU driver to the Kubernetes project, and the community is growing rapidly.

Đọc bài gốc

#kubernetes #distributed-systems

Nguồn: https://kubernetes.io/blog/2026/06/24/wg-device-management-spotlight-2026. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

Redpanda112 phút4 giờ trướcAI

Kafka's log compaction corrupts data. Here's how we fixed it

Apache Kafka có lỗ hổng trong cơ chế log compaction khiến dữ liệu bị hỏng do xung đột giữa compaction và replication, gây ra bốn vấn đề: dữ liệu đã xóa tái xuất hiện, giao dịch bị hủy hiện dưới dạng đã commit, dữ liệu đã commit bị ẩn, và consumers read_committed bị đóng băng partition. Redpanda Streaming khắc phục bằng giao thức compaction phối hợp, sử dụng các cặp offset (MCCO/MTRO, MXFO/MXRO) để đảm bảo tombstones và transaction markers không bị xóa trước khi tất cả replicas xử lý xong. Lỗi này có thể tái hiện trên Kafka phiên bản 3.9 đến 4.2 bằng Docker Compose.

Lập trình viên cần đọc bài này để hiểu cách giải quyết vấn đề lỗi race condition trong log compaction của Kafka, giúp tránh mất dữ liệu và bảo đảm tính nhất quán khi xử lý các trường hợp đồng bộ hóa dữ liệu trên nhiều broker.

Spotlight on WG Device Management

Đề xuất cho bạn

Kafka's log compaction corrupts data. Here's how we fixed it

Prioritizing Recent Messages with Go Channels

The inside scoop on alerting changes in Kubernetes Monitoring

How to Build a Durable, Autoscaling AI Agent with Temporal, Composio, KEDA, and Kubernetes

AI & Kubernetes

How to use traces to avoid breaking changes

Configuration management at Giant Swarm: a historical overview

Grab Builds Secure Agentic AI Workload Platform