ITNEXT · 11 comments · 54 min read · 2 hours ago

GitOps for 15,000+ Clusters: What Large-Scale Testing with vCluster Taught Us

A detailed experience report from 31 iterations of large-scale GitOps fleet management testing using Argo CD, vCluster, Sveltos, and the open-source kubara framework on STACKIT Kubernetes Engine. Key findings: Argo CD's application controller hits OOM kills at around 15k–20k cached objects per hub regardless of tuning (DRY vs WET manifests, sharding algorithms, processor counts). The root cause is that object count, not cluster or application count, drives memory usage non-linearly because of per-cluster caches, diffs, and live state. The Sveltos addon controller handled the same workload at roughly 2 GB RAM versus 21 GB for Argo CD, and deployed 1,000 applications across 250 vClusters in 35 minutes with sharding (17 minutes in WET/pull mode). Centralized agent mode (Mode 2) was fastest at 13–16 minutes for 1,000 apps. The main architectural lesson: at very large scale (1,000+ clusters, 5,000+ real-world applications), a single Argo CD hub is not the right tool, and architecture choices matter more than tuning.
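To put the reported durations side by side, the sketch below converts them into average throughput. It uses only the figures quoted in the summary (1,000 applications; 35, 17, and 13–16 minutes; roughly 2 GB vs 21 GB). The `run` type, the `appsPerMinute` helper, and the run labels are illustrative names, not taken from the original article, and the averages assume a constant deployment rate over each run, which the summary does not confirm.

```go
package main

import "fmt"

// run pairs a label paraphrased from the summary with its reported duration.
// The type and labels are illustrative, not part of the original report.
type run struct {
	label   string
	minutes float64
}

// appsPerMinute returns the average number of applications deployed per minute,
// assuming a constant rate over the whole run.
func appsPerMinute(apps int, minutes float64) float64 {
	return float64(apps) / minutes
}

func main() {
	const apps = 1000 // applications deployed in each reported run

	runs := []run{
		{"Sveltos with sharding", 35},
		{"Sveltos, WET/pull mode", 17},
		{"Mode 2, slower bound", 16},
		{"Mode 2, faster bound", 13},
	}

	for _, r := range runs {
		fmt.Printf("%-24s %5.1f apps/min\n", r.label, appsPerMinute(apps, r.minutes))
	}

	// Approximate controller memory ratio from the summary: 21 GB vs 2 GB.
	fmt.Printf("Argo CD vs Sveltos memory: about %.1fx\n", 21.0/2.0)
}
```

Read this way, the gap between the slowest and fastest reported configuration is a bit under 3x in throughput, while the memory gap between the two controllers is about 10x, which fits the summary's point that architecture matters more than tuning.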


#kubernetes #gitops #argocd

Source: https://itnext.io/gitops-for-15-000-clusters-what-large-scale-testing-with-vcluster-taught-us-41e4b0d43e0b. 8sync News only summarizes and links to articles; copyright in the content belongs to the author and the original source.

Recommended for you

Blain Smith · 17 min · 3 days ago · AI

Prioritizing Recent Messages with Go Channels

When a system only cares about the latest value, the default blocking behavior of Go channels becomes a limitation. The article presents two solutions: a non-blocking send using select/default (the value is dropped when the buffer is full; safe with multiple producers), and draining the buffer before sending (the consumer is guaranteed to receive the newest data, but this requires a single producer). The examples include ASCII diagrams illustrating the trade-offs of each approach.

Developers should read this article to learn how to handle Go channels effectively when only the latest information needs to be kept, how to avoid stale data lingering in the buffer, and how to choose the approach that fits each specific use case.
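The two approaches described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from the linked article: the function names `trySend` and `sendLatest` are hypothetical, the channels carry `int` for simplicity, and `sendLatest` assumes a buffered channel (capacity of at least 1) with exactly one producer.

```go
package main

import "fmt"

// trySend performs a non-blocking send using select/default.
// It reports false when the buffer is full and the value was dropped.
// Safe to call from multiple producers.
func trySend(ch chan<- int, v int) bool {
	select {
	case ch <- v:
		return true
	default:
		return false // buffer full: drop the new value instead of blocking
	}
}

// sendLatest drains any buffered (stale) values, then sends v.
// Assumption: ch is buffered and this goroutine is the only producer,
// so the final send cannot block once the buffer has been emptied.
func sendLatest(ch chan int, v int) {
	for {
		select {
		case <-ch:
			// Discard a stale value and keep draining.
		default:
			ch <- v
			return
		}
	}
}

func main() {
	// Approach 1: the newest value is dropped when the buffer is full.
	dropNew := make(chan int, 1)
	fmt.Println(trySend(dropNew, 1)) // true
	fmt.Println(trySend(dropNew, 2)) // false
	fmt.Println(<-dropNew)           // 1

	// Approach 2: stale values are discarded so the newest one wins.
	keepNew := make(chan int, 1)
	sendLatest(keepNew, 1)
	sendLatest(keepNew, 2)
	fmt.Println(<-keepNew) // 2
}
```

The trade-off mirrors the summary: `trySend` never blocks and tolerates many producers, but the consumer may read an older value, whereas `sendLatest` always leaves the newest value in the buffer at the cost of requiring a single producer.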

