ITNEXT00 bình luận12 phút đọc1 giờ trước

What `os.cpu_count()` Gets Wrong in a CPU-Limited Kubernetes Pod

When Python's os.cpu_count() runs inside a CPU-limited Kubernetes pod, it reports the node's full CPU count rather than the cgroup-enforced quota. A pod with a 500m CPU limit on a 20-core node returns 20 from os.cpu_count(), while the actual cgroup budget is 0.5 CPU. This mismatch causes Gunicorn worker formulas like workers = multiprocessing.cpu_count() * 2 + 1 to spawn 41 workers against a half-CPU budget. Benchmarks show 1 worker completed 101 requests at ~1s median latency, while 14 workers completed only 46 requests at ~6.4s median due to heavy CPU throttling (83% of cgroup periods throttled). The fix is to read the actual quota from /sys/fs/cgroup/cpu.max (cgroup v2) or CFS bandwidth files (cgroup v1) before sizing workers, with a provided Python helper function that handles both cgroup versions and falls back to WEB_CONCURRENCY as an override.

Đọc bài gốc

#python #kubernetes

Nguồn: https://itnext.io/what-os-cpu-count-gets-wrong-in-a-cpu-limited-kubernetes-pod-ab57756a9153. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

freeCodeCamp210 phút15 giờ trướcAI

How to Build a Personal AI Web Research Agent with Ollama and Qwen

Hướng dẫn từng bước xây dựng một agent nghiên cứu web AI cục bộ bằng Ollama, mô hình Qwen3.5:4b và Python. Agent này nhận lệnh nghiên cứu, tìm kiếm 5 kết quả web hàng đầu qua API tìm kiếm web của Ollama, trích xuất văn bản bằng BeautifulSoup, sau đó tóm tắt bằng mô hình Qwen chạy cục bộ. Kết quả được lưu dưới dạng file Markdown có dấu thời gian, hoạt động hoàn toàn trên thiết bị mà không tốn phí API hay xâm phạm quyền riêng tư.

Lập trình viên muốn tự động hóa công việc nghiên cứu web một cách hiệu quả, tiết kiệm chi phí và bảo mật dữ liệu cá nhân nên đọc bài này để xây dựng một hệ thống AI cá nhân hoạt động trên thiết bị riêng của mình.

#python

What `os.cpu_count()` Gets Wrong in a CPU-Limited Kubernetes Pod

Đề xuất cho bạn

How to Build a Personal AI Web Research Agent with Ollama and Qwen

From Python to Rust: Master Iterators by Rebuilding 10 Unix Tools

There Is No Magic: An AI Agent in 60 Lines of Python

Prioritizing Recent Messages with Go Channels

The inside scoop on alerting changes in Kubernetes Monitoring

How to Build a Durable, Autoscaling AI Agent with Temporal, Composio, KEDA, and Kubernetes

Using LlamaIndex for RAG in Python – Real Python

AI & Kubernetes