ClickHouse0 Hot0 bình luận23 phút đọc3 giờ trước

A Quadrillion Rows across three Clouds: scaling LogHouse

ClickHouse's internal logging platform LogHouse has scaled from 19 PiB to 431 PiB (1.59 quadrillion rows) across 30+ regions on three cloud providers. The post details the architectural decisions behind this 23x growth: geosharding with isolated cells for write scalability, Async Inserts to tame small-write problems and avoid TOO_MANY_PARTS errors, daily vs. monthly partitioning strategies for large tables, an S3-backed OTel pipeline for durability, and a three-level Distributed table hierarchy (local → regional → global) that hides topology from users. The sharding key mechanism using a dictionary sourced from system.clusters enables optimize_skip_unused_shards to prune irrelevant cells, keeping region-filtered queries under 300ms even cross-continent. Peak ingestion reaches 80 GiB/s and 190 million rows/second across 36 cells.

Đọc bài gốc

#observability #data-engineering #distributed-systems #opentelemetry #clickhouse

Nguồn: https://clickhouse.com/blog/a-quadrillion-rows-across-the-three-cloud-scaling-loghouse. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

GitGuardian1 Hot8 phút4 giờ trướcAI

IEEE Cloud Summit 2026: The Tunnels No One Mapped

IEEE Cloud Summit 2026 tập trung vào bảo mật và kiến trúc cho hệ thống AI agent, với những chia sẻ từ Salesforce về agent Kubernetes tự động hóa, AWS giới thiệu bảo mật ngữ cảnh cho agent, cùng công cụ AgentTrace giúp truy vết hành động của agent. Ba vấn đề chính nổi lên là quyền hạn quá mức của các danh tính phi con người, hệ thống xác suất chỉ nên xử lý nhiệm vụ mơ hồ, và khả năng truy xuất nguồn gốc phải là tiêu chuẩn thiết kế bắt buộc cho hệ thống agent.

Lập trình viên nên đọc bài này để hiểu cách ứng dụng kỹ thuật phân tích chính xác, bảo mật context-aware và tra cứu forensics trong các hệ thống AI agent, từ đó nâng cao kiến thức về cách xây dựng và bảo vệ các giải pháp cloud hiện đại, đặc biệt là khi triển khai các ứng dụng tự động hóa có độ tin cậy cao.

A Quadrillion Rows across three Clouds: scaling LogHouse

Đề xuất cho bạn

IEEE Cloud Summit 2026: The Tunnels No One Mapped

AI SDK 7 is now available

Tempo 3.0 release: a new architecture for scale and lower TCO, TraceQL metrics GA, and more

Monitor Laravel Queues, Commands, and Schedulers on Any Driver with Vigilance

We Built a Routing Layer to Cut Our AI Costs. It Broke the Product.

Dapr 1.18 Introduces Verifiable Execution, Bringing Cryptographic Trust to AI Agents and Workflows

Your Foundation Model is a Service. Operate it Like One

The inside scoop on alerting changes in Kubernetes Monitoring