Red Hat Developer00 bình luận10 phút đọc2 ngày trước

Connect EvalHub to protected production model servers

Connecting EvalHub to protected production model servers requires different authentication strategies depending on the model type. Three patterns are covered: (1) ServiceAccount tokens for internal OpenShift AI models using RBAC RoleBindings with no secrets needed, (2) API keys for external models like OpenAI stored as Kubernetes secrets, and (3) combining API keys with custom CA certificates for self-hosted models behind private TLS. The guide includes concrete kubectl commands, job configuration JSON, troubleshooting steps for common errors like HTTP 401 and SSL failures, and a real-world scenario evaluating three different model types simultaneously.

Đọc bài gốc

#kubernetes #llm #authentication

Nguồn: https://developers.redhat.com/articles/2026/06/23/connect-evalhub-protected-production-model-servers. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

Vercel11 phút11 giờ trướcAI

Vercel Flags no longer requires SDK Keys for Vercel deployments

Vercel Flags giờ đây tự động xác thực thông qua OIDC tokens ngắn hạn mà không cần SDK Keys hay biến môi trường FLAGS cho các triển khai trên Vercel. Chỉ cần vercel link và vercel env pull là đủ cho phát triển local, trong khi các dự án cũ vẫn giữ nguyên yêu cầu SDK Keys cho các trường hợp đặc biệt.

Lập trình viên cần đọc bài này để hiểu cách tối ưu hóa quản lý tính năng động (flags) trong dự án Vercel mới nhất, giảm thiểu rủi ro về bảo mật khi sử dụng SDK Keys và khám phá giải pháp tự động hóa cho phát triển và triển khai.

Connect EvalHub to protected production model servers

Đề xuất cho bạn

Vercel Flags no longer requires SDK Keys for Vercel deployments

Letting an LLM Pick the Right RAG Page: The Arbiter Pattern at the End of Retrieval

Prioritizing Recent Messages with Go Channels

The Mirror You Trained

Anthropic’s Mythos found flaws in classified US systems during a government test

Don’t Let the Model Grade its Own Homework

My 7-year-old GPU runs local AI perfectly, and I don't need my cloud subscriptions anymore

Sub-agents: splitting context across specialized AI agents