XDA Developers0 Hot0 bình luận4 phút đọc2 giờ trước

I built Andrej Karpathy's "LLM Council" on my own hardware, and now no single model gets the last word

A developer rebuilt Andrej Karpathy's LLM Council concept on local hardware using an RTX 4070 Ti, running DeepSeek-R1 8B, Qwen 3.5 9B, and Gemma 4 E4B via Ollama. The three-stage process has each model generate answers independently, then anonymously review and rank each other's responses, before a designated chairman model (Gemma) synthesizes a final answer. The experiment revealed that synthesis is a distinct skill from generation or self-grading, and that the council approach is best reserved for complex queries where a second opinion matters — not as a daily driver due to its heavier GPU load and slower response times.

Đọc bài gốc

#ai-agents #self-hosting #ollama #local-ai

Nguồn: https://www.xda-developers.com/built-andrej-karpathys-llm-council-no-single-model-gets-last-word. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

Elena's Growth Scoop2 Hot12 phút6 giờ trướcAI

Please stop the AI Confidence Theater

Bài viết chỉ trích "AI Confidence Theater" – xu hướng thổi phồng khả năng và quy trình AI trên mạng xã hội lẫn trong doanh nghiệp, gây hại bằng cách bóp méo kỳ vọng, tạo FOMO, khó khăn trong tuyển dụng và áp lực giả vờ thành thạo AI. Tác giả đề xuất thay đổi bằng cách chia sẻ kết quả thực tế, thừa nhận giới hạn và tập trung vào công việc duy trì hệ thống AI vốn ít hào nhoáng nhưng mang lại giá trị thực.

Nếu bạn đang tìm hiểu về cách xây dựng dự án AI thực tế và tránh bị lừa bởi hype không có cơ sở, bài viết này giúp bạn phân biệt giữa tuyên bố hype và kiến thức thực sự để đưa ra quyết định sáng suốt về việc đầu tư thời gian và nguồn lực.

#ai

I built Andrej Karpathy's "LLM Council" on my own hardware, and now no single model gets the last word

Đề xuất cho bạn

Please stop the AI Confidence Theater

Is your site ready for AI agents? Lighthouse now has an answer

Built for Mass Scale: Hard-Won Lessons from Teams Running High Volume Inference Workloads in Production

Codex vs Claude Code: Which AI Coding Assistant to Choose

Tigera Introduces Lynx, a Unified Control Plane for Kubernetes‑Native AI Agents

Anthropic launches Claude Sonnet 5 as a cheaper way to run agents

How We Built DEmate: Taming LLMs for Data Engineering at Meta

I gave a local LLM full control over my Proxmox node, and it worked better than I expected