XDA Developers00 bình luận5 phút đọc2 giờ trước

I quantized a local LLM on my home server and ditched cloud AI for smart home control entirely

A home lab enthusiast built a fully local, cloud-free smart home setup by running quantized LLMs (Gemma-4-26B and GPT-OSS-20B) on a Proxmox LXC with a decade-old gaming GPU via llama.cpp. Home Assistant is connected to the LLMs through the Home Agent HACS integration, with Whisper for speech-to-text and Piper for text-to-speech. An old Android tablet acts as a voice satellite for wake word detection. The setup achieves ~15-second response times and extends to other self-hosted services like Nextcloud and TrueNAS via MCP servers, eliminating reliance on any cloud AI service.

Đọc bài gốc

#tech-news #data-science #llama-cpp

Nguồn: https://www.xda-developers.com/i-hosted-a-local-llm-and-ditched-cloud-ai-for-smart-home-control. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

XDA Developers234 phút11 ngày trướcAI

Smart plugs are replacing smart appliances, and your decade-old washing machine just became future-proof

Smart plug (Zigbee) giá rẻ (~$15) thay thế smart appliance nhờ ưu điểm tiết kiệm chi phí, tránh lệ thuộc cloud, kéo dài tuổi thọ thiết bị và giảm rác thải điện tử. Chúng theo dõi dòng điện, kích hoạt tự động hóa (Home Assistant) như thông báo kết thúc chu trình, tính toán chi phí năng lượng hay ngắt an toàn mà không cần internet.

Lập trình viên nên đọc bài này để hiểu cách xây dựng hệ thống nhà thông minh tự động hóa hiệu quả bằng cách kết hợp các thiết bị cơ bản với các công cụ mở nguồn như Home Assistant, giảm chi phí và tránh phụ thuộc vào dịch vụ đám mây đắt tiền.

I quantized a local LLM on my home server and ditched cloud AI for smart home control entirely

Đề xuất cho bạn

Smart plugs are replacing smart appliances, and your decade-old washing machine just became future-proof

My 7-year-old GPU runs local AI perfectly, and I don't need my cloud subscriptions anymore

Which programming language to use for coding interviews

Google Consent Mode v2

The Busy Bar is an open-source productivity tool that comes from the Flipper Zero team

You, too, can build this open-source smart home keyboard that works with Home Assistant

Cloudflare Analytics Engine: store and query metrics

Gemma 4's smallest model runs on 3GB of VRAM, and it's the one I actually reach for