Quesma00 bình luận7 phút đọc3 giờ trước

Qwen 3.6 27B is the sweet spot for local development

Qwen 3.6 27B is presented as the first local model worth using as a general-purpose coding assistant. The dense 27B variant is recommended over the faster mixture-of-experts 35B A3B for its higher quality output. Setup involves running llama.cpp with an 8-bit quantized GGUF from Hugging Face, with multi-token prediction enabled for speed. On a MacBook Max M5 with 128GB RAM, it achieves ~30 tokens/second; Nvidia RTX 5090 users report ~50 tokens/second. The model integrates with AI coding agents like OpenCode via a simple config pointing to the local llama-server endpoint. Benchmarks from Artificial Analysis show it competitive with frontier models, outperforming Gemma 4 31B for local coding. The post also touches on the broader trend toward viable local models, mentioning GLM 5.2 as a new frontier-level open-weight option.

Đọc bài gốc

#local-ai #qwen #llama-cpp #opencode

Nguồn: https://quesma.com/blog/qwen-36-is-awesome. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

Sebastian Raschka132 phút2 ngày trướcAI

Using Local Coding Agents

Hướng dẫn chi tiết cách thiết lập một hệ thống coding agent hoàn toàn cục bộ bằng các mô hình ngôn ngữ mã nguồn mở (LLM) như Qwen3.6 35B-A3B thông qua Ollama, thay thế các dịch vụ độc quyền như Claude Code hay Codex. Bài viết bao gồm kết nối với ba harness (Qwen-Code, Codex CLI, Claude Code), đánh giá hiệu suất, kiểm tra bảo mật, cấu hình quyền riêng tư, so sánh token usage, thiết lập SSH tunnel giữa máy Mac và DGX Spark, cùng kết quả benchmark cho thấy Qwen3.6 và North Mini Code vượt trội hơn Gemma 4 E2B trong các tác vụ sử dụng công cụ.

Nếu bạn muốn tự chủ hóa công cụ AI hỗ trợ lập trình, tránh phụ thuộc vào các dịch vụ cloud đắt tiền và có rủi ro về quyền riêng tư, bài hướng dẫn này sẽ giúp bạn xây dựng một hệ sinh thái mã nguồn mở hoàn toàn trên máy tính cá nhân của mình, tối ưu hóa hiệu suất và bảo mật.

#ai-agents

Qwen 3.6 27B is the sweet spot for local development

Đề xuất cho bạn

Using Local Coding Agents

I tried PewDiePie's open-source AI workspace, and it's weirdly great

My 7-year-old GPU runs local AI perfectly, and I don't need my cloud subscriptions anymore

Gemma 4's smallest model runs on 3GB of VRAM, and it's the one I actually reach for

I switched my local LLM setup to Ollama's new MLX engine, and my Mac suddenly feels twice as fast

I almost upgraded my GPU to run larger local LLMs, but this 8B model proved I didn't have to

I paired a local LLM with Frigate and Home Assistant, and my smart cameras finally understand what they are looking at

Untitled