John O'Reilly0 Hot0 bình luận5 phút đọc3 giờ trước

Adding embeddings/RAG support to the Koog-based AI agent in Confetti

A walkthrough of adding semantic search to the Confetti Compose Multiplatform app using Koog's embeddings and RAG modules. The implementation embeds conference session data using Gemini, stores vectors in a persistent backend via Okio, and exposes a SearchSessionsTool to a Koog AIAgent so it can answer topic-style queries (e.g. 'what AI talks are on?') even when session titles use different wording. Two separate indexes (title-only and title+description) are maintained to improve ranking, and platform-specific cache directories are wired through Koin DI.

Đọc bài gốc

#kotlin #rag #google-gemini #embeddings

Nguồn: https://johnoreilly.dev/posts/confetti-koog-rag. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

Medium1 Hot4 phút12 giờ trướcAI

LangChain is great for prototypes. Here’s why I didn’t use it in production.

Một nhà phát triển xây dựng pipeline RAG cho trợ lý di trú chia sẻ lý do không dùng LangChain trong sản xuất vì các lớp trừu tượng của nó che giấu những quyết định quan trọng về chunking, chất lượng truy xuất và cấu trúc tài liệu. Việc xây dựng từ đầu với ChromaDB, pdfplumber và Groq API giúp kiểm soát toàn bộ code, dễ dàng gỡ lỗi và đưa ra quyết định thiết kế có ý nghĩa. LangChain vẫn phù hợp để tạo nguyên mẫu, nhưng tác giả khuyên nên tự xây dựng ít nhất một lần để hiểu những gì framework đang trừu tượng hóa.

Lập trình viên nên đọc bài này để hiểu cách LangChain có thể làm giảm bớt trách nhiệm thiết kế chi tiết trong pipeline AI như xử lý đoạn văn, tìm kiếm dữ liệu và cấu trúc tài liệu, nhưng khi chuyển sang sản phẩm thực tế, sự kiểm soát trực tiếp từ code gốc sẽ giúp tránh những lỗi khó debug và tối ưu hóa hiệu suất.

Adding embeddings/RAG support to the Koog-based AI agent in Confetti

Đề xuất cho bạn

LangChain is great for prototypes. Here’s why I didn’t use it in production.

For the First Time, Zero Confabulation Is Reproducible on Any AI: Open Sourcing ConteX Law

A How-To Guide On Fine-Tuning

How to Build a RAG Q&A AI Agent for Your Documents Using LangChain v1

The Untaught Lessons of RAG Retrieval: Cosine Is Not the Foundation

Context vs. Memory Engineering in Agentic AI Systems

Three Years of Building Agents in Production (Part 1)

Kotlin Comes to BlueJ