Medium · 10 min read

Building a Multilingual, Multi-Tenant RAG Engine in Django

A walkthrough of a production-style multilingual, multi-tenant RAG backend built with Django, PostgreSQL/pgvector, and Gemini. Covers three core challenges: tenant data isolation enforced via JWT and tenant-filtered vector queries, multilingual semantic search using the locally hosted intfloat/multilingual-e5-base model, and grounded generation with an explicit 'I don't know' fallback when retrieval returns no relevant results. Key architectural decisions include baking tenant_id into JWT tokens at login, using asymmetric E5 prefixes (passage:/query:) for embedding quality, and treating embedding model selection as an irreversible architectural commitment. The demo runs three tenants seeded from the MKQA multilingual QA benchmark across Spanish, Portuguese, and English.
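The three mechanisms in the summary (tenant-filtered retrieval keyed on the JWT's tenant_id, E5 passage:/query: prefixes, and the "I don't know" fallback) can be sketched together in plain Python. This is a minimal illustration, not the article's code: the `Chunk` type, the `embed` and `generate` callables, the `min_score` threshold, and the `chunks` table named in the SQL comment are all assumptions made for the example, and the in-memory scan stands in for a pgvector query.

```python
import math
from dataclasses import dataclass
from typing import Callable, Sequence

FALLBACK = "I don't know."

# Hypothetical pgvector equivalent of `retrieve` (table and column names are
# assumed; `<=>` is pgvector's cosine-distance operator):
#   SELECT content, 1 - (embedding <=> %(q)s) AS score
#   FROM chunks WHERE tenant_id = %(tenant_id)s
#   ORDER BY embedding <=> %(q)s LIMIT %(k)s;


@dataclass(frozen=True)
class Chunk:
    tenant_id: str
    content: str
    embedding: Sequence[float]


def as_passage(text: str) -> str:
    """E5 models expect stored documents to carry the 'passage: ' prefix."""
    return f"passage: {text}"


def as_query(text: str) -> str:
    """E5 models expect search queries to carry the 'query: ' prefix."""
    return f"query: {text}"


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(chunks, claims, question, embed, k=3, min_score=0.5):
    """Return up to k chunks for the caller's tenant, best match first."""
    # tenant_id comes from the already-verified JWT payload, never from the
    # request body, so a client cannot ask for another tenant's data.
    tenant_id = claims["tenant_id"]
    q = embed(as_query(question))
    scored = [
        (cosine(q, c.embedding), c) for c in chunks if c.tenant_id == tenant_id
    ]
    # min_score is an assumed relevance cutoff; tune it for the real model.
    relevant = [(s, c) for s, c in scored if s >= min_score]
    relevant.sort(key=lambda pair: pair[0], reverse=True)
    return [c for _, c in relevant[:k]]


def answer(chunks, claims, question, embed: Callable, generate: Callable) -> str:
    """Ground the answer in retrieved text, or decline when nothing matches."""
    hits = retrieve(chunks, claims, question, embed)
    if not hits:
        return FALLBACK  # skip the LLM call entirely when there is no context
    context = "\n".join(c.content for c in hits)
    return generate(question, context)
```

In a real deployment `embed` would wrap the locally hosted intfloat/multilingual-e5-base model and `generate` would call Gemini with the retrieved context. Because stored vectors are produced with one specific model and prefix scheme, switching embedding models later means re-embedding every chunk, which is why the summary calls that choice an irreversible commitment.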

#django #rag #multi-tenancy #pgvector

Source: https://medium.com/@saymmalik08/building-a-multilingual-multi-tenant-rag-engine-in-django-c975aecf599a. 8sync News only summarizes and links to the article; copyright in the content belongs to the author and the original source.
