The Next Web00 bình luận6 phút đọc1 ngày trước

Mistral OCR 4: cheap, self-hosted document AI

Mistral has released OCR 4, a document AI model that converts files into structured data with bounding boxes, block type classifications, and per-word confidence scores. Unlike older OCR tools that return flat text, OCR 4 maps the full layout of a document, making it suitable for AI agents that need to act on documents rather than just read them. It supports PDFs, Word, PowerPoint, and OpenDocument files across 170 languages. Pricing starts at $2 per 1,000 pages in batch mode, with a Document AI tier at $5. The model is small enough to run in a single container, enabling on-premises deployment for data-sovereignty-conscious enterprises like banks, hospitals, and governments. It is available via Mistral's studio, Amazon SageMaker, and Microsoft Foundry. Benchmarks show an 85.20 score on OlmOCRBench and a 72% human-judged win rate against rivals, though Mistral cautions the model is not suitable for medical, legal, or high-stakes financial decisions.

Đọc bài gốc

#machine-learning #rag #self-hosting #mistral-ai

Nguồn: https://thenextweb.com/news/mistral-ocr-4-document-ai-self-hosted. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

Towards Data Science130 phút8 giờ trướcAI

Letting an LLM Pick the Right RAG Page: The Arbiter Pattern at the End of Retrieval

Bài viết giới thiệu "Arbiter Pattern" trong RAG, nơi LLM đóng vai trọng tài bằng cách phân loại và đánh giá các nguồn tài liệu ứng viên dựa trên cấu trúc dữ liệu đầu vào, thay thế phương pháp kết hợp điểm số truyền thống. Tác giả nhấn mạnh embeddings nên là phương pháp cuối cùng trong tài liệu doanh nghiệp do hạn chế trong việc xác định sự vắng mặt của thông tin, trong khi keyword retrieval cung cấp khả năng phủ định chắc chắn. Ngoài ra, bài viết đề cập đến bộ chọn phương pháp truy xuất theo loại câu hỏi, lược đồ JSON thống nhất cho kết quả truy xuất nhằm đảm bảo khả năng kiểm tra, và tầm quan trọng của xử lý "không tìm thấy" đáng tin cậy trong ngữ cảnh tuân thủ quy định.

Một lập trình viên cần đọc bài này để tìm hiểu cách tối ưu hóa hệ thống RAG bằng cách áp dụng —một giải pháp linh hoạt hơn fusion score, giúp xử lý các trường hợp phức tạp trong việc lựa chọn kết quả phù hợp từ nhiều nguồn thông tin khác nhau.

Mistral OCR 4: cheap, self-hosted document AI

Đề xuất cho bạn

Letting an LLM Pick the Right RAG Page: The Arbiter Pattern at the End of Retrieval

Knowledge graph RAG: structured retrieval for AI agents

Unlocking the Power of the TPU Stack: Introducing our new Developer Hub

Anchor Detection for RAG: Parallel Detectors, Then One LLM Call at the End

Context Windows Are Not Memory: What AI Agent Developers Need to Understand

Using LlamaIndex for RAG in Python – Real Python

I tried replacing OneDrive with self-hosted storage, and it didn't work out

Network-wide ad blocking with a Raspberry Pi Zero is easier than you think, but the Wi-Fi problem is real