Google Cloud0 Hot0 bình luận7 phút đọc2 giờ trước

Boost Performance and Lower Costs with AlloyDB AI Functions

AlloyDB now offers new AI functions and major performance improvements for running LLM calls directly within SQL queries. Three new functions are introduced: ai.summarize, ai.agg_summarize, and ai.analyze_sentiment, joining the existing GA functions (ai.generate, ai.rank, ai.if, ai.forecast). Two key performance breakthroughs are highlighted: Smart Batching, which intelligently groups AI function calls to reduce prompt overhead and achieve up to 2,400x throughput improvement; and Optimized AI Functions, which train a lightweight proxy model on your data to process decisions locally, achieving up to 100,000 rows/sec (23,000x improvement) and 6,000x cost reduction. Both features are currently in preview for ai.if and ai.rank.

Đọc bài gốc

#llm #backend #gcp #vector-search #alloydb

Nguồn: https://cloud.google.com/blog/products/databases/boost-performance-and-lower-costs-with-alloydb-ai-functions. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

The Next Web1 Hot4 phút4 giờ trướcAI

Meta's Brain2Qwerty reads typed sentences from the brain

Meta vừa công bố phiên bản 2 của hệ thống Brain2Qwerty, sử dụng máy quét MEG không xâm lấn để giải mã các câu văn bản từ hoạt động não bộ. Hệ thống đạt độ chính xác 61% cho từng từ (tối đa 78% ở người tham gia tốt nhất), vượt trội so với các hệ thống không xâm lấn trước đây chỉ đạt vài phần trăm. Mặc dù sử dụng pipeline LLM tương tự ChatGPT để tái tạo câu từ tín hiệu não nhiễu, hệ thống vẫn còn hạn chế lớn như thiết bị cồng kềnh, không hoạt động theo thời gian thực và yêu cầu người dùng phải gõ bàn phím để huấn luyện. Các phương pháp xâm lấn vẫn dẫn đầu về độ chính xác với 92% cho toàn bộ câu.

Lập trình viên nên đọc bài này để hiểu cách kết hợp mô hình ngôn ngữ lớn (LLM) và giải mã não bộ để tạo ra hệ thống mới trong lĩnh vực AI não-giao tiếp, giúp mở rộng ứng dụng của trí tuệ nhân tạo trong y tế và tương tác người-máy.

Boost Performance and Lower Costs with AlloyDB AI Functions

Đề xuất cho bạn

Meta's Brain2Qwerty reads typed sentences from the brain

Recommendations When Using LLM-backed Generative AI Systems for FOSS Contributions

Mastering Agentic Techniques: AI Agent Reinforcement Learning

ML Development in VS Code with Google Cloud Power: Workbench Extension Now Available

Anthropic launches Claude Sonnet 5 as a cheaper way to run agents

Most MCP servers don't need to exist. Your case might be an exception.—Martian Chronicles, Evil Martians’ team blog

The many journeys of learning Rust

Amazon SageMaker AI now supports serverless model customization for Gemma 4 models