
GenPage: Towards End-to-End Generative Homepage Construction at Netflix

Netflix introduces GenPage, a generative transformer model that replaces the traditional multi-stage homepage recommendation pipeline with a single decoder-only model. Treating user history and request context as a tokenized prompt, GenPage autoregressively generates the full two-dimensional homepage layout — rows and entities together — in real time. The system uses a domain-specific tokenizer for efficiency and product control, a pretraining-then-post-training recipe (either weighted binary classification or GRPO-based RL), and production-specific techniques including semantic embedding fusion for cold start, multi-cadence incremental training, constrained decoding for business rules, and hybrid row decoding for latency. In online A/B tests against a mature production system, GenPage achieved statistically significant gains on the core engagement metric while reducing end-to-end serving latency by 20%. Offline findings show that enriching the prompt context outperforms scaling model capacity in the current regime, and RL post-training increases homepage diversity even without an explicit diversity objective.
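To make the constrained-decoding idea above more concrete, here is a minimal sketch of how a business rule can be enforced during greedy autoregressive generation by masking disallowed tokens before the next one is chosen. It is not Netflix's implementation. The toy vocabulary, the fixed `toy_logits` scores that stand in for a real decoder-only model, and the "no title may appear twice on the page" rule are all hypothetical, chosen only for illustration.

```python
import math
from typing import Callable, List, Sequence

# Hypothetical toy vocabulary (not from the article): three titles and a stop token.
VOCAB: List[str] = ["title_A", "title_B", "title_C", "<eos>"]


def toy_logits(prefix: Sequence[str]) -> List[float]:
    """Stand-in for a real model: returns one fixed score per vocabulary token."""
    return [3.0, 2.0, 1.0, 0.5]


def no_repeat(prefix: Sequence[str], token: str) -> bool:
    """Assumed business rule: a title already on the page cannot be emitted again."""
    return token not in prefix


def allow_all(prefix: Sequence[str], token: str) -> bool:
    """No constraint at all, for comparison."""
    return True


def constrained_greedy_decode(
    logits_fn: Callable[[Sequence[str]], List[float]],
    is_allowed: Callable[[Sequence[str], str], bool],
    max_len: int,
) -> List[str]:
    """Greedy decoding where disallowed tokens are masked to -inf at every step."""
    output: List[str] = []
    for _ in range(max_len):
        logits = logits_fn(output)
        # Mask every token that the rule forbids given what was generated so far.
        masked = [
            score if is_allowed(output, token) else -math.inf
            for token, score in zip(VOCAB, logits)
        ]
        best = max(range(len(VOCAB)), key=lambda i: masked[i])
        if masked[best] == -math.inf:
            break  # nothing is allowed any more
        token = VOCAB[best]
        if token == "<eos>":
            break  # the model chose to stop
        output.append(token)
    return output


unconstrained = constrained_greedy_decode(toy_logits, allow_all, max_len=3)
constrained = constrained_greedy_decode(toy_logits, no_repeat, max_len=10)
print(unconstrained)
print(constrained)
```

Without the mask, the toy model keeps emitting its highest-scoring title. With the mask, each title appears at most once and decoding stops once only the stop token remains. A production system would presumably apply the same kind of mask to real model logits and to far richer rules, but the summary does not give those details.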


#machine-learning #netflix #reinforcement-learning

Source: https://netflixtechblog.com/genpage-towards-end-to-end-generative-homepage-construction-at-netflix-77146fba8a08. 8sync News only summarizes and links to articles; copyright in the content belongs to the author and the original source.
