XDA Developers0 Hot0 bình luận8 phút đọc3 giờ trước

Gemini Omni does almost everything, but there's one task I still won't trust it with

A hands-on personal take on Google's Gemini Omni Flash video generation model, covering its capabilities, limitations, and ethical concerns. The author tests Omni for motion graphics work, finding it visually impressive but unreliable for precise element control, capped at 720p, and prone to context degradation after a few editing turns. The author draws firm personal lines: no use for professional design or motion graphics work (where the creative process and decision-making matter more than output), and no generating real people's likenesses due to consent and safety concerns. Also raises ethical issues around YouTube creator data being used to train Omni without meaningful opt-out options.

Đọc bài gốc

#genai #google-gemini #ethical-ai #video-generation

Nguồn: https://www.xda-developers.com/gemini-omni-does-almost-everything-but-theres-one-task-i-still-wont-trust-it-with. 8sync News chỉ tóm tắt và dẫn link; bản quyền nội dung thuộc tác giả và nguồn gốc.

Đề xuất cho bạn

Charity7 Hot17 phút11 ngày trướcAI

Make AI Boring Again (xpost)

Charity Majors cho rằng AI không phải là công nghệ độc ác đặc biệt mà chỉ là công cụ, và các kỹ sư công nghệ có trách nhiệm đạo đức tham gia vào thay vì từ bỏ vì "sự trong sạch". Bà chỉ ra những tác hại thực tế (khai thác dữ liệu huấn luyện, tiêu thụ năng lượng, lao động, tập trung quyền lực) nhưng nhấn mạnh nhận thức về hại nên thúc đẩy cải tiến chứ không phải từ bỏ. Bà phê phán xu hướng "thuần khiết biểu diễn" là vô hiệu và tự cao, đồng thời kêu gọi học sâu về AI, thảo luận thẳng thắn nơi làm việc, thúc đẩy trách nhiệm giải trình và tham gia xây dựng công cụ này thay vì rời bỏ.

Lập trình viên nên đọc bài này để hiểu cách chuyển đổi sự lo ngại về AI từ sự phản đối bề ngoài sang hành động thực sự xây dựng giải pháp trách nhiệm, thay vì chỉ ngồi trong tư tưởng "tránh xa" mà không đóng góp vào việc định hình tương lai công nghệ.

Gemini Omni does almost everything, but there's one task I still won't trust it with

Đề xuất cho bạn

Make AI Boring Again (xpost)

Three Years of Building Agents in Production (Part 1)

Germany’s AI rollout is being sold as a fix for its worker shortage

Gemini 3.5 Flash can now see and control your screen, and Google wants enterprises to trust it

Toward More Controllable AI Video Editing: An Early Research Exploration at Netflix

Theta Labs Launches AI Gaming Services and AI Characters

Goodbye, forever, probably.

Adding embeddings/RAG support to the Koog-based AI agent in Confetti