#lora

The latest programming news about lora, summarized in Vietnamese by AI.

JetBrains · 22 min · 11 hours ago

Our Research on Membership Inference Attacks and Preventing Privacy Leaks

JetBrains researchers present EZ MIA (Error Zone Membership Inference Attack), a lightweight method for detecting whether specific data was used to train fine-tuned LLMs. Unlike existing approaches that rely on aggregate sequence loss or expensive shadow model training, EZ MIA focuses on token-level error positions where memorization signals are most concentrated, requiring only two forward passes per sequence. Experiments on GPT-2, GPT-2-XL, and Llama-2 show EZ MIA outperforms baselines like LOSS, Min-K++, and SPV-MIA by up to 9x. The research also confirms that full fine-tuning creates significantly more membership leakage than LoRA-based fine-tuning, though LoRA does not eliminate the risk entirely — especially for larger models.
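To make the error-position idea concrete, here is a minimal sketch. It is not the paper's scoring function. It assumes the two forward passes are one through the fine-tuned target model and one through a reference model (the summary does not say this), and that each pass yields the probability assigned to the true next token at every position. The name `error_zone_score` and its inputs are hypothetical.

```python
from typing import Sequence


def error_zone_score(
    target_true_prob: Sequence[float],
    reference_true_prob: Sequence[float],
    target_is_error: Sequence[bool],
) -> float:
    """Average gain in true-token probability over the reference model,
    measured only at positions where the target model's top prediction
    was wrong (the "error positions").

    Illustrative assumption: this averaging rule is a stand-in, not the
    scoring rule used by EZ MIA.
    """
    if not (len(target_true_prob) == len(reference_true_prob) == len(target_is_error)):
        raise ValueError("all inputs must have one entry per token position")

    # Keep only the positions the target model got wrong.
    gains = [
        t - r
        for t, r, is_error in zip(target_true_prob, reference_true_prob, target_is_error)
        if is_error
    ]
    if not gains:
        return 0.0  # no error positions, so no signal under this sketch
    return sum(gains) / len(gains)


# Toy per-token values; in practice these would come from the two forward passes.
score = error_zone_score(
    target_true_prob=[0.9, 0.4, 0.3],
    reference_true_prob=[0.8, 0.1, 0.2],
    target_is_error=[False, True, True],
)
```

Under this sketch a larger score would hint that the sequence was in the fine-tuning data. Any decision threshold would have to be calibrated on sequences with known membership.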

#llm #deep-learning #privacy

Source

freeCodeCamp · 17 min · 1 day ago

How to Teach a Small LLM to Suggest K12 Creative Project Ideas

A hands-on walkthrough of fine-tuning a small LLM (Qwen2.5-1.5B) to suggest culturally grounded K12 STEAM project ideas. Covers the full pipeline: scraping and filtering ~19,000 Wikipedia articles, generating structured JSON training pairs using a local Qwen2.5-7B model via Ollama, fine-tuning with LoRA on Apple Silicon using MLX, evaluating on a held-out test set, adding RAG with vector embeddings to improve cultural accuracy, and integrating the model into a TypeScript educational app. Also addresses child-safety guardrails through both data filtering and runtime screening.
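The RAG step in a pipeline like this comes down to finding the stored passages whose embeddings are closest to the query embedding. Below is a minimal sketch of that retrieval using cosine similarity. The vectors are hand-written toy values standing in for a real embedding model, and the names `cosine_similarity` and `retrieve_top_k` are hypothetical. The embedding model and vector store the article actually uses are not stated in this summary.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve_top_k(
    query_vec: Sequence[float],
    documents: List[Tuple[str, Sequence[float]]],
    k: int = 2,
) -> List[str]:
    """Return the texts of the k documents most similar to the query."""
    ranked = sorted(
        documents,
        key=lambda doc: cosine_similarity(query_vec, doc[1]),
        reverse=True,
    )
    return [text for text, _ in ranked[:k]]


# Toy 2-D "embeddings"; a real system would use an embedding model's output.
docs = [
    ("passage a", [1.0, 0.0]),
    ("passage b", [0.0, 1.0]),
    ("passage c", [1.0, 1.0]),
]
top = retrieve_top_k([1.0, 0.1], docs, k=2)
```

The retrieved passages would then be placed into the prompt ahead of the user's request, so the fine-tuned model can ground its suggestion in them.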

#python #rag #ollama

Source