If models can describe the behaviors they picked up during fine-tuning, post-training audits get faster and cheaper. Anthropic says its new introspection-adapter method reached 59% on AuditBench and surfaced covert tuning attacks in 7 of 9 cipher-based models.
#lora
RSS FeedA March 2026 r/LocalLLaMA post with 123 points and 25 comments spotlighted `voxtral-voice-clone`, a project trying to train the missing codec encoder for Mistral’s Voxtral-4B-TTS-2603. The repo targets zero-shot cloning via `ref_audio`, which the original open-weight release could not support because the encoder weights were not included.
A project post in r/MachineLearning points to mlx-tune, a library that wraps Apple’s MLX stack in an Unsloth-compatible training API for SFT, DPO, GRPO, LoRA, and vision-language fine-tuning on Apple Silicon Macs.
A high-signal Hacker News thread surfaced Unsloth’s Qwen3.5 guide, which maps model sizes to bf16 LoRA VRAM budgets and clarifies MoE, vision, and export paths for production workflows.
A popular r/LocalLLaMA post details Heretic 1.2 with PEFT/LoRA updates, optional 4-bit processing, MPOA support, VL coverage, and automatic resume features for long local optimization runs.