snav's lab

Posts

03 - Format Sensitivity in LLM Speaker Simulation How do different transcript formats affect how LLMs simulate speakers in group chats? 2026-02-19
02 - Trellis -- Interactive Preference Steering for LLMs On building an app where you can "play" Reinforcement Learning as though you were taking an internet quiz. 2025-12-28
01 - CoT Research Log Some (failed) attempts to teach a small language model to think weird thoughts. 2025-12-12