Posts
- 02 - Trellis -- Interactive Preference Steering for LLMs On building an app where you can "play" Reinforcement Learning as though you were taking an internet quiz. 2025-12-28
- 01 - CoT Research Log Some (failed) attempts to teach a small language model to think weird thoughts. 2025-12-12