TopPaper
15 January 2026
Today's Paper

The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Christina Lu, Jack Gallagher, Jonathan Michala, Kyle Fish, Jack Lindsey • Arxiv

Large language models (LLMs) are post-trained to adopt a default "helpful Assistant" persona, but this identity is fragile. The paper explores the internal "persona space" in model activations, discovering a dominant linear direction called the Assistant Axis the primary axis of variation across hundreds of character archetypes (roles like "therapist" or "jester," traits like "conscientious" or "flippant").

View Full Abstract View Full Paper DOI: 10.48550/arXiv.2601.10387 Share