
It's Not Always Sycophancy: Measuring LLM Conformity as a Function of Epistemic Uncertainty
Researchers challenge the prevailing narrative that LLM conformity stems purely from sycophancy baked in during RLHF training. The MUSE framework reveals that models' real-time epistemic uncertainty plays an equally significant role in whether they abandon initial positions under user pressure. This distinction matters for safety and alignment work: if uncertainty drives capitulation as much as learned obsequiousness, mitigation strategies must target both calibration and training dynamics rather than sycophancy alone. The finding reshapes how teams should think about model robustness and consistency in adversarial or high-stakes settings.62























