Research·arXiv cs.LG·May 19

When Critics Disagree: Adaptive Reward Poisoning Attacks in RIS-Aided Wireless Control System

Researchers have identified a novel attack vector against reinforcement learning systems in wireless networks by exploiting disagreement between dual critic networks in Soft Actor-Critic agents. The Disagreement-Guided Reward Poisoning attack targets high-uncertainty decision points where the two critics diverge, corrupting reward signals to push learned policies toward suboptimal behavior. This work exposes a structural vulnerability in a widely-used RL architecture when deployed in safety-critical control domains like spectrum management and RIS-assisted communications, raising questions about the robustness of actor-critic methods in adversarial wireless environments.

Modelwire context

Explainer

The paper's key contribution isn't just that SAC agents can be attacked, but that attackers can systematically identify which decisions to poison by watching where the two critic networks disagree. This disagreement-guided targeting makes the attack far more efficient than random reward corruption.

This connects to the contextual bandit work from the same day, which showed that adaptive sampling strategies outperform passive approaches. Here, the attacker is doing something similar: adaptively choosing high-uncertainty points rather than poisoning uniformly. Both papers highlight how uncertainty and disagreement create exploitable structure in learning systems. The difference is intent: one paper optimizes for better learning, this one weaponizes the same principle against it.

If follow-up work demonstrates that adding a third critic or using ensemble disagreement thresholds actually prevents this attack in simulation, that validates the diagnosis. If instead attackers can still succeed with minimal overhead against hardened variants, the vulnerability runs deeper than the architecture choice.

Coverage we drew on

Active Context Selection Improves Simple Regret in Contextual Bandits · arXiv cs.LG

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsSoft Actor-Critic · Reconfigurable Intelligent Surfaces · Cognitive Radio Network · Disagreement-Guided Reward Poisoning

Read full story at arXiv cs.LG →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.