EvoDefense: Co-Evolving Black-Box Defense with Large Language Models
EvoDefense addresses a critical vulnerability in LLM deployment: black-box adversarial robustness without access to model internals. The system pairs a guard LLM with an experience memory layer that learns from attack patterns, then runs continuous co-evolution cycles where attack and defense strategies refine each other. This shifts LLM security from static rule-based filtering to adaptive, learned defenses that generalize across unseen attack types and architectures. The approach matters because production LLMs often sit behind API boundaries where defenders lack transparency, making adaptive guardrails a practical necessity for real-world safety.62

























