
RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents
RecMem introduces a lazy consolidation strategy for long-running LLM agents, deferring memory extraction until patterns emerge rather than processing every interaction. By routing routine exchanges through lightweight embeddings and invoking LLMs only when semantic recurrence signals meaningful learning, the approach cuts token overhead substantially while maintaining retrieval fidelity. This addresses a core scaling bottleneck for production agents operating over extended horizons, where naive eager consolidation becomes prohibitively expensive. The insight that memory work should cluster around genuine novelty rather than raw volume could reshape how teams architect stateful AI systems.62























