MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 7.0 Download Report PDF

Agent MemoryLatent ReasoningLLM Agent

Agent memory shapes how Large Language Model (LLM)-powered agents, akin to the human brain, progressively refine themselves through environment interactions. Existing paradigms remain constrained: parametric memory forcibly adjusts model parameters, and retrieval-based memory externalizes experience into structured databases, yet neither captures the fluid interweaving of reasoning and memory that underlies human cognition. To address this gap, we propose MemGen, a dynamic generative memory framework that equips agents with a human-esque cognitive faculty. It consists of a \textit{memory trigger}, which monitors the agent’s reasoning state to decide explicit memory invocation, and a \textit{memory weaver}, which takes the agent's current state as stimulus to construct a latent token sequence as machine-native memory to enrich its reasoning. In this way, MemGen enables agents to recall and augment latent memory throughout reasoning, producing a tightly interwoven cycle of memory and cognition. Extensive experiments across eight benchmarks show that MemGen surpasses leading external memory systems such as ExpeL and AWM by up to $38.22\\%$ , exceeds GRPO by up to $13.44\\%$ , and exhibits strong cross-domain generalization ability. More importantly, we find that without explicit supervision, MemGen spontaneously evolves distinct human-like memory faculties, including planning memory, procedural memory, and working memory, suggesting an emergent trajectory toward more naturalistic forms of machine cognition.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces MemGen, a dynamic generative memory framework that represents memory as latent token sequences integrated directly into agent reasoning processes. According to the taxonomy, this work sits in the 'Generative Latent Memory Frameworks' leaf under 'Memory Architecture and Organization'. Notably, this leaf contains only the original paper itself with no sibling papers, suggesting this represents a relatively sparse and potentially novel research direction within the broader field of memory-augmented LLM agents.

The taxonomy reveals that MemGen's closest conceptual neighbors include 'Dynamic Agentic Memory Systems' (one paper) and 'Procedural Memory Learning' (one paper), both within the same parent branch. The broader field shows active development in memory retrieval mechanisms and evolutionary learning, but the generative latent approach appears distinct from these directions. The taxonomy's scope notes explicitly exclude external database memories and parametric weight updates from this category, positioning MemGen as exploring a middle ground between retrieval-based and parametric memory paradigms.

Among the 22 candidates examined across three contributions, no clearly refuting prior work was identified. The core MemGen framework examined 10 candidates with zero refutable matches, while the Memory Trigger/Weaver components examined 2 candidates with similar results. The emergent memory hierarchy contribution also found no overlapping prior work among 10 examined candidates. This limited search scope suggests that within the top-22 semantically similar papers, the specific combination of generative latent memory with dynamic triggering mechanisms appears not to have direct precedents.

Based on the available signals from this limited literature search, MemGen appears to occupy a relatively unexplored position within the memory-augmented agent landscape. The taxonomy structure shows this as a sparse leaf with no immediate siblings, and the contribution-level analysis found no refuting work among examined candidates. However, these findings reflect only the top-22 semantic matches and may not capture the full breadth of related work in adjacent research communities or alternative formulations of similar ideas.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: generative latent memory for self-evolving language model agents. The field centers on equipping language model agents with memory systems that enable them to learn, adapt, and improve autonomously over time. The taxonomy reveals five major branches: Memory Architecture and Organization addresses how memory is structured and stored, ranging from hierarchical designs like Hierarchical Procedural Memory[9] to generative frameworks such as those in Generative Agents[2]; Memory Retrieval and Augmentation focuses on mechanisms for accessing and utilizing stored information effectively; Memory Learning and Evolution explores how agents refine their memory through experience, as seen in works like Evo-Memory[5]; Agent Application Domains examines deployment contexts from interactive simulations to specialized tasks; and Agent Development Frameworks and Tools provides infrastructure for building and evaluating these systems. These branches are deeply interconnected, as architectural choices directly influence retrieval strategies and learning dynamics. Recent work shows particularly active development in generative memory frameworks and evolutionary learning mechanisms. A central tension emerges between structured memory architectures that impose explicit organization versus more fluid generative approaches that synthesize memory representations on demand. MemGen[0] sits squarely within the generative latent memory frameworks, emphasizing self-evolution through latent representations that can be dynamically generated and refined. This positions it closely alongside Evo-Memory[5], which similarly focuses on memory evolution, though MemGen[0] appears to place stronger emphasis on the generative aspect of memory construction rather than purely evolutionary refinement. Compared to earlier foundational work like Generative Agents[2], MemGen[0] likely advances the self-evolving dimension, while works such as Agentic Memory[1] explore complementary perspectives on how agents manage and leverage their memory stores across diverse interaction scenarios.

Claimed Contributions

MemGen: Dynamic Generative Memory Framework

10 retrieved papers

The authors introduce MemGen, a framework that enables LLM agents to dynamically generate and integrate latent memory during reasoning. Unlike parametric or retrieval-based approaches, MemGen interweaves memory and reasoning through continuous monitoring and generative reconstruction, creating a fluid cognitive cycle similar to human cognition.

10 retrieved papers

Memory Trigger and Memory Weaver Components

2 retrieved papers

The framework comprises two synergistic components: a reinforcement learning-trained memory trigger that determines when to invoke memory based on the agent's cognitive state, and a memory weaver that generates machine-native latent memory sequences to enrich reasoning without modifying the core LLM parameters.

2 retrieved papers

Emergent Human-like Memory Hierarchy

10 retrieved papers

The authors demonstrate that MemGen autonomously develops functionally specialized memory types resembling human cognitive faculties. These include planning memory for high-level task organization, procedural memory for task-specific skills, and working memory for maintaining context coherence, emerging without explicit architectural design for these distinctions.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Within the taxonomy built over the current TopK core-task papers, the original paper is assigned to a leaf with no direct siblings and no cousin branches under the same grandparent topic. In this retrieved landscape, it appears structurally isolated, which is one partial signal of novelty, but still constrained by search coverage and taxonomy granularity.

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

MemGen: Dynamic Generative Memory Framework

[23] Cognitive architectures for language agents PDF

Cannot Refute

[24] Empowering working memory for large language model agents PDF

Cannot Refute

[25] FinMem: A Performance-Enhanced LLM Trading Agent With Layered Memory and Character Design PDF

Cannot Refute

[26] Can a cognitive architecture fundamentally enhance LLMs? Or vice versa? PDF

Cannot Refute

[27] From Experience to Strategy: Empowering LLM Agents with Trainable Graph Memory PDF

Cannot Refute

[28] Advances and challenges in foundation agents: From brain-inspired intelligence to evolutionary, collaborative, and safe systems PDF

Cannot Refute

[29] Reasoningbank: Scaling agent self-evolving with reasoning memory PDF

Cannot Refute

[30] Procedural memory is not all you need: Bridging cognitive gaps in llm-based agents PDF

Cannot Refute

[31] Enhancing reasoning with collaboration and memory PDF

Cannot Refute

[32] CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension PDF

Cannot Refute

Contribution

Memory Trigger and Memory Weaver Components

[33] Latent syntax weaving in large language model representations: A novel mechanism for self-referential consistency in neural architectures PDF

Cannot Refute

[34] UFO: Weaving the Digital Agent Galaxy PDF

Cannot Refute

Contribution

Emergent Human-like Memory Hierarchy

[13] Hiagent: Hierarchical working memory management for solving long-horizon agent tasks with large language model PDF

Cannot Refute

[14] MAPLE: Multi-agent adaptive planning with long-term memory for table reasoning PDF

Cannot Refute

[15] Metagent-P: A Neuro-Symbolic Planning Agent with Metacognition for Open Worlds PDF

Cannot Refute

[16] Arigraph: Learning knowledge graph world models with episodic memory for llm agents PDF

Cannot Refute

[17] GraphCogent: Mitigating LLMs' Working Memory Constraints via Multi-Agent Collaboration in Complex Graph Understanding PDF

Cannot Refute

[18] Working Memory Identifies Reasoning Limits in Language Models PDF

Cannot Refute

[19] GATSim: Urban Mobility Simulation with Generative Agents PDF

Cannot Refute

[20] Llm-powered decentralized generative agents with adaptive hierarchical knowledge graph for cooperative planning PDF

Cannot Refute

[21] Memory, space, and planning: Multiscale predictive representations PDF

Cannot Refute

[22] Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments PDF

Cannot Refute

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

Contribution Analysis

MemGen: Dynamic Generative Memory Framework

[23] Cognitive architectures for language agents PDF

[24] Empowering working memory for large language model agents PDF

[25] FinMem: A Performance-Enhanced LLM Trading Agent With Layered Memory and Character Design PDF

[26] Can a cognitive architecture fundamentally enhance LLMs? Or vice versa? PDF

[27] From Experience to Strategy: Empowering LLM Agents with Trainable Graph Memory PDF

[28] Advances and challenges in foundation agents: From brain-inspired intelligence to evolutionary, collaborative, and safe systems PDF

[29] Reasoningbank: Scaling agent self-evolving with reasoning memory PDF

[30] Procedural memory is not all you need: Bridging cognitive gaps in llm-based agents PDF

[31] Enhancing reasoning with collaboration and memory PDF

[32] CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension PDF

Memory Trigger and Memory Weaver Components

[33] Latent syntax weaving in large language model representations: A novel mechanism for self-referential consistency in neural architectures PDF

[34] UFO: Weaving the Digital Agent Galaxy PDF

Emergent Human-like Memory Hierarchy

[13] Hiagent: Hierarchical working memory management for solving long-horizon agent tasks with large language model PDF

[14] MAPLE: Multi-agent adaptive planning with long-term memory for table reasoning PDF

[15] Metagent-P: A Neuro-Symbolic Planning Agent with Metacognition for Open Worlds PDF

[16] Arigraph: Learning knowledge graph world models with episodic memory for llm agents PDF

[17] GraphCogent: Mitigating LLMs' Working Memory Constraints via Multi-Agent Collaboration in Complex Graph Understanding PDF

[18] Working Memory Identifies Reasoning Limits in Language Models PDF

[19] GATSim: Urban Mobility Simulation with Generative Agents PDF

[20] Llm-powered decentralized generative agents with adaptive hierarchical knowledge graph for cooperative planning PDF

[21] Memory, space, and planning: Multiscale predictive representations PDF

[22] Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments PDF

Table of Contents