Computer Science > Computation and Language

arXiv:2603.22241 (cs)

[Submitted on 23 Mar 2026 (v1), last revised 13 Apr 2026 (this version, v2)]

Title: MemDLM: Memory-Enhanced DLM Training

Authors: Zehua Pei, Hui-Ling Zhen, Weizhe Lin, Sinno Jialin Pan, Yunhe Wang, Mingxuan Yuan, Bei Yu

Abstract: Diffusion Language Models (DLMs) offer attractive advantages over Auto-Regressive (AR) models, such as full-attention parallel decoding and flexible generation. However, standard DLM training uses a static, single-step masked prediction objective that never exposes the model to the progressive denoising dynamics of inference, and forces all contextual information to be maintained purely through token-space attention, which becomes increasingly diluted as context length grows. We propose MemDLM (Memory-Enhanced DLM), which introduces a second memory channel by embedding a simulated denoising trajectory into training via Bi-level Optimization. An inner loop updates a set of fast weights, forming a Parametric Memory that captures the local trajectory experience, while an outer loop updates the base model conditioned on this memory. By offloading part of the memorization burden from token-space attention to parameter space, MemDLM yields faster convergence, stronger long-context representations, and lower training loss, even when the fast weights are discarded at inference time. Re-enabling the inner loop at inference provides an additional prompt-specific adaptation effect, where the Parametric Memory acts as an emergent in-weight retrieval mechanism on challenging Needle-in-a-Haystack tasks. Code: this https URL.
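The inner/outer structure described in the abstract can be illustrated with a deliberately tiny sketch. This is not the authors' implementation: the 1-D linear model, the noisy-target "trajectory", the first-order outer update (the fast weight is treated as a constant when updating the base weight), and every name and hyperparameter below are assumptions chosen only to show the shape of a bi-level loop with per-sample fast weights.

```python
import random

def loss_and_grad(weight, x, y):
    """Squared error of a 1-D linear model and its gradient w.r.t. the weight."""
    err = weight * x - y
    return 0.5 * err * err, err * x

def make_trajectory(x, y, rng, steps=3):
    """Hypothetical stand-in for a denoising trajectory: the target is observed
    with noise whose scale shrinks at every step."""
    return [(x, y + rng.gauss(0.0, 0.5 * (steps - t) / steps)) for t in range(steps)]

def inner_loop(base_w, trajectory, inner_lr=0.1):
    """Inner loop: adapt a fast weight along one trajectory, base weight frozen."""
    fast_w = 0.0  # fresh per-sample memory (an assumption of this sketch)
    for x_t, y_t in trajectory:
        _, grad = loss_and_grad(base_w + fast_w, x_t, y_t)
        fast_w -= inner_lr * grad
    return fast_w

def train(true_w=2.0, outer_steps=1000, outer_lr=0.1, seed=0):
    """Outer loop: update the base weight conditioned on the adapted fast weight."""
    rng = random.Random(seed)
    base_w = 0.0
    for _ in range(outer_steps):
        x = rng.uniform(-1.0, 1.0)
        y = true_w * x
        fast_w = inner_loop(base_w, make_trajectory(x, y, rng))
        # First-order approximation: no gradient flows through the inner updates.
        _, grad = loss_and_grad(base_w + fast_w, x, y)
        base_w -= outer_lr * grad
    return base_w  # fast weights are discarded; only the base weight is kept

base_w = train()
```

In this toy setting the returned `base_w` is used on its own, mirroring the abstract's point that the fast weights can be dropped at inference; calling `inner_loop` again on a new input would correspond to re-enabling the inner loop.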

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2603.22241 [cs.CL]
	(or arXiv:2603.22241v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2603.22241

Submission history

From: Zehua Pei
[v1] Mon, 23 Mar 2026 17:39:56 UTC (775 KB)
[v2] Mon, 13 Apr 2026 08:19:37 UTC (774 KB)
