Computer Science
Titans: Learning to Memorize at Test Time
A. Behrouz, P. Zhong, et al.
Discover Titans: a new family of architectures that pair a neural long-term memory module with attention to capture massive historical context while keeping fast, parallelizable training and inference. Experiments show Titans outperform Transformers and modern linear recurrent models on language modeling, common-sense reasoning, genomics, and time series, and can scale beyond 2M context windows. Research conducted by Ali Behrouz, Peilin Zhong, and Vahab Mirrokni.
Related Publications
Explore these studies to deepen your understanding of the subject.

