AGI Fire Alarm (@agifirealarm)'s Twitter Profile

Current Alert Level: Amber Sparkle 2

ID: 1557354773975908353

10-08-2022 13:14:59

399 Tweets

406 Followers

2.2K Following

Sander Dieleman (@sedielem):

Making diffusion language models work as well as autoregressive ones will be a challenge (see my earlier blog post: sander.ai/2023/01/09/dif…). This paper quantifies this and finds a 64x efficiency disadvantage across all scales 👀 a big gap, but at least it's a constant factor!

AGI Fire Alarm (@agifirealarm):

PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model. This model blends latent semantic diffusion with autoregressive generation. The goal is to produce fluent text while also maintaining global control over paragraph structure. arxiv.org/abs/2306.02531

AGI Fire Alarm (@agifirealarm):

This paper introduces Language Models Augmented with Long-Term Memory (LongMem), a novel framework that enhances LLMs' capabilities. LongMem uses a decoupled network to handle long-term contexts, greatly improving memory-augmented in-context learning. arxiv.org/abs/2306.07174

AGI Fire Alarm (@agifirealarm):

The best way to leverage distributed/decentralized compute at scale for AI may turn out to be doing massively parallel inference to generate synthetic data.