Satyapriya Krishna (@satyascribbles) 's Twitter Profile
Satyapriya Krishna

@satyascribbles

Explorer. @ai4life_harvard @hseas @googleAI @MetaAI @SCSatCMU @AmazonScience @ml_collective @D3Harvard @HarvardAISafety

ID: 1267551086010867713

Link: https://satyapriyakrishna.com/ · Joined: 01-06-2020 20:18:32

460 Tweets

471 Followers

245 Following

GLADIA Research Lab (@gladialab) 's Twitter Profile Photo

LLMs are injective and invertible. In our new paper, we show that different prompts always map to different embeddings, and this property can be used to recover input tokens from individual embeddings in latent space. (1/6)

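The injectivity claim above has a concrete consequence: if no two inputs share an embedding, an input can in principle be recovered by searching for the sequence that reproduces the observed embedding. The toy map and greedy prefix search below illustrate only that idea; the function names and the embedding itself are made up for this sketch and are not the paper's actual method.

```python
def embed(tokens):
    """A deterministic, injective toy 'embedding': each position i
    contributes the distinct coordinate (i + 1) * token_id, so the
    full tuple uniquely determines the input sequence."""
    return tuple((i + 1) * t for i, t in enumerate(tokens))

def invert(target, vocab, max_len):
    """Recover the token sequence by greedy prefix search: extend the
    candidate one token at a time, keeping the token whose prefix
    embedding matches the corresponding prefix of the target."""
    recovered = []
    for i in range(max_len):
        for t in vocab:
            if embed(recovered + [t]) == target[: i + 1]:
                recovered.append(t)
                break
        else:
            break  # no extension matches: sequence fully recovered
    return recovered

vocab = list(range(100))
tokens = [42, 7, 99, 3]
assert invert(embed(tokens), vocab, max_len=10) == tokens
```

With a real LLM the search space is far too large for brute force, which is why the paper's contribution is showing the property holds and giving a practical recovery procedure; this sketch only demonstrates why injectivity makes recovery well-posed.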
Yilun Du (@du_yilun) 's Twitter Profile Photo

Sharing our work at NeurIPS Conference on reasoning with EBMs! We learn an EBM over simple subproblems and combine EBMs at test-time to solve complex reasoning problems (3-SAT, graph coloring, crosswords). Generalizes well to complex 3-SAT / graph coloring/ N-queens problems.
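The compositional idea in the tweet above can be sketched in a few lines: train (or define) an energy per simple subproblem, then solve a harder problem at test time by minimizing the *sum* of the per-subproblem energies. This toy uses hand-written edge energies and exhaustive search on a tiny graph-coloring instance, where the paper learns neural EBMs; everything here is an illustrative assumption.

```python
from itertools import product

def edge_energy(assignment, u, v):
    # One subproblem per edge: energy 0 if the endpoints get
    # different colors, 1 if the constraint is violated.
    return 0.0 if assignment[u] != assignment[v] else 1.0

def total_energy(assignment, edges):
    # Test-time composition: subproblem energies simply add up.
    return sum(edge_energy(assignment, u, v) for u, v in edges)

def solve(n_nodes, edges, n_colors):
    # Exhaustive minimization stands in for the paper's learned sampler.
    best = min(product(range(n_colors), repeat=n_nodes),
               key=lambda a: total_energy(a, edges))
    return best, total_energy(best, edges)

# A triangle needs 3 colors; with 3 colors the minimum energy is 0.
edges = [(0, 1), (1, 2), (0, 2)]
coloring, energy = solve(3, edges, 3)
assert energy == 0.0
```

The appeal of the formulation is that the same trick covers 3-SAT, graph coloring, and crosswords: only the per-constraint energy changes, while the test-time combination stays a sum.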

Dan Hendrycks (@danhendrycks) 's Twitter Profile Photo

Can AI automate jobs? We created the Remote Labor Index to test AI’s ability to automate hundreds of long, real-world, economically valuable projects from remote work platforms. While AIs are smart, they are not yet that useful: the current automation rate is less than 3%.

Rohit Prasad (@rohitprasadai) 's Twitter Profile Photo

Year 2 of the Amazon Nova AI Challenge is here, focused on trusted software agents. 10 university teams will advance agentic AI for software eng, balancing capability & safety - some will build defenses, others will probe for weaknesses. Apps open Nov 10! amazon.science/nova-ai-challe…

Kai-Wei Chang (@kaiwei_chang) 's Twitter Profile Photo

Last year, I led an unlearning effort at Amazon with the Nova Responsible AI and Pretraining teams, focusing on controlling model knowledge and behavior. We hosted an LLM unlearning challenge at SemEval and developed the LUME unlearning benchmark, a multitask optimization method, and a

Generalist (@generalistai_) 's Twitter Profile Photo

Introducing GEN-0, our latest 10B+ foundation model for robots ⏱️ built on Harmonic Reasoning, a new architecture that can think & act seamlessly 📈 strong scaling laws: more pretraining & model size = better 🌍 unprecedented corpus of 270,000+ hrs of dexterous data Read more 👇

Stanford NLP Group (@stanfordnlp) 's Twitter Profile Photo

Tomorrow, we are excited to welcome Weiyan Shi to the Stanford NLP Seminar! Date and Time: Thursday, November 6, 11:00AM — 12:00 PM Pacific Time. Zoom Link: stanford.zoom.us/j/93941842999?… Title: Beyond the Surface: How Post-Training Artifacts Shape LLM Diversity and Safety

Sonali Parbhoo (@sonali_ai4ai) 's Twitter Profile Photo

How do you make LLMs safer and more aligned with human values? A challenge is understanding the hidden reward signals they learn. Our new paper introduces Failure-Aware Inverse RL, a method to uncover these signals by focusing on what LLMs get wrong. Paper: arxiv.org/abs/2510.06092

Forecasting Research Institute (@research_fri) 's Twitter Profile Photo

Today, we are launching the most rigorous ongoing source of expert forecasts on the future of AI: the Longitudinal Expert AI Panel (LEAP). We’ve assembled a panel of 339 top experts across computer science, AI industry, economics, and AI policy. Roughly every month—for the next

Zico Kolter (@zicokolter) 's Twitter Profile Photo

I'm teaching a new "Intro to Modern AI" course at CMU this Spring: modernaicourse.org. It's an early-undergrad course on how to build a chatbot from scratch (well, from PyTorch). The course name has bothered some people – "AI" usually means something much broader in academic

Zhiyuan Zeng (@zhiyuanzeng_) 's Twitter Profile Photo

RL is bounded by finite data😣? Introducing RLVE: RL with Adaptive Verifiable Environments We scale RL with data procedurally generated from 400 envs dynamically adapting to the trained model 💡find supervision signals right at the LM capability frontier + scale them 🔗in🧵

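The "environments adapting to the trained model" idea above can be sketched as a procedurally generated, automatically verifiable task whose difficulty parameter moves toward the learner's capability frontier (roughly, a target success rate). The task, the target rate, and the update rule below are illustrative assumptions, not RLVE's actual environments.

```python
import random

class AdaptiveEnv:
    """Toy adaptive verifiable environment: list-summing tasks whose
    length grows when the model succeeds too often and shrinks when
    it fails too often, keeping supervision near the frontier."""

    def __init__(self, difficulty=1, target=0.5, window=20):
        self.difficulty = difficulty
        self.target = target
        self.window = window
        self.history = []

    def sample_task(self, rng):
        # Verifiable by construction: the answer is computed exactly.
        nums = [rng.randint(0, 9) for _ in range(self.difficulty)]
        return nums, sum(nums)

    def verify_and_adapt(self, success):
        self.history.append(success)
        if len(self.history) < self.window:
            return
        rate = sum(self.history) / len(self.history)
        if rate > self.target:
            self.difficulty += 1          # model is bored: harden
        elif rate < self.target:
            self.difficulty = max(1, self.difficulty - 1)  # ease off
        self.history.clear()

rng = random.Random(0)
env = AdaptiveEnv()
# A stand-in "model" that can only sum lists of length <= 3.
for _ in range(200):
    nums, answer = env.sample_task(rng)
    guess = sum(nums) if len(nums) <= 3 else None
    env.verify_and_adapt(guess == answer)
# Difficulty settles at the model's frontier (oscillating around 3-4).
assert 3 <= env.difficulty <= 4
```

The point of the sketch is the feedback loop: because every task is generated with its ground-truth answer, the environment can both verify the model and reposition itself without any human labels.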
Amazon Science (@amazonscience) 's Twitter Profile Photo

Announcing a private AI bug bounty program to strengthen the Amazon Nova foundation models. Building on 30+ findings and $55,000+ in rewards from the public program, the new track partners with security researchers and academics to strengthen AI security. amazon.science/news/amazon-la…

Christopher Potts (@chrisgpotts) 's Twitter Profile Photo

The Anthropic perspective on interpretability is prominent and significant, but not inevitable. My own take is quite different. (Clip from a talk I gave; YouTube link in the thread):

Aditya Ramesh (@model_mechanic) 's Twitter Profile Photo

The value of fast iteration in AI is overrated. The best results are obtained by knowing the right things to do and doing each thing with neurotic precision and attention to detail.

Rosinality (@rosinality) 's Twitter Profile Photo

Expanding hidden states without increasing block dimensions. Classical line of approaches from the era when everyone tried to make variants of skip connections, but it could be worth trying.

Daniel Tan (@danielchtan97) 's Twitter Profile Photo

cool paper introducing better steering method tl;dr instead of using a fixed steering coefficient, optimize s.t. we get max steering while staying within distribution arxiv.org/abs/2510.13285
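The "max steering while staying within distribution" idea above can be sketched with a toy constraint: push a hidden state along a steering direction as far as possible while an in-distribution proxy still holds. Here the proxy is a simple norm budget and the search is bisection; both are assumptions for illustration, not the paper's exact formulation.

```python
def steer(h, direction, alpha):
    # Standard activation steering: add alpha * direction to the state.
    return [hi + alpha * di for hi, di in zip(h, direction)]

def norm(v):
    return sum(x * x for x in v) ** 0.5

def max_alpha(h, direction, budget, lo=0.0, hi=100.0, iters=60):
    """Bisection for the largest alpha with ||h + alpha*d|| <= budget,
    replacing a hand-picked fixed steering coefficient."""
    if norm(steer(h, direction, hi)) <= budget:
        return hi  # budget never binds within the search range
    for _ in range(iters):
        mid = (lo + hi) / 2
        if norm(steer(h, direction, mid)) <= budget:
            lo = mid
        else:
            hi = mid
    return lo

h = [1.0, 0.0]
d = [0.0, 1.0]
alpha = max_alpha(h, d, budget=2.0)
# ||(1, alpha)|| <= 2  implies  alpha <= sqrt(3) ~= 1.732
assert abs(alpha - 3 ** 0.5) < 1e-6
```

Swapping the norm budget for a distributional criterion (e.g. a divergence from the unsteered activations) recovers the spirit of the paper's optimized coefficient while keeping the same search structure.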

Mor Geva (@megamor2) 's Twitter Profile Photo

✨ New course materials: Interpretability of LLMs✨ This semester I'm teaching an active-learning grad course at Tel Aviv University on LLM interpretability, co-developed with my student Daniela Gottesman. We're releasing the materials as we go, so they can serve as a resource for anyone