Pranav agarwal (@pranav_al) Twitter Tweets • TwiCopy

Hua Shen✨

a year ago

📢Is current “human-AI alignment” research clarified and comprehensive? 🤔 We systematically reviewed 400+ papers across HCI, NLP, and ML to develop a framework for 👫<>🤖"Bidirectional Human-AI Alignment", encompassing the dual paths of “Aligning AI to Human” and “Aligning Human

thumb_up_off_alt273

chat_bubble_outline6

repeat68

shareShare

Hua Shen✨

@huashen218

a year ago

2/ 💎【Bidirectional Human-AI Alignment Framework】 We introduce our “Bidirectional Human-AI Alignment” framework developed from the systematic review. 🔸A🔸 “Align AI to Human” focuses on mechanisms ensuring AI systems’ objectives match those of humans’. 🔸B🔸 “Align Humans to

thumb_up_off_alt13

chat_bubble_outline1

repeat1

shareShare

Amanda Askell

@amandaaskell

a year ago

I had a lot of fun talking with Lex Fridman about a wide range of topics on his podcast, alongside Dario and Chris. Hope it's interesting to others!

thumb_up_off_alt1,1K

chat_bubble_outline77

repeat221

shareShare

Manling Li

@manlingli_

5 months ago

Can VLMs build Spatial Mental Models like humans? Reasoning from limited views? Reasoning from partial observations? Reasoning about unseen objects behind furniture / beyond current view? Check out MindCube! 🌐mll-lab-nu.github.io/mind-cube/ 📰arxiv.org/pdf/2506.21458

thumb_up_off_alt280

chat_bubble_outline5

repeat56

shareShare

Denny Zhou

@denny_zhou

4 months ago

Slides for my lecture “LLM Reasoning” at Stanford CS 25: dennyzhou.github.io/LLM-Reasoning-… Key points: 1. Reasoning in LLMs simply means generating a sequence of intermediate tokens before producing the final answer. Whether this resembles human reasoning is irrelevant. The crucial

thumb_up_off_alt2,2K

chat_bubble_outline22

repeat322

shareShare

Archiki Prasad

@archikiprasad

4 months ago

📢 Excited to share our new paper, where we introduce, ✨GrAInS✨, an inference-time steering approach for LLMs and VLMs via token attribution. Some highlights: ➡️GrAIns leverages contrastive, gradient-based attribution to identify the most influential textual or visual tokens

thumb_up_off_alt80

chat_bubble_outline0

repeat21

shareShare

Pranav agarwal

@pranav_al

4 months ago

Just got our Neurips reviews back, and the reviewers nailed every limitation and missing experiment we already knew about (and no, these weren't LLM-generated😅). Here's a thought: authors are often their own harshest critics. What if conferences required us to submit detailed

thumb_up_off_alt11

chat_bubble_outline1

repeat2

shareShare

Pranav agarwal

@pranav_al

4 months ago

still waiting for someone to build IMDB for academia🫥 no one should waste 5 years on a badly reviewed movie

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

Sang Cho

@saaaang94

4 months ago

We are hiring! Interested in optimizing/scaling RL framework for pretrain scale RL? DM me or apply here: job-boards.greenhouse.io/xai/jobs/47991…

thumb_up_off_alt502

chat_bubble_outline7

repeat95

shareShare

Dr Singularity

@dr_singularity

4 months ago

Insane AI news A new paper introduces ASI-ARCH, a fully automated AI research loop that can independently discover superior neural network architectures, outpacing human designed models. Unlike traditional methods limited by human trial and error, ASI-ARCH connects LLM based

thumb_up_off_alt1,1K

chat_bubble_outline66

repeat181

shareShare

Smoke-away

@smokeawayyy

4 months ago

Theory: Everything Exists in Latent Space At a certain scale of base model intelligence, latent space contains nearly every possible idea, invention, and technology, along with every combination of those things. Therefore, AGI exists in latent space. AGI isn’t something we

thumb_up_off_alt305

chat_bubble_outline60

repeat34

shareShare

Jia-Bin Huang

@jbhuang0604

4 months ago

Who is Adam? In this video, meet this guy who has some momentum and his cousin AdamW. youtu.be/1_nujVNUsto

thumb_up_off_alt147

chat_bubble_outline3

repeat10

shareShare

Rohan Paul

@rohanpaul_ai

4 months ago

Survery paper with lots of insights on Continual Reinforcement Learning Detailed review of existing works, organizing and analyzing their metrics, tasks, benchmarks, and scenario settings. 🧩 Why the field exists A classic RL agent hones one policy for one environment then

thumb_up_off_alt342

chat_bubble_outline6

repeat67

shareShare

Pranav agarwal

@pranav_al

4 months ago

"Heartbreak at Headingley, heartbreak at Lord's, and yet a performance with so much heart in Manchester." As always, amazing commentary by Cricbuzz, and what a match! While it ended in a draw, it was probably the best fighting performance by Team India in recent times.

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

AK

@_akhaliq

4 months ago

Agentic Reinforced Policy Optimization

thumb_up_off_alt139

chat_bubble_outline3

repeat18

shareShare

ACL 2025

@aclmeeting

4 months ago

📅 10-Year ToT Award (2015) Thang Luong, Hieu Pham & Christopher D. Manning: “Effective Approaches to Attention-based Neural Machine Translation” EMNLP 2015 🔗 aclanthology.org/D15-1166/ A milestone in neural MT and attention mechanisms. 🔁🧠

thumb_up_off_alt38

chat_bubble_outline1

repeat10

shareShare