Kaiqu Liang (@kaiqu_liang) 's Twitter Profile
Kaiqu Liang

@kaiqu_liang

PhD student @PrincetonCS | AI Safety, LLM Alignment, Embodied AI

ID: 1484917372683173890

Link: https://kaiquliang.github.io/ · Joined: 22-01-2022 15:54:43

41 Tweets

363 Followers

374 Following

Xindi Wu (@cindy_x_wu) 's Twitter Profile Photo

Introducing COMPACT: COMPositional Atomic-to-complex Visual Capability Tuning, a data-efficient approach to improve multimodal models on complex visual tasks without scaling data volume. 📦

arxiv.org/abs/2504.21850

1/10
John Yang (@jyangballin) 's Twitter Profile Photo

40% with just 1 try per task: SWE-agent-LM-32B is the new #1 open source model on SWE-bench Verified.

We built it by synthesizing a ton of agentic training data from 100+ Python repos.

Today we’re open-sourcing the toolkit that made it happen: SWE-smith.
Haimin Hu (@haiminhu) 's Twitter Profile Photo

🗓️ Mark your calendar: The 1st #ICRA Workshop on Public Trust in Autonomous Systems (PTAS) is just two days away!

We'll explore the critical question: How do we build assurances into autonomous technologies from the ground up, shaping public trust before widespread deployment?
Xuandong Zhao (@xuandongzhao) 's Twitter Profile Photo

🚀 Excited to share the most inspiring work I’ve been part of this year:
 
"Learning to Reason without External Rewards"

TL;DR: We show that LLMs can learn complex reasoning without access to ground-truth answers, simply by optimizing their own internal sense of confidence. 1/n
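The idea of rewarding a model with its own confidence can be sketched concretely. The snippet below computes one illustrative notion of "internal confidence" for a generated sequence: the mean KL divergence between each token's predicted distribution and the uniform distribution (so a peaked, certain model scores high and a flat, uncertain one scores near zero). This is a hedged sketch only; the paper's actual reward definition and training setup may differ, and `self_certainty` is a hypothetical helper name.

```python
import numpy as np

def self_certainty(logits):
    """Confidence score for a generated sequence.

    Computed as the mean, over positions, of KL(p_t || uniform),
    where p_t is the model's next-token distribution at step t.
    Since KL(p || U) = log(V) - H(p), a uniform distribution scores 0
    and a fully peaked one scores log(V).

    logits: array of shape (seq_len, vocab_size)
    """
    # numerically stable softmax over the vocabulary axis
    z = logits - logits.max(axis=-1, keepdims=True)
    p = np.exp(z) / np.exp(z).sum(axis=-1, keepdims=True)
    vocab = logits.shape[-1]
    entropy = -(p * np.log(p + 1e-12)).sum(axis=-1)  # H(p_t) per step
    kl = np.log(vocab) - entropy                     # KL(p_t || uniform)
    return kl.mean()
```

In an RL loop, a score like this could stand in for a ground-truth reward signal: sequences the model is confident about get reinforced, with no external answer key required.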
Alignment Lab AI (@alignment_lab) 's Twitter Profile Photo

arxiv.org/pdf/2507.07484
machine-bullshit.github.io

Princeton University and UC Berkeley published a formalized analysis of the emergent dishonesty that RLHF at scale optimizes for in large language models.

They provide a taxonomy and a scoring system that allow for direct indexing.
Robert Scoble (@scobleizer) 's Twitter Profile Photo

LLMs are picking up weird patterns from humans. Aligning them to be helpful actually teaches them to bullshit? Must be why I like Grok Unhinged the best. :-)

Ryan Liu @ NeurIPS 2024 (@theryanliu) 's Twitter Profile Photo

Chain of thought can hurt LLM performance 🤖
Verbal (over)thinking can hurt human performance 😵‍💫

Are when/why they happen similar?

Come find out at our poster at West-320 ⏰11am tomorrow!

#ICML2025
Balázs Kégl (@balazskegl) 's Twitter Profile Photo

Awesome, I've been saying this for a while, inspired by Dr John Vervaeke. LLMs are formally bullshitting, yes.

medium.com/@balazskegl/on…

A couple of threads that may be interesting:
x.com/balazskegl/sta…
x.com/NandoDF/status…

The connection: when we speak, we have an