Kexun Zhang@ICLR 2025 (@kexun_zhang) 's Twitter Profile
Kexun Zhang@ICLR 2025

@kexun_zhang

PhD student at @LTIatCMU. Previously at @ucsbNLP, @ZJU_china. language lover.

ID: 1474710385420873729

linkhttp://zkx06111.github.io calendar_today25-12-2021 11:55:37

583 Tweet

1,1K Takipçi

744 Takip Edilen

Omar Khattab (@lateinteraction) 's Twitter Profile Photo

Sigh, it's a bit of a mess. Let me just give you guys the full nuance in one stream of consciousness since I think we'll continue to get partial interpretations that confuse everyone. All the little things I post need to always be put together in one place. First, I have long

Omar Khattab (@lateinteraction) 's Twitter Profile Photo

btw i forgot to say that “under-reporting” might not be the most fair characterization hence. a lot of this is “the community’s current mental model of what LLMs even are is off” rather than “the researcher is under-reporting some baseline” it isn’t those researchers’ fault…

Chong Zeng (@iam_ncj) 's Twitter Profile Photo

What if a Transformer could render? Not text → image. But mesh → image — with global illumination. No rasterizers. No ray-tracers. Just a Transformer without per-scene training. RenderFormer does exactly that. #SIGGRAPH2025 🔗microsoft.github.io/renderformer

What if a Transformer could render?
Not text → image.
But mesh → image — with global illumination.

No rasterizers. No ray-tracers. Just a Transformer without per-scene training.

RenderFormer does exactly that.

#SIGGRAPH2025 
🔗microsoft.github.io/renderformer
Manish Shetty (@slimshetty_) 's Twitter Profile Photo

✨ NEW SWE-Agents BENCHMARK ✨ Introducing GSO: The Global Software Optimization Benchmark - 👩🏻‍💻 100+ challenging software optimization tasks - 🛣️ a long-horizon task w/ precise specification - 🐘 large code changes in Py, C, C++, ... - 📉 SOTA models get < 5% success! 1/

✨ NEW SWE-Agents BENCHMARK ✨

Introducing GSO: The Global Software Optimization Benchmark
 - 👩🏻‍💻 100+ challenging software optimization tasks
 - 🛣️ a long-horizon task w/ precise specification
 - 🐘 large code changes in Py, C, C++, ...
 - 📉 SOTA models get &lt; 5% success!

1/
Shizhe Diao (@shizhediao) 's Twitter Profile Photo

Yu Yang Hi one exciting thing is: we haven’t observed an upper limit yet — we’re still training and continuing to increase the number of steps!

DailyPapers (@huggingpapers) 's Twitter Profile Photo

LLM Coding made easier HardTests: synthesizing high-quality test cases for LLM coding to improve code evaluation and LLM post-training huggingface.co/papers/2505.24…

Aws Albarghouthi 🍉 أوس (@awsto) 's Twitter Profile Photo

the use of "verifier" in CS: - theory/PL: a tool for checking or generating a mathematical proof - ML: a tool that returns a real number

Xuandong Zhao (@xuandongzhao) 's Twitter Profile Photo

🚀 Excited to share our latest work: AgentSynth A powerful and cost-effective pipeline for generating diverse, high-quality, and realistic computer-use tasks Details below 🧵(1/n)

🚀 Excited to share our latest work: AgentSynth

A powerful and cost-effective pipeline for generating diverse, high-quality, and realistic computer-use tasks
 
Details below 🧵(1/n)
Omar Khattab (@lateinteraction) 's Twitter Profile Photo

Too many think the problem with LLMs is that they’re not human enough. But the problem with LLMs is that they’re not computer enough. We’re used to a standard of reliability from computer programs that LLMs so far don’t live up to. But making them human-like doesn’t fix that!

Alon Albalak (@albalakalon) 's Twitter Profile Photo

🚨 We’re hiring on the Open-Endedness team Lila Sciences and I’m beyond excited about our work! We research AI that doesn’t just solve problems, it creatively explores new scientific frontiers. If that excites you or someone you know 📢 Please RT + read on 🧵👇