Chenxiao Yang @ ICLR2025 (@chenxiao_yang_)'s Twitter Profile
Chenxiao Yang @ ICLR2025

@chenxiao_yang_

Hi, this is Chenxiao Yang. I am an incoming PhD student at TTIC. Before that, I earned my MS and BS at SJTU. Interested in ML theory, GNN and LLM.

ID: 1499067800518000640

Link: http://chr26195.github.io

Joined: 02-03-2022 17:03:38

7 Tweets

44 Followers

104 Following

Towards Data Science (@tdatascience)

"In this article, we challenge the conventional 'write-only' CoT reasoning paradigm that dominates current LLM architectures, from both theoretical and practical perspectives." Chenxiao Yang presents insights based on their recent paper. towardsdatascience.com/empowering-llm…

losh (@attentionmech)

"Your goal in problem solving is not to solve the problem, but to raise your understanding of it.. to a level where the problem is almost trivial" -- my old CS prof

Chenxiao Yang @ ICLR2025 (@chenxiao_yang_)

When I applied for PhD (23 Fall), my goal was to develop new math to understand AI, so I chose a theory-focused school. Now, as a PhD, my focus has shifted to studying AI’s "psychology" scientifically. This article echoes many of my earlier reflections. 👍

Songlin Yang (@songlinyang4)

📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks arxiv.org/abs/2505.16381

Chenxiao Yang @ ICLR2025 (@chenxiao_yang_)

🤣 If the “illusion of thinking” just means “context not large enough to think deeply”, here’s a simple path to true intelligence: ERASE thoughts! We proved that LLMs can solve much larger-sized problems (like Tower of Hanoi which requires exponential steps) by recursively