Jiashuo Liu (@liujiashuo77) 's Twitter Profile
Jiashuo Liu

@liujiashuo77

PhD @Tsinghua_Uni CS | AI Research @ByteDance Seed Team | OOD Gen, Data-Centric AI | Visiting @Stanford @Cambridge @Columbia

ID: 1428954721600131076

linkhttp://ljsthu.github.io calendar_today21-08-2021 05:39:01

235 Tweet

395 Takipรงi

501 Takip Edilen

Jiashuo Liu (@liujiashuo77) 's Twitter Profile Photo

Thrilled by the interest in our work! [Surprising finding] Across 7,650 source-target pairs, neural networks with LLM embeddings outperformed GBDTs, DROs, and GPT-4-mini. See the 'optimal ratio' for each method below ๐Ÿ‘‡ [Question] Do we still need a dedicated Tabular LLM? ๐Ÿค”

Thrilled by the interest in our work! 
[Surprising finding] Across 7,650 source-target pairs, neural networks with LLM embeddings outperformed GBDTs, DROs, and GPT-4-mini. See the 'optimal ratio' for each method below ๐Ÿ‘‡
[Question] Do we still need a dedicated Tabular LLM? ๐Ÿค”
Renzhe Xu (@xrz199721) 's Twitter Profile Photo

๐Ÿšจ Excited to share our latest research! ๐Ÿšจ We present Stable Cox Regression, published in Nature Machine Intelligence! ๐ŸŽ‰nature Nature Machine Intelligence Read more: nature.com/articles/s4225โ€ฆ #StableCox #SurvivalAnalysis #MachineLearning

Jiashuo Liu (@liujiashuo77) 's Twitter Profile Photo

Probably the 1ST data-centric LLM copilot, integrating various data-centric tools, supporting complex data processing and modeling of clinical problems๐Ÿ”ฅ๐Ÿ”ฅ - Paper: Summarizes various data-related issues & corresponding tools, along with detailed case studies in the medical field

Jiashuo Liu (@liujiashuo77) 's Twitter Profile Photo

Thank you so much! YES, we find Grok 4 super cool, and it is really good at harder tiers (Deep Search and Super Agent). Btw, our case study (sec 4.5.1) shows that Grok 4โ€™s search capability is much better!

Jiashuo Liu (@liujiashuo77) 's Twitter Profile Photo

We're collaborating with six commercial agents to include them in the next leaderboard update. Meanwhile, GPT-5 Pro, ChatGPT Agent, Gemini 2.5 Pro Deep Think, and Claude Opus 4.1 are all on the way. And now, a future prediction challenge: Will Grok remain the best? Letโ€™s seeโ€ฆ

Jiashuo Liu (@liujiashuo77) 's Twitter Profile Photo

Wow thanks Elon! Yes, we think it's a measure of agent's AGI! Again, introduce our FutureX live benchmark. futurex-ai.github.io