Sinclair Wang (@sinclairwang1) 's Twitter Profile
Sinclair Wang

@sinclairwang1

PhDing @sjtu1896 #NLProc
Working on Data Engineering for LLMs: MathPile (2023), ๐Ÿซ ProX (2024),
๐Ÿ’Ž MegaMath (2025)

ID: 1326804636683022338

calendar_today12-11-2020 08:31:09

1,1K Tweet

1,1K Takipรงi

2,2K Takip Edilen

Luca Soldaini โœˆ๏ธ ICLR 25 (@soldni) 's Twitter Profile Photo

Emad Nathan Lambert processing all of CommonCrawl is about $20-50k [0], plus maybe 10-50k H100 if you wanna do GPU classification [1]. You can extract 1T tokens from PDFs for around $10k [2]. Major expenses are synth data, and verify which one of your approaches work [3]. -----------------------

Kimi.ai (@kimi_moonshot) 's Twitter Profile Photo

๐Ÿš€ Hello, Kimi K2! Open-Source Agentic Model! ๐Ÿ”น 1T total / 32B active MoE model ๐Ÿ”น SOTA on SWE Bench Verified, Tau2 & AceBench among open models ๐Ÿ”นStrong in coding and agentic tasks ๐Ÿค Multimodal & thought-mode not supported for now With Kimi K2, advanced agentic intelligence

๐Ÿš€ Hello, Kimi K2!  Open-Source Agentic Model!
๐Ÿ”น 1T total / 32B active MoE model
๐Ÿ”น SOTA on SWE Bench Verified, Tau2 & AceBench among open models
๐Ÿ”นStrong in coding and agentic tasks
๐Ÿค Multimodal & thought-mode not supported for now

With Kimi K2, advanced agentic intelligence
Yulun Du (@yulun_du) 's Twitter Profile Photo

Capable, Agentic, and Open-sourced. Kimi K2 excels in knowledge, math, and coding, and is optimized for complex tool use. See how it can analyze data, generate interactive webpages, and more. Explore what's possible and start building today!

Shiyu Ni (@shictyu) 's Twitter Profile Photo

๐ŸฅณHappy to share that our paper "Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception" has been accepted by #ACL2025! We explore leveraging LLMs' internal states to improve their knowledge boundary perception from efficiency and risk perspectives.

๐ŸฅณHappy to share that our paper "Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception" has been accepted by #ACL2025! We explore leveraging LLMs' internal states to improve their knowledge boundary perception from efficiency and risk perspectives.
Sinclair Wang (@sinclairwang1) 's Twitter Profile Photo

Excited to share that our two papers have been accepted to #ICML2025! ICML Conference However, I can't be there in person due to visa issues. What a pity.๐Ÿฅฒ Feel free to check out our poster, neither online nor offline in the Vancouver Convention Center. Programming Every Example:

Excited to share that our two papers have been accepted to #ICML2025! <a href="/icmlconf/">ICML Conference</a>  However, I can't be there in person due to visa issues. What a pity.๐Ÿฅฒ

Feel free to check out our poster, neither online nor offline in the Vancouver Convention Center.

Programming Every Example:
Run-Ze Fan (@vfrz525_) 's Twitter Profile Photo

๐Ÿšจ New release: MegaScience The largest & highest-quality post-training dataset for scientific reasoning is now open-sourced (1.25M QA pairs)! ๐Ÿ“ˆ Trained models outperform official Instruct baselines ๐Ÿ”ฌ Covers 7+ disciplines with university-level textbook-grade QA ๐Ÿ“„ Paper:

๐Ÿšจ New release: MegaScience
The largest &amp; highest-quality post-training dataset for scientific reasoning is now open-sourced (1.25M QA pairs)!
๐Ÿ“ˆ Trained models outperform official Instruct baselines
๐Ÿ”ฌ Covers 7+ disciplines with university-level textbook-grade QA
๐Ÿ“„ Paper:
Run-Ze Fan (@vfrz525_) 's Twitter Profile Photo

When building MegaScience, we learned the hard way: ๐Ÿ“ˆ Strong datasets need strong proxy models. Our data was too spicy ๐ŸŒถ๏ธ for small models like Qwen2.5-1.5B & 3Bโ€”they just flopped. But once we tried Qwen3-14B and 30Bโ€ฆ boom ๐Ÿ’ฅ, everything clicked. Kinda terrifying to think: if

Run-Ze Fan (@vfrz525_) 's Twitter Profile Photo

๐Ÿš€ In its first week, MegaScience ranks #4 on HuggingFace's Trending Datasets of the Week with 3.74k downloads! Thanks for the support โ€” letโ€™s keep pushing open science forward! ๐Ÿ“ท๐ŸŒ

๐Ÿš€ In its first week, MegaScience ranks #4 on HuggingFace's Trending Datasets of the Week with 3.74k downloads! Thanks for the support โ€” letโ€™s keep pushing open science forward! ๐Ÿ“ท๐ŸŒ
Feng Yao (@fengyao1909) 's Twitter Profile Photo

Failing on ๐ฅ๐š๐ซ๐ ๐ž-๐ฌ๐œ๐š๐ฅ๐ž ๐‘๐‹ with VeRL? โš ๏ธ Mixing inference backend (๐ฏ๐‹๐‹๐Œ/๐’๐†๐‹๐š๐ง๐ ) with training backends (๐…๐’๐ƒ๐/๐Œ๐ž๐ ๐š๐ญ๐ซ๐จ๐ง) ๐ฌ๐ž๐œ๐ซ๐ž๐ญ๐ฅ๐ฒ ๐ญ๐ฎ๐ซ๐ง๐ฌ ๐ฒ๐จ๐ฎ๐ซ ๐‘๐‹ ๐ข๐ง๐ญ๐จ ๐จ๐Ÿ๐Ÿ-๐ฉ๐จ๐ฅ๐ข๐œ๐ฒ โ€” even if they share the same weights! ๐Ÿ“‰ย Blog:

Failing on ๐ฅ๐š๐ซ๐ ๐ž-๐ฌ๐œ๐š๐ฅ๐ž ๐‘๐‹ with VeRL?

โš ๏ธ Mixing inference backend (๐ฏ๐‹๐‹๐Œ/๐’๐†๐‹๐š๐ง๐ ) with training backends (๐…๐’๐ƒ๐/๐Œ๐ž๐ ๐š๐ญ๐ซ๐จ๐ง) ๐ฌ๐ž๐œ๐ซ๐ž๐ญ๐ฅ๐ฒ ๐ญ๐ฎ๐ซ๐ง๐ฌ ๐ฒ๐จ๐ฎ๐ซ ๐‘๐‹ ๐ข๐ง๐ญ๐จ ๐จ๐Ÿ๐Ÿ-๐ฉ๐จ๐ฅ๐ข๐œ๐ฒ โ€” even if they share the same weights!

๐Ÿ“‰ย Blog:
Fan Zhouโœˆ๏ธICLR2025 (@fazhou_998) 's Twitter Profile Photo

1. npx @โ€‹qwen-code/[email protected] 2. get 2000 free calls/day via Qwen Chat quick math: let's suppose avg agentic interaction โ‰ˆ 32k context 2000 ร— 32k โ‰ˆ 64 million tokens/day

Tianbao Xie (@tianbaox) 's Twitter Profile Photo

๐Ÿš€ OSWorld gets a major upgrade! OSWorld-Verified: 15 months community feedback โ†’ 300+ fixes (ambiguity, gradersโ€ฆ), 50x faster eval through AWS parallelization More apple-to-apple comparison for reliable CUA evaluation โœจ ๐Ÿ‘‡xlang.ai/blog/osworld-vโ€ฆ

ๆœบๅ™จไน‹ๅฟƒ JIQIZHIXIN (@synced_global) 's Twitter Profile Photo

Wow, this is really cool! This reserach answers this question: what if your computer-use AI was not a black box? OpenCUA: Open Foundations for Computer-Use Agents Researchers from HKU, Moonshot AI, and others present OpenCUAโ€”a fully open-source framework for building and

Wow, this is really cool! This reserach answers this question: what if your computer-use AI was not a black box?

OpenCUA: Open Foundations for Computer-Use Agents

Researchers from HKU, Moonshot AI, and others present OpenCUAโ€”a fully open-source framework for building and