Xuezhe Ma (Max) (@maxma1987)'s Twitter Profile
Xuezhe Ma (Max)

@maxma1987

Research Lead @USC_ISI and Research Assistant Professor @CSatUSC
PhD at CMU ML/NLP @LTIatCMU @CarnegieMellon

ID: 3245312065

Website: https://xuezhemax.github.io/
Joined: 14-06-2015 15:27:37

248 Tweets

1.1K Followers

399 Following

LLM360 (@llm360)'s Twitter Profile Photo

📢📢
We are releasing TxT360: a globally deduplicated dataset for LLM pretraining
🌐 99 Common Crawls
📘 14 Curated Sources
👨‍🍳 recipe to easily adjust data weighting and train the most performant models

Dataset:
huggingface.co/datasets/LLM36…

Blog:
huggingface.co/spaces/LLM360/…

Jiao Sun (@sunjiao123sun_)'s Twitter Profile Photo

Someone confronted the speaker on the spot, and they said: "Maybe there is one, maybe they are common, who knows what. I hope it was an outlier." Even this explanation is full of implicit racial bias. See the full conv: dropbox.com/scl/fi/2dtji0z…

Xin Eric Wang @ ICLR 2025 (@xwang_lk)'s Twitter Profile Photo

It is just so sad that the #NeurIPS2024 main conference ended with such a racist remark by a faculty member when talking about ethics. How ironic! I also want to commend the Chinese student who spoke up right on the spot. She was respectful, decent, and courageous. Her response was

Felix Juefei Xu (@felixudr)'s Twitter Profile Photo

Today’s event was a glaring display of ignorance and deep-seated bias, proving that academic achievement doesn’t equate to decency or awareness. I feel for those who have to tolerate and work with her every day. Massachusetts Institute of Technology (MIT) and NeurIPS Conference, this is unacceptable—you can and must do

Furong Huang (@furongh)'s Twitter Profile Photo

I saw a slide circulating on social media last night while working on a deadline. I didn’t comment immediately because I wanted to understand the full context before speaking. After learning more, I feel compelled to address what I witnessed during an invited talk at NeurIPS 2024

MMitchell (@mmitchell_ai)'s Twitter Profile Photo

At #NeurIPS2024, the keynote speaker perpetuated explicit racist stereotypes against Chinese students. Generalizations against a community subject to discrimination, even as an "example", further provoke discrimination. Below, video of an audience member's perfect response.

Roxana Daneshjou MD/PhD (@roxanadaneshjou)'s Twitter Profile Photo

I'm not at NeurIPS this year, but I have seen the deeply offensive and racist slide shown by a keynote (not reposting here). It's completely unacceptable, and I stand with my Chinese colleagues.

Qingyun Wu (@qingyun_wu)'s Twitter Profile Photo

Just can't believe this happened at NeurIPS and, ironically, came from an invited keynote speaker talking about ethics! Removing racial bias from humans is so much harder than removing it from LLMs. So proud of the Chinese student who spoke up on the spot, pointing out the racist

Hongyi Wang (@hongyiwang10)'s Twitter Profile Photo

I have three Ph.D. student openings in my research group at Rutgers Computer Science Department starting in Fall 2025. If you are interested in working with me on efficient algorithms and systems for LLMs, foundation models, and AI4Science, please apply at: grad.rutgers.edu/academics/prog… The deadline is

LLM360 (@llm360)'s Twitter Profile Photo

The LLM360 team is hiring! 🚀 We welcome researchers and ML engineers to join our team to push the frontier of AI research. You'll be deeply involved and gain firsthand experience in large model training. Join us in our mission of making AGI open to all. For more information,

Wenhu Chen (@wenhuchen)'s Twitter Profile Photo

I spent the weekend reading some recent great math+reasoning papers:
1. AceMath (arxiv.org/abs/2412.15084)
2. rStar-Math (arxiv.org/pdf/2501.04519)
3. PRIME (arxiv.org/abs/2412.01981)
Here are some of my naive thoughts! They could be wrong.

All of these papers are showing possible

LLM360 (@llm360)'s Twitter Profile Photo

We are releasing our code base for TxT360, a globally deduplicated dataset for LLM pretraining: 🌐 github.com/LLM360/TxT360 (You can access the full dataset here: huggingface.co/datasets/LLM36…) We are happy to see the open source community filled with great recent dataset releases,

Nathan Lambert (@natolambert)'s Twitter Profile Photo

LLM360 gets way less recognition relative to the quality of their totally open outputs in the last year+. They dropped a 60+ page technical report last week and I don't know if I saw anyone talking about it. Along with OLMo, it's the other up-to-date open-source LM.

Violet Peng (@violetnpeng)'s Twitter Profile Photo

Tired of complaining about *CL reviews on social media and conferences? Now’s your chance to make a real difference! Your feedback is invaluable in improving the process. Take a few minutes to share your thoughts—every response counts!

LLM360 (@llm360)'s Twitter Profile Photo

Proudly present MegaMath, the largest open-source math reasoning pretraining corpus—371B tokens of high-quality mathematical web, code, and synthetic data, designed to build the data foundation for next-generation math-proficient LLMs like o1 and R1. 🧵👇 #LLM #OpenSource

LLM360 (@llm360)'s Twitter Profile Photo

📢📢 TxT360 has been updated to v1.1: 🌟 BestofWeb: high-quality doc set from the web ❓ QA: Large Scale Synthetic Q&A dataset 📖 Wiki_extended: extended wiki articles via links 🌍 Europarl Aligned: reformatted long aligned corpus huggingface.co/datasets/LLM36… #AIResearch

Thinking Machines (@thinkymachines)'s Twitter Profile Photo

Efficient training of neural networks is difficult. Our second Connectionism post introduces Modular Manifolds, a theoretical step toward more stable and performant training by co-designing neural net optimizers with manifold constraints on weight matrices.