Manuel Mager (Turatemai) (@pywirrarika)'s Twitter Profile
Manuel Mager (Turatemai)

@pywirrarika

Applied Scientist | Amazon AWS
Posts are my own opinion.

ID: 223694721

Link: http://code.kiutz.com | Joined: 07-12-2010 02:33:19

3.3K Tweets

939 Followers

1.1K Following

Unsloth AI (@unslothai):

You can now run Qwen3-235B-A22B-2507 with our Dynamic 2-bit GGUFs!

The full 250GB model gets reduced to just 88GB (-65% size).

Achieve >5 tokens/s on 89GB unified memory or 80GB RAM + 8GB VRAM.

GGUFs: huggingface.co/unsloth/Qwen3-…
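
For readers who want to try a GGUF like this locally, here is a minimal sketch using llama-cpp-python. The file name and offload settings are assumptions (the Hugging Face link above is truncated), so check the Unsloth model card for the actual file names:

```python
# Hedged sketch: one way to run a dynamic 2-bit GGUF locally via llama-cpp-python.
# The model_path and settings below are assumptions, not Unsloth's instructions.
from llama_cpp import Llama

llm = Llama(
    model_path="Qwen3-235B-A22B-2507-Q2_K.gguf",  # assumed local file name
    n_ctx=8192,        # context window; raise if you have the memory
    n_gpu_layers=20,   # offload a slice of layers to an 8 GB GPU, rest in RAM
)

out = llm("Explain why 2-bit quantization can still work well:", max_tokens=128)
print(out["choices"][0]["text"])
```
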
UW NLP (@uwnlp):

Fascinating work bridging cognitive science + NLP! PrefPalette decomposes preferences into interpretable attributes (humor, empathy, formality) with dynamic weighting. 46.6% better than GPT-4o with explainability. This opens new directions in alignment and personalization.
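
The tweet compresses the method a lot; a toy sketch of the decomposition idea it describes (score interpretable attributes, then combine them with context-dependent weights) might look like the following. Every scorer and number here is an illustrative placeholder, not PrefPalette's actual model:

```python
# Toy sketch of attribute-decomposed preference scoring in the spirit of the
# tweet above. Scorers and weights are placeholders, not the paper's models.

ATTRIBUTES = ("humor", "empathy", "formality")

def attribute_scores(response: str) -> dict[str, float]:
    # Stand-ins; the real system would use learned per-attribute predictors.
    return {"humor": 0.1, "empathy": 0.8, "formality": 0.4}

def preference_score(response: str, weights: dict[str, float]) -> float:
    # Preference = weighted sum of interpretable attribute scores.
    scores = attribute_scores(response)
    return sum(weights[a] * scores[a] for a in ATTRIBUTES)

# In an emotional-support context, a dynamic weighter might upweight empathy:
support_weights = {"humor": 0.1, "empathy": 0.7, "formality": 0.2}
print(preference_score("I'm so sorry to hear that...", support_weights))
```
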

Mihir Prabhudesai (@mihirp98):

🚨 The era of infinite internet data is ending, so we ask:

👉 What’s the right generative modelling objective when data—not compute—is the bottleneck?

TL;DR:

▶️ Compute-constrained? Train autoregressive models

▶️ Data-constrained? Train diffusion models

Get ready for 🤿  1/n
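
One way to see the contrast the thread draws: an autoregressive objective sees each sequence in a single left-to-right factorization, while a masked-diffusion objective revisits the same sequence under many random corruption levels, which is one intuition for why it can extract more from a fixed dataset. A simplified sketch of the two losses (not the paper's code; shapes and the masking scheme are assumptions):

```python
# Simplified contrast of the two objectives the thread compares.
import torch
import torch.nn.functional as F

def autoregressive_loss(logits, tokens):
    """Next-token prediction: position t predicts token t+1."""
    return F.cross_entropy(logits[:, :-1].reshape(-1, logits.size(-1)),
                           tokens[:, 1:].reshape(-1))

def masked_diffusion_loss(model, tokens, mask_id):
    """Discrete-diffusion style: hide a random fraction, predict the originals.
    Each sequence is seen at many corruption levels, reusing data harder."""
    t = torch.rand(tokens.size(0), 1) * 0.7 + 0.3   # corruption level per sample
    mask = torch.rand(tokens.shape) < t             # which tokens to hide
    corrupted = torch.where(mask, torch.full_like(tokens, mask_id), tokens)
    logits = model(corrupted)                       # (B, T, V)
    return F.cross_entropy(logits[mask], tokens[mask])

# Toy demo with a random "model" so the sketch runs end to end.
torch.manual_seed(0)
V, B, T = 11, 2, 8
tokens = torch.randint(0, V - 1, (B, T))
model = lambda x: torch.randn(x.size(0), x.size(1), V)
print(autoregressive_loss(model(tokens), tokens),
      masked_diffusion_loss(model, tokens, mask_id=V - 1))
```
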
Qwen (@alibaba_qwen):

🚀 We’re excited to introduce Qwen3-235B-A22B-Thinking-2507 — our most advanced reasoning model yet!

Over the past 3 months, we’ve significantly scaled and enhanced the thinking capability of Qwen3, achieving:
✅ Improved performance in logical reasoning, math, science & coding
Chujie Zheng (@chujiezheng):

Proud to introduce Group Sequence Policy Optimization (GSPO), our stable, efficient, and performant RL algorithm that powers the large-scale RL training of the latest Qwen3 models (Instruct, Coder, Thinking) 🚀

📄 huggingface.co/papers/2507.18…
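
GSPO's key move, as the paper describes it, is to define the clipped importance ratio at the sequence level (length-normalized), rather than per token as in GRPO. A simplified sketch under that reading; equal-length responses and the exact normalization details are assumptions:

```python
# Simplified sketch of GSPO's core idea: a sequence-level, length-normalized
# importance ratio, clipped PPO-style, with group-relative advantages.
import torch

def gspo_loss(logp_new, logp_old, rewards, eps=0.2):
    """logp_new/logp_old: (G, T) per-token log-probs for G sampled responses;
    rewards: (G,) scalar rewards for the same group."""
    # Length-normalized sequence ratio: exp(mean_t [log pi_new - log pi_old]).
    seq_ratio = torch.exp((logp_new - logp_old).mean(dim=-1))
    # Group-relative advantage, normalized within the group (as in GRPO).
    adv = (rewards - rewards.mean()) / (rewards.std() + 1e-6)
    clipped = seq_ratio.clamp(1 - eps, 1 + eps)
    # Clipped policy-gradient surrogate, one ratio per *sequence*.
    return -torch.minimum(seq_ratio * adv, clipped * adv).mean()

# Toy usage with random numbers, just to show the shapes.
G, T = 4, 16
print(gspo_loss(torch.randn(G, T) * 0.01, torch.randn(G, T) * 0.01,
                torch.tensor([1.0, 0.0, 0.5, 0.2])))
```
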
Yoshinari Fujinuma (@akkikiki):

Stanford's Mixture of Experts (MoE) lecture is highly recommended (I've rewatched it about five times) / Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 4:... youtu.be/LPv1KfUXLCo?si… via YouTube

SomosNLP (@somosnlp_):

📢🧵 Thread | Celebrating Ibero-American NLP at #ACL2025! 🇪🇸🇵🇹🇲🇽🇨🇴🇧🇷🇦🇷 ... 🌎

The SomosNLP (@SomosNLP_) community is showing up strong at ACL 2025!

We want to highlight all the papers from our vibrant community 👇

#NLProc #NLP #IberoAmerica
Muhammad AbdulMageed (@mageed):

What if the future of AI was fundamentally inequitable for an entire continent? 

This is the critical question we pose in our latest work (#ACL2025). We undertake a comprehensive empirical evaluation of leading LLMs on Sahara, our extensive benchmark that we collect using mostly
David Ifeoluwa Adelani 🇳🇬 (@davlanade):

Today, we are presenting 3 papers at #ACL2025:
1) Injongo: multicultural intent detection for African languages. Room 1.15-16 (Multilingualism) @ 14:00
2) BRIGHTER: emotion classification (Nedjma Ousidhoum نجمة أوسيدهم). Hall B (Resources) @ 14:00
3) Global MMLU (Shivalika Singh, Sara Hooker). Poster @ 10:30

Qwen (@alibaba_qwen):

🦥 Qwen3-Coder-Flash: Qwen3-Coder-30B-A3B-Instruct
💚 Just lightning-fast, accurate code generation.
✅ Native 256K context (supports up to 1M tokens with YaRN)
✅ Optimized for platforms like Qwen Code, Cline, Roo Code, Kilo Code, etc.
✅ Seamless function calling & agent
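
For the YaRN-extended 1M-token path mentioned above, here is a hedged sketch of how such context extension is typically enabled with Hugging Face transformers. The repo id is inferred from the model name in the tweet, and the factor/field values are assumptions patterned on how Qwen model cards usually document YaRN, so verify against the official card:

```python
# Hedged sketch: enabling YaRN context extension via Hugging Face transformers.
# Values below are assumptions (256K native * 4 ≈ 1M); check the model card.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen3-Coder-30B-A3B-Instruct"  # inferred repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    rope_scaling={                 # overrides the config's RoPE settings
        "rope_type": "yarn",
        "factor": 4.0,             # assumed: 262144 native positions * 4 ≈ 1M
        "original_max_position_embeddings": 262144,
    },
    torch_dtype="auto",
    device_map="auto",
)
```
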
Yoshua Bengio (@yoshua_bengio):

Pleased to see this new Alignment Project, where I serve as an expert advisor, launched by the UK's AI Security Institute and supported by the Canadian AI Safety Institute and many others. I encourage my fellow researchers to apply for funding, compute and support from int’l experts.

Thomas Wolf (@thom_wolf):

Long-form AI reading is back and we’ve just dropped the ultimate summer read.

Inspired by the likes of Stripe Press, we’re proud to announce the first book from HF Press: a carefully crafted, book-length PDF edition of the Ultra-Scale Playbook.

Over 200 dense pages to learn the
Shruti (@heyshrutimishra):

The AI Industry Made a $57 Billion Mistake and No One’s Talking About It.

While GPT-5 headlines kept you distracted...

NVIDIA quietly made a bold claim:
→ Small Language Models (SLMs) are the future of AI agents

Cheaper, faster, and just as capable for 80% of real-world tasks.
Sebastian Raschka (@rasbt):

So, I did some coding this week...
- Qwen3 Coder Flash (30B-A3B)
- Mixture-of-Experts setup with 128 experts, 8 active per token
- In pure PyTorch (optimized for human readability)
- In a standalone Jupyter notebook
- Runs on a single A100
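
As a companion to that notebook description, here is a minimal, readability-first sketch of the routing pattern it mentions: a router picks the top 8 of 128 experts per token and mixes their outputs. Dimensions and layer shapes are illustrative, not Qwen3's actual sizes:

```python
# Minimal sketch of top-k expert routing: 128 experts, 8 active per token.
import torch
import torch.nn as nn

class TinyMoE(nn.Module):
    def __init__(self, d_model=64, n_experts=128, k=8):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)   # scores every expert
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)
        )
        self.k = k

    def forward(self, x):                        # x: (n_tokens, d_model)
        weights, idx = self.router(x).topk(self.k, dim=-1)
        weights = torch.softmax(weights, dim=-1)  # mixing weights over top-8
        out = torch.zeros_like(x)
        for slot in range(self.k):                # plain loops for readability
            for e in idx[:, slot].unique():
                sel = idx[:, slot] == e           # tokens routed to expert e
                out[sel] += weights[sel, slot].unsqueeze(-1) * self.experts[e](x[sel])
        return out

print(TinyMoE()(torch.randn(5, 64)).shape)        # torch.Size([5, 64])
```
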