Christian S. Perone (@tarantulae) Twitter Tweets • TwiCopy

Christian S. Perone

@tarantulae

Machine Learning, Computer Science, Math. Computer Science (UPF Brazil) 🇧🇷🧉 / Machine Learning (@polymtl/@UMontreal). Working with Autonomous Vehicles in UK

ID: 20831552

Link: http://blog.christianperone.com

Joined: 14-02-2009 04:38:15

9,9K Tweets

7,7K Followers

1,1K Following

Thang Luong

@lmthang

5 months ago

Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆, solving five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this

Likes: 1,1K · Replies: 75 · Reposts: 224

Christian S. Perone

@tarantulae

5 months ago

"Optimizers Qualitatively Alter Solutions And We Should Leverage This" (arxiv.org/abs/2507.12224), very nice to see this direction of understanding what different optimizers bring in terms of solution properties.

Likes: 2 · Replies: 0 · Reposts: 0

Qwen

@alibaba_qwen

5 months ago

🚀 We’re excited to introduce Qwen3-235B-A22B-Thinking-2507 — our most advanced reasoning model yet! Over the past 3 months, we’ve significantly scaled and enhanced the thinking capability of Qwen3, achieving: ✅ Improved performance in logical reasoning, math, science & coding

Likes: 3,3K · Replies: 162 · Reposts: 504

Chujie Zheng

@chujiezheng

5 months ago

Proud to introduce Group Sequence Policy Optimization (GSPO), our stable, efficient, and performant RL algorithm that powers the large-scale RL training of the latest Qwen3 models (Instruct, Coder, Thinking) 🚀 📄 huggingface.co/papers/2507.18…

Likes: 1,1K · Replies: 18 · Reposts: 143

Christian S. Perone

@tarantulae

4 months ago

. Pablo Galindo Salgado 😂

Likes: 5 · Replies: 1 · Reposts: 0

Christian S. Perone

@tarantulae

4 months ago

I will be preparing some popcorn for what will be the new name of "context engineering" that was renamed a few days ago 😅

Likes: 3 · Replies: 0 · Reposts: 0

Mathieu

@miniapeur

4 months ago

Likes: 9,9K · Replies: 40 · Reposts: 866

Chi Jin

@chijinml

4 months ago

Many friends still ask me about AI for IMO, formal vs informal math. Some quick thoughts: IMO results: GDM and OpenAI achieved gold using informal (natural language) methods. ByteDance and AlphaProof (last year) got gold/silver using formal methods (Lean + specialized geometry

Likes: 372 · Replies: 13 · Reposts: 37

Christian S. Perone

@tarantulae

4 months ago

I think the next few years will show that Yann LeCun was right on many counts. If we look at how discovery in science is unfolding today with world models (wm) + evolutionary methods on top of them, it is very similar to MPC as well.

Likes: 0 · Replies: 0 · Reposts: 0

Christian S. Perone

@tarantulae

4 months ago

I feel I could build an entire benchmark dataset of ONNX errors that would be harder than the Humanity's Last Exam dataset for us to evaluate AGI

Likes: 1 · Replies: 0 · Reposts: 0

vik

@vikhyatk

4 months ago

interesting swiglu variant from the gpt-oss model: clamps inputs and adds a skip connection

Likes: 182 · Replies: 12 · Reposts: 8

Lucas Beyer (bl16)

@giffmana

4 months ago

Amazing! Truly open review, through which we all gained more insights, i love it! Result: in multi epoch setting, making AR learn multiple orderings ~closes the gap to diffusion, explaining much of the difference. How the truly open review happened (from my vague memory): Mihir

Likes: 594 · Replies: 16 · Reposts: 36

Aleksander Holynski

@holynski_

4 months ago

Something fun we discovered: you can use #Genie3 to step into and explore your favorite paintings. Here's a short visit to Edward Hopper's "Nighthawks".

Likes: 7,7K · Replies: 410 · Reposts: 851

Lucas Beyer (bl16)

@giffmana

4 months ago

Oh wow, this VLM benchmark is pure evil, and I love it! "Vision Language Models are Biased" by An Vo, taesiri, Anh Totti Nguyen, et al. Also really good idea to have one-click copy-paste of images and prompts, makes trying it super easy.

Likes: 937 · Replies: 32 · Reposts: 75

Christian S. Perone

@tarantulae

4 months ago

I think the main question is: do we really need two separate AIs, one for world modeling and another for agency? I think these two concepts will merge very soon.

Likes: 1 · Replies: 0 · Reposts: 0

AI at Meta

@aiatmeta

4 months ago

Introducing DINOv3: a state-of-the-art computer vision model trained with self-supervised learning (SSL) that produces powerful, high-resolution image features. For the first time, a single frozen vision backbone outperforms specialized solutions on multiple long-standing dense