Łukasz Stafiniak (@lukstafi) Twitter Tweets • TwiCopy

机器之心 JIQIZHIXIN

8 months ago

🚨 41 years in the making — and Dijkstra is no longer unbeatable. A Tsinghua, Stanford, and MPI for Informatics team has achieved the first deterministic algorithm to break the O(m + n log n) bound for single-source shortest paths in directed graphs with real non-negative

Shai Shalev-Shwartz

@shai_s_shwartz

8 months ago

Are frontier AI models really capable of “PhD-level” reasoning? To answer this question, we introduce FormulaOne, a new reasoning benchmark of expert-level Dynamic Programming problems. We have curated a benchmark consisting of three tiers, in increasing complexity, which we call

Łukasz Stafiniak

@lukstafi

8 months ago

There are psychological well-being benefits to pair programming with frontier LLMs that would not be offered by perfect (meaning, not making mistakes) but partial (meaning, not AI-complete) GOFAI systems.

SpaceX

@spacex

8 months ago

Watch Starship's tenth flight test → spacex.com/launches/stars… x.com/i/broadcasts/1…

Łukasz Stafiniak

@lukstafi

8 months ago

GPT-2, Opus 4.1 na-palcach.blogspot.com/search/label/G…

Greg Brockman

@gdb

8 months ago

super cool to compare the outputs from GPT-1 through GPT-5, given the same prompt: progress.openai.com

Łukasz Stafiniak

@lukstafi

7 months ago

ppx_minidebug might reach v3 before it reaches its 3rd user. #ocaml

Wes

@wmorrill3

7 months ago

Ok hear me out - the sun keeps sending its unwanted radiation to America. This is unacceptable, we need to put a tariff on those solar rays. Let's build a wall that will absorb that unwanted light and turn it into electricity. But we need to make the wall horizontal and put one

Jay

@jayendra_ram

7 months ago

Since everyone is talking about RL Environments and GRPO now but no one knows how it works we thought it would be cool to make an explainer video + code you can run: This is an example of using GRPO to train Qwen 2.5 to play 2048 (code in thread) 🧵:

Kevin Patrick Murphy

@sirbayes

7 months ago

I just finished reading this interesting book by Druv Pai, Sam Buchanan and colleagues. It's fairly "heavy" but provides a very satisfying theoretical explanation for many different empirical approaches currently used in "generative AI", such as denoising diffusion models,

Łukasz Stafiniak

@lukstafi

7 months ago

I'm currently having a Claude Code phase. I stopped using Cursor agents, but Cursor tab complete is by itself totally worth the price.

Elon Musk

@elonmusk

7 months ago

Starship

Łukasz Stafiniak

@lukstafi

7 months ago

tab-o: sorry, I blindly pressed autocomplete on this

Sebastian Raschka

@rasbt

7 months ago

Updated & turned my Big LLM Architecture Comparison article into a narrated video lecture. The 11 LLM architectures covered in this video: 1. DeepSeek V3/R1 2. OLMo 2 3. Gemma 3 4. Mistral Small 3.1 5. Llama 4 6. Qwen3 7. SmolLM3 8. Kimi 2 9. GPT-OSS 10. Grok 2.5 11. GLM-4.5

Łukasz Stafiniak

@lukstafi

7 months ago

My old university page is dead :-| Truth be told, it was an attack vector: an insecure wiki content management setup. But it's a bit sad to lose it.

FUN OCaml

@funocaml

7 months ago

Here's Nathan Taylor who'll be speaking at FUN OCaml 2025 - and it's already next Monday! ✨🐫

机器之心 JIQIZHIXIN

@synced_global

7 months ago

Huawei proposed Tree-OPO! They explore how MCTS trajectories can fuel Group Relative Policy Optimization (GRPO), enabling preference-based RL without value networks. By staging training with partially revealed rollouts, they create tree-structured reward signals that better

Łukasz Stafiniak

@lukstafi

7 months ago

Which life do you prefer: at the frontier, or out of distribution?
