Rene Heuser (@reneheuser) 's Twitter Profile
Rene Heuser

@reneheuser

from Munich | loving Dogs & Cats | reading, talking, writing about Games, Tech, AI & Science | personal opinion

ID: 327315696

calendar_today01-07-2011 10:58:59

1,1K Tweet

545 Takipçi

495 Takip Edilen

Mingqian Zheng (@elisazmq_zheng) 's Twitter Profile Photo

🎙️ What if the way we prompt LLMs might actually hold it back? 🚨 Assigning personas like "helpful assistant" in system prompts might *not* be as helpful as we think! ✨ Check out our work accepted to Findings of EMNLP 2025 ✨ 📜 arxiv.org/abs/2311.10054 🧵 [1/7]

🎙️ What if the way we prompt LLMs might actually hold it back?
🚨 Assigning personas like "helpful assistant" in system prompts might *not* be as helpful as we think!
✨ Check out our work accepted to Findings of <a href="/emnlpmeeting/">EMNLP 2025</a> ✨

📜 arxiv.org/abs/2311.10054
🧵 [1/7]
Anthropic (@anthropicai) 's Twitter Profile Photo

Claude can now write and run code. We've added a new analysis tool. The tool helps Claude respond with mathematically precise and reproducible answers. You can then create interactive data visualizations with Artifacts. Enable the feature preview: claude.ai/new?fp=1.

Thomas Dohmke (@ashtom) 's Twitter Profile Photo

Also: it’s a big fucking deal that Python is now the number 1 language on GitHub. 🐍 AI creation is booming — GitHub is the world’s largest creator network for the age of AI. github.blog/news-insights/…

Also: it’s a big fucking deal that Python is now the number 1 language on GitHub. 🐍 AI creation is booming — GitHub is the world’s largest creator network for the age of AI. 

github.blog/news-insights/…
The Daily Show (@thedailyshow) 's Twitter Profile Photo

Jon Stewart on election night: "We're all going to have to wake up tomorrow morning and work like hell to move the world to the place that we prefer it to be." #DailyShow

Logan Kilpatrick (@officiallogank) 's Twitter Profile Photo

Announcing Gemini 2.0 Flash, a new multimodal agentic model (initially experimental), with our best-ever results on capabilities and benchmarks: developers.googleblog.com/en/the-next-ch…

Tim Cook (@tim_cook) 's Twitter Profile Photo

Excited to share that “Silo” will return for a third AND fourth season! We’re thrilled to support the imagination and inspiration out of the UK as they continue to create world-class films and series.

Arvind Narayanan (@random_walker) 's Twitter Profile Photo

New AI Snake Oil essay: Last month the AI industry's narrative on scaling suddenly flipped. This has left people outside AI confused. What changed? Is AI capability progress slowing? We look at the evidence. By me, Benedikt Stroebl and Sayash Kapoor. aisnakeoil.com/p/is-ai-progre… -----

New AI Snake Oil essay: Last month the AI industry's narrative on scaling suddenly flipped. This has left people outside AI confused. What changed? Is AI capability progress slowing? We look at the evidence.
By me, <a href="/benediktstroebl/">Benedikt Stroebl</a> and <a href="/sayashk/">Sayash Kapoor</a>.
aisnakeoil.com/p/is-ai-progre…
-----
ARC Prize (@arcprize) 's Twitter Profile Photo

New verified ARC-AGI-Pub SoTA! OpenAI o3 has scored a breakthrough 75.7% on the ARC-AGI Semi-Private Evaluation. And a high-compute o3 configuration (not eligible for ARC-AGI-Pub) scored 87.5% on the Semi-Private Eval. 1/4

New verified ARC-AGI-Pub SoTA!

<a href="/OpenAI/">OpenAI</a> o3 has scored a breakthrough 75.7% on the ARC-AGI Semi-Private Evaluation.

And a high-compute o3 configuration (not eligible for ARC-AGI-Pub) scored 87.5% on the Semi-Private Eval.

1/4