Jesper N. Wulff (@jesper_wulff) 's Twitter Profile
Jesper N. Wulff

@jesper_wulff

Professor @AarhusUni doing research on organizational research methods and teaching deep neural networks in our Msc. BI program.

ID: 406927301

linkhttps://sites.google.com/view/jesperwulff calendar_today07-11-2011 11:42:21

893 Tweet

227 Followers

352 Following

Deedy (@deedydas) 's Twitter Profile Photo

LLMs are far worse at competitive programming than we thought. Every one scored 0% on Hard problems. LiveCodeBench-Pro is a new benchmark with 584 always updating problems from IOI, ICPC and Codeforces. What's most interesting is the categories they perform really poorly on:

LLMs are far worse at competitive programming than we thought. Every one scored 0% on Hard problems.

LiveCodeBench-Pro is a new benchmark with 584 always updating problems from IOI, ICPC and Codeforces.

What's most interesting is the categories they perform really poorly on:
Phil (@nonrealbrandon) 's Twitter Profile Photo

Deedy It's unfair to expect LLMs to perform well on always changing benchmarks. How are they to overfit on the data if it keeps changing?

METR (@metr_evals) 's Twitter Profile Photo

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers.

The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.
ℏεsam (@hesamation) 's Twitter Profile Photo

"I use AI in a separate window. I don't enjoy Cursor or Windsurf, I can literally feel competence draining out of my fingers." DHH, the legendary programmer and creator of Ruby on Rails has the most beautiful and philosophical idea about what AI takes away from programmers.

Arvind Narayanan (@random_walker) 's Twitter Profile Photo

Back in grad school, when I realized how the “marketplace of ideas” actually works, it felt like I’d found the cheat codes to a research career. Today, this is the most important stuff I teach students, more than anything related to the substance of our research. A quick

Mackenzie Lockhart (@lockhartm) 's Twitter Profile Photo

Excited that our (apoorva.lal, Yiqing Xu, Gary ziwen_Zu) paper won the Political Analysis' 2024 Editor's Choice award! It was really a lot of work (we started this in 2018!), so nice to see we've had some impact on the field. It's also open access. cambridge.org/core/journals/…

Ben Ansell (@benwansell) 's Twitter Profile Photo

Brutal analysis of ChatGPT5 from Gary Marcus. This was a big moment for OpenAI and so far a dud. Since US economy is largely being kept afloat by AI investment, this could be inflection point. Hold onto your hats. garymarcus.substack.com/p/gpt-5-overdu…

Daniël Lakens (@lakens) 's Twitter Profile Photo

Too often, I see people talk about a replication as if the first study has established something, and the replication study is a double-check. What people often fail to understand is that we do not do replication studies to *check* a finding, but to *establish* a finding. 1/x

Ernest Ryu (@ernestryu) 's Twitter Profile Photo

The proof is something an experienced PhD student could work out in a few hours. That GPT-5 can do it with just ~30 sec of human input is impressive and potentially very useful to the right user. However, GPT5 is by no means exceeding the capabilities of human experts. (9/9)

Daniël Lakens (@lakens) 's Twitter Profile Photo

If you are preparing your bachelor statistics course and would like to add optional material for students to better understand statistics on a conceptual level (see topics in the screenshot) my free textbook provides a state of the art overview. lakens.github.io/statistical_in…

If you are preparing your bachelor statistics course and would like to add optional material for students to better understand statistics on a conceptual level (see topics in the screenshot) my free textbook provides a state of the art overview. lakens.github.io/statistical_in…
Gary Marcus (@garymarcus) 's Twitter Profile Photo

GenAI models “often match patterns instead of truly reasoning” Say it to yourself over and over til you full understand it. The amount of confirmation that is coming this year for my basic view is insane.

Gary Marcus (@garymarcus) 's Twitter Profile Photo

One minute Matt Turck is telling me that hallucinations are “a largely fixed problem”; the next minute ChatGPT 5 is telling a friend that Trump “is not in office”. 🤔

One minute <a href="/mattturck/">Matt Turck</a> is telling me that hallucinations are “a largely fixed problem”; the next minute ChatGPT 5 is telling a friend that Trump “is not in office”.

🤔
Victor (@victor_explore) 's Twitter Profile Photo

This comprehensive guide explains how Large Language Models work from scratch - assuming you only know how to add and multiply numbers. It covers everything from simple neural networks to the full Transformer architecture, stripping away all the jargon and representing

This comprehensive guide explains how Large Language Models work from scratch - assuming you only know how to add and multiply numbers.

It covers everything from simple neural networks to the full Transformer architecture, stripping away all the jargon and representing