Hussein Mozannar(@HsseinMzannar) 's Twitter Profileg
Hussein Mozannar

@HsseinMzannar

PhD @mitidss working on Human-AI Interaction 🇱🇧

ID:451752920

linkhttps://husseinmozannar.github.io/ calendar_today31-12-2011 23:51:47

224 Tweets

962 Followers

986 Following

Jacy Reese Anthis(@jacyanthis) 's Twitter Profile Photo

Hiroshi Ishii 石井 裕 Elizabeth Churchill R. Lisa Huang John Chen Litao Yan Standing room only for Hussein Mozannar et al. Microsoft Research with another Copilot paper! They build a 'CUPS' framework for labeling how AI-assisted coding time is spent. There is clearly a ton of room for making coding even more productive with only current models!

account_circle
Jacy Reese Anthis(@jacyanthis) 's Twitter Profile Photo

ece8bb Michael Bernstein Stephanie Bell Su Lin Blodgett Do you wish we had evals for LLM performance with real humans instead of just superficial, ephemeral benchmarks? Hussein Mozannar presents RealHumanEval, a platform with a LeetCode-style IDE to evaluate in vivo. They show benchmark %s can be way off from real performance.

@d19fe @msbernst @the_sbell @sulin_blodgett Do you wish we had evals for LLM performance with real humans instead of just superficial, ephemeral benchmarks? @HsseinMzannar presents RealHumanEval, a platform with a LeetCode-style IDE to evaluate in vivo. They show benchmark %s can be way off from real performance. #CHI2024
account_circle
Hussein Mozannar(@HsseinMzannar) 's Twitter Profile Photo

Presenting at :
1) RealHumanEval - our platform for human evaluation of LLMs for code, at TREW workshop arxiv.org/abs/2404.02806
2) Reading Between The Lines: Modeling AI-Assisted Programming (Honorable Mention Award) - Tuesday 2:45pm in 324 arxiv.org/abs/2210.14306

account_circle
Saleema Amershi(@SaleemaAmershi) 's Twitter Profile Photo

Feeling some FOMO that I won't be attending this year. For those who are, be sure to say👋 to the fabulous Gagan Bansal , Adam Fourney (hci.social/@adam), and the newest member of the at AI Frontiers, Hussein Mozannar! Don't forget to ask them about opportunities on our team👇

account_circle
Elena Glassman(@roboticwrestler) 's Twitter Profile Photo

Terrrrrific work, Dr. Hussein Mozannar! Hussein Mozannar It was an honor to serve on your committee, and I can't wait to see what you do next!

account_circle
Hussein Mozannar(@HsseinMzannar) 's Twitter Profile Photo

I will be defending my PhD thesis, Training Human-AI Teams, on April 25th at MIT and on Zoom! Please DM me or email for a link! -- (image generated after an hour of prompting DALL·E)

I will be defending my PhD thesis, Training Human-AI Teams, on April 25th at MIT and on Zoom! Please DM me or email for a link! -- (image generated after an hour of prompting DALL·E)
account_circle
Rami Awar(@iamramiawar) 's Twitter Profile Photo

Sneak peak!

Just recorded a demo
youtu.be/3sKIoVp8QRw

Some alpha users found a ton of bugs, so beta release is a bit delayed till next week at least.

You can still sign up @ dataline.app!

It's already open source but not ready for full public release yet 😁

account_circle
Arvind Satyanarayan(@arvindsatya1) 's Twitter Profile Photo

Elena Glassman Hussein Mozannar Minsuk Chang This is such a great thread!! Two classic readings to understand what “theory” means for HCI are: link.springer.com/book/10.1007/9… and dl.acm.org/doi/10.1145/34…

account_circle
Zanë (zbucinca@hci.social)(@ZanaBucinca) 's Twitter Profile Photo

Beyond the quality of our decisions, how will AI assistance affect us -- our growth and improvement, enjoyment, collaboration, or our agency in the workplace? The current design of AI assistance does not consider human-centric objectives; we need methods to account for them.

Beyond the quality of our decisions, how will AI assistance affect us -- our growth and improvement, enjoyment, collaboration, or our agency in the workplace? The current design of AI assistance does not consider human-centric objectives; we need methods to account for them.
account_circle
Hussein Mozannar(@HsseinMzannar) 's Twitter Profile Photo

Our paper on adapting Copilot from human feedback is at !
One of the secret sauces to making GitHub Copilot work is knowing when to show suggestions, our paper presents a motivated approach (CDHF) to achieve this using feedback data
arxiv.org/pdf/2306.04930…

account_circle
Hussein Mozannar(@HsseinMzannar) 's Twitter Profile Photo

We show in our work that LLM augmentations can improve clinical note readability! I was mostly involved with quantitative evaluation, but there are fascinating insights about LLM errors and qualitative insights in interviews with breast cancer survivors arxiv.org/abs/2401.09637

We show in our work that LLM augmentations can improve clinical note readability! I was mostly involved with quantitative evaluation, but there are fascinating insights about LLM errors and qualitative insights in interviews with breast cancer survivors arxiv.org/abs/2401.09637
account_circle
Matt Groh(@mattgroh) 's Twitter Profile Photo

👩‍⚕️💻 Our physician-machine partnerships paper is out in Nature Medicine today!

Experiment w/ 1000+ dermatologists & PCPs shows how AI assistance can boost diagnostic accuracy but reveals limits to addressing physician bias w/ fair classifiers

nature.com/articles/s4159…

Thread 👇

account_circle
Massachusetts Institute of Technology (MIT)(@MIT) 's Twitter Profile Photo

A new onboarding technique can help workers collaborate more effectively with AI assistants. The process finds situations where the human trusts the AI either too much or too little, then develops rules about them and creates training exercises. mitsha.re/cKP350Qqw2j

A new onboarding technique can help workers collaborate more effectively with AI assistants. The process finds situations where the human trusts the AI either too much or too little, then develops rules about them and creates training exercises. mitsha.re/cKP350Qqw2j
account_circle