Martin Pawelczyk (on Job Market) (@martinpawelczyk) 's Twitter Profile
Martin Pawelczyk (on Job Market)

@martinpawelczyk

Postdoc @Harvard. #AISafety. PhD from @uni_tue. MScs Stats @LSE @uni_edinburgh. Previously intern @JP_Morgan AI Research.

ID: 1083822529578520577

linkhttps://sites.google.com/view/martinpawelczyk/ calendar_today11-01-2019 20:26:42

251 Tweet

341 Takipรงi

415 Takip Edilen

Gjergji Kasneci (@gjergji_) 's Twitter Profile Photo

๐Ÿšจ Exciting #PhD opportunities in #DataScience and #ResponsibleAI TU Mรผnchen Munich Data Science Institute ๐ŸŽ“ Passionate about research in responsible AI and Data Science? Join our talented team for groundbreaking research and innovation #GenerativeAI #LLMs #AIRegulation portal.mytum.de/jobs/wissenschโ€ฆ

Michal Moshkovitz (@ml_theorist) 's Twitter Profile Photo

***New online XAI seminar*** Interested in the theory of Interpretable and Explainable AI? Want to connect with others who share your interests? Join us for a new seminar! First meeting: Thursday April 4 Website: tverven.github.io/tiai-seminar/ Organized together with Suraj Srinivas Tim van Erven

***New online XAI seminar***
Interested in the theory of Interpretable and Explainable AI? Want to connect with others who share your interests? Join us for a new seminar! 

First meeting: Thursday April 4

Website: tverven.github.io/tiai-seminar/
Organized together with <a href="/Suuraj/">Suraj Srinivas</a> <a href="/tverven/">Tim van Erven</a>
Sebastian Bordt (@sbordt) 's Twitter Profile Photo

Should we trust LLM evaluations on publicly available benchmarks?๐Ÿค” Our latest work studies the overfitting of few-shot learning with GPT-4. with Harsha Nori Vanessa Rodrigues Besmira Nushi ๐Ÿ’™๐Ÿ’› and Rich Caruana Paper: arxiv.org/abs/2404.06209 More details๐Ÿ‘‡ [1/N]

Should we trust LLM evaluations on publicly available benchmarks?๐Ÿค”

Our latest work studies the overfitting of few-shot learning with GPT-4.

with <a href="/HarshaNori/">Harsha Nori</a> Vanessa Rodrigues <a href="/besanushi/">Besmira Nushi ๐Ÿ’™๐Ÿ’›</a> and Rich Caruana 

Paper: arxiv.org/abs/2404.06209

More details๐Ÿ‘‡ [1/N]
๐™ท๐š’๐š–๐šŠ ๐™ป๐šŠ๐š”๐š”๐šŠ๐š›๐šŠ๐š“๐šž (@hima_lakkaraju) 's Twitter Profile Photo

๐Ÿ“ข Excited to share our latest pre-print on evaluating & enhancing the safety of medical LLMs We introduce med-safety-benchmark to assess the #safety of #medical #LLMs and find that state-of-the-art models violate principles of medical safety and ethics. arxiv.org/pdf/2403.03744

๐Ÿ“ข Excited to share our latest pre-print on evaluating &amp; enhancing the safety of medical LLMs 

We introduce med-safety-benchmark to assess the #safety of #medical #LLMs and find that state-of-the-art models violate principles of medical safety and ethics. arxiv.org/pdf/2403.03744
fly51fly (@fly51fly) 's Twitter Profile Photo

[LG] Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers arxiv.org/abs/2405.13536 - Common transformer architectures like BERT and GPT-2 structurally cannot represent additive models like linear models or generalized additive

[LG] Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers  
arxiv.org/abs/2405.13536      
- Common transformer architectures like BERT and GPT-2 structurally cannot represent additive models like linear models or generalized additive
Robert Lange (@roberttlange) 's Twitter Profile Photo

๐Ÿง‘โ€๐ŸŽจ Are Large Language Models Good Thieves? ๐Ÿฆน Breakthroughs are not 0-to-1 processes. E.g. Picasso and the โ€œinventionโ€ of cubism. Cubism was not a flash of thoughtโšก๏ธ but developed gradually & used various 'inspiration' thefts (Juan Gris' work and the surrealist movement. To me,

๐Ÿง‘โ€๐ŸŽจ Are Large Language Models Good Thieves? ๐Ÿฆน

Breakthroughs are not 0-to-1 processes. E.g. Picasso and the โ€œinventionโ€ of cubism. Cubism was not a flash of thoughtโšก๏ธ but developed gradually &amp; used various 'inspiration' thefts (Juan Gris' work and the surrealist movement.

To me,
Gautam Kamath (@thegautamkamath) 's Twitter Profile Photo

๐ŸงตNew paper: Machine Unlearning Fails to Remove Data Poisoning Attacks, ft Martin Pawelczyk (on Job Market), Jimmy Di, Ayush Sekhari (on Job Market), Seth Neel. Title says it all: current approaches for machine unlearning (MUL) are not effective at removing the effect of data poisoning attacks. 1/n

๐ŸงตNew paper: Machine Unlearning Fails to Remove Data Poisoning Attacks, ft <a href="/MartinPawelczyk/">Martin Pawelczyk (on Job Market)</a>, <a href="/jimmy_di98/">Jimmy Di</a>, <a href="/ayush_sekhari/">Ayush Sekhari (on Job Market)</a>, <a href="/SethInternet/">Seth Neel</a>. 

Title says it all: current approaches for machine unlearning (MUL) are not effective at removing the effect of data poisoning attacks. 1/n
Martin Pawelczyk (on Job Market) (@martinpawelczyk) 's Twitter Profile Photo

Can current unlearning methods remove poisoned training data from a trained model? Our new paper shows that unlearning methods are not quite ready, yet. Looking forward to a lot of interesting work ahead in this area.

Martin Pawelczyk (on Job Market) (@martinpawelczyk) 's Twitter Profile Photo

Happy to share that our work on machine unlearning received a spotlight talk The GenLaw Center ICML workshop (1:30-2 pm) at Lehar 2. I will also chat about unlearning during the poster session (2-3 pm).

@fraboeni (@fraboeni) 's Twitter Profile Photo

Wanna do a #PhD in #trustworthy #ML? Our group (sprintml.com) has 2 full-time positions open. If you are interested, please reach out. Please also share with people who might be interested.

Seong Joon Oh (@coallaoh) 's Twitter Profile Photo

AI subfields are evolving quickly over time. Some accelerating, while some are slowing down. #ResearchTrendAI More at researchtrend.ai

AI subfields are evolving quickly over time. Some accelerating, while some are slowing down. #ResearchTrendAI

More at researchtrend.ai
Isabel Valera (@ivaleram) 's Twitter Profile Photo

I will be hiring PhD students via ELLIS on multi-objective ML, causal generative models, and fair and interpretable ML. If interested, apply to our probabilistic ML: machinelearning.uni-saarland.de

Micah Goldblum (@micahgoldblum) 's Twitter Profile Photo

๐Ÿ“ขIโ€™ll be admitting multiple PhD students this winter to Columbia University ๐Ÿ™๏ธ in the most exciting city in the world! If you are interested in dissecting modern deep learning systems to probe how they work, advancing AI safety, or automating data science, apply to my group.

๐Ÿ“ขIโ€™ll be admitting multiple PhD students this winter to Columbia University ๐Ÿ™๏ธ in the most exciting city in the world!  If you are interested in dissecting modern deep learning systems to probe how they work, advancing AI safety, or automating data science, apply to my group.
Chirag Agarwal (@_cagarwal) 's Twitter Profile Photo

On this beautiful occasion of Diwali ๐Ÿช”, I am happy to announce that our lab will recruit 1-2 PhD students in UVA School of Data Science for Fall 2025! Our group works on developing Scalable XAI and Trustworthy Algorithms for AI Alignment and Safety. Visit chirag-agarwall.github.io for

On this beautiful occasion of Diwali ๐Ÿช”, I am happy to announce that our lab will recruit 1-2 PhD students in <a href="/uvadatascience/">UVA School of Data Science</a> for Fall 2025!   

Our group works on developing Scalable XAI and Trustworthy Algorithms for AI Alignment and Safety.

Visit chirag-agarwall.github.io for
Micah Goldblum (@micahgoldblum) 's Twitter Profile Photo

๐Ÿšจ๐Ÿ“ข Excited to announce the ICLR 2025 Workshop on Building Trust in LLMs and LLM Applications! ๐Ÿ“ข๐Ÿšจ Submit all your papers, and weโ€™ll see you in Singapore! There will be paper awards, and we have a stacked lineup of speakers and panelists.

Martin Pawelczyk (on Job Market) (@martinpawelczyk) 's Twitter Profile Photo

๐Ÿ“ข๐Ÿ“ข Happy to share that our paper on unlearning evaluations has been accepted to ICLR 2025 ๐Ÿ‡ธ๐Ÿ‡ฌ ๐Ÿ“œ: arxiv.org/abs/2406.17216 Thanks to my great co-authors Seth Neel Gautam Kamath Jimmy Di Ayush Sekhari @yiweilu

Chirag Agarwal (@_cagarwal) 's Twitter Profile Photo

Exciting opportunity at the intersection of climate science and XAI to work on groundbreaking research in attributing extreme precipitation events with multimodal models. Check out the details and help spread the word! #ClimateAI #Postdoc #UVA #Hiring Job description:

Bang An (@bang_an_) 's Twitter Profile Photo

๐Ÿšจ One Day Left! ๐Ÿšจ Submit your work to the ICLR 2025 Workshop on Building Trust in LLMs and LLM Applications. We accept both Full Papers and Tiny Papers! building-trust-in-llms.github.io/iclr-workshop/โ€ฆ

๐Ÿšจ One Day Left! ๐Ÿšจ
Submit your work to the ICLR 2025 Workshop on Building Trust in LLMs and LLM Applications. We accept both Full Papers and Tiny Papers!
building-trust-in-llms.github.io/iclr-workshop/โ€ฆ
Martin Pawelczyk (on Job Market) (@martinpawelczyk) 's Twitter Profile Photo

๐Ÿ“ข Last Chance! ๐Ÿ“ข Submit your work to the ICLR 2025 workshop on Building Trust in LLMs & LLM Applications. โฐ Deadline: 10 February AOE ๐Ÿ“œ Call for papers: building-trust-in-llms.github.io/iclr-workshop/โ€ฆ