Ehud Reiter (@ehudreiter) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

It is a couple years into LLM-world, and I continue to see lots of bias and ethics papers that study jailbreaks. I have yet to see *one* paper with 1) engagement with real world harms and 2) even a description of what a jailbreak is and is not. This is not science guys #ACL2024

thumb_up_off_alt35

chat_bubble_outline2

repeat2

shareShare

Ehud Reiter

@ehudreiter

6 months ago

Nice article on deficiencies in MMLU-like benchmarks in mass-market The Economist . Points out they are out of date, sloppily built, often wrong, leaked into LLM training data, unstable, and gamed by vendors. Wonder if business/govt types will stop treating these as gospel...

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

Ehud Reiter

@ehudreiter

5 months ago

Chatted to stranger on train. I said LLMs impressive and exciting, he thought I meant AGI and extinction risk. Said I worked on AI in healthcare, he assumed this meant medical diagnosis. Bit frustrating, but I guess reflects the way media talks about AI to general public.

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

Dr. Jochen L. Leidner (AI professor/advisor)

@jochenleidner

5 months ago

An example where AI literally harmed people: an AI generated mushroom identification book!

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Ehud Reiter

@ehudreiter

5 months ago

New blog: The latest/trendiest tech isnt always appropriate ... I saw this very strongly in the late 2010s with LSTMs (which do not work well for data-to-text), and continue to see this in 2024 (GPT4 is not always the best approach)... ehudreiter.com/2024/08/26/the…

thumb_up_off_alt16

chat_bubble_outline0

repeat3

shareShare

Ehud Reiter

@ehudreiter

5 months ago

Sometimes the latest technology is *not* appropriate for an NLG task. I saw this very strongly in the late 2010s with LSTMs (which do not work well for data-to-text), and continue to see this in 2024 (GPT4 is not always the best approach). Researchers/devs need to be open-minded

thumb_up_off_alt18

chat_bubble_outline0

repeat1

shareShare

Ehud Reiter

@ehudreiter

5 months ago

Honoured and excited to be giving a keynote on evaluation at NLPCC in China. Also look foward to meeting people in the Chinese NLP community, who are doing great work tcci.ccf.org.cn/conference/202…

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

Wei Zhao

@andyweizhao

5 months ago

We are organizing a shared task on (dis)agreements among annotators in lexical semantics. The task provides human judgements over seven languages - a useful resource for some of you looking at the fundamentals of disagreements such as their complexity and underlying causes.

thumb_up_off_alt10

chat_bubble_outline0

repeat3

shareShare

𝞍 Shin Megami Boson 𝞍

@shinboson

5 months ago

A story about fraud in the AI research community: On September 5th, Matt Shumer, CEO of OthersideAI, announces to the world that they've made a breakthrough, allowing them to train a mid-size model to top-tier levels of performance. This is huge. If it's real. It isn't.

thumb_up_off_alt4,4K

chat_bubble_outline74

repeat398

shareShare

Ehud Reiter

@ehudreiter

5 months ago

New blog: One-day class on NLG evaluation ehudreiter.com/2024/09/09/one… In early Sept I ran a one-day class on evaluation. I summarise what I did in this class and give links to my presentations, in case this is useful to other people.

thumb_up_off_alt27

chat_bubble_outline0

repeat4

shareShare

Simone Balloccu

@simoneballoccu

5 months ago

Call for participation: if you plan to attend INLG 2024 in Tokyo, join us on Sept 23 for the 2nd Workshop on Practical LLM-assisted Data-to-Text Generation! Website: practicald2t.github.io (1/4)

Call for participation: if you plan to attend <a href="/inlgmeeting/">INLG 2024</a> in Tokyo, join us on Sept 23 for the 2nd Workshop on Practical LLM-assisted Data-to-Text Generation!

Website: practicald2t.github.io
(1/4)

thumb_up_off_alt3

chat_bubble_outline1

repeat2

shareShare

Ehud Reiter

@ehudreiter

5 months ago

On holiday in Shetland with Ann. I don't usually post holiday pictures, but this is 50m from our self catering flat, and is the kind of scenery i love

thumb_up_off_alt12

chat_bubble_outline1

repeat0

shareShare

Ehud Reiter

@ehudreiter

5 months ago

I worry that: (A) At a superficial level, LLMs can do amazing human-like things (B) Many NLP "evaluations" of LLMs are meaningless, and community doesnt seem to care Therefore (C) Extravagent claims are made for LLMs based on garbage evals, and taken at face value

thumb_up_off_alt28

chat_bubble_outline2

repeat5

shareShare

Ehud Reiter

@ehudreiter

5 months ago

Congratulations to Allmin Pradhap Singh Susaiyah for passing his PhD defence! I helped to supervise Allmin (who is at Eindhoven), it was a pleasure to work with him.

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

Ehud Reiter

@ehudreiter

5 months ago

Really interesting to see this analysis of how people actually use chatgpt for news

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Daniel Feldman

@d_feldman

4 months ago

The widely-used wordfreq database of English word frequencies will no longer be updated.

thumb_up_off_alt9,9K

chat_bubble_outline65

repeat1,1K

shareShare

Saad Mahamood

@saad_m

4 months ago

The proceedings for the 17th International Natural Language Generation Conference has been published by siggen_acl and is now available: aclanthology.org/events/inlg-20… I am looking forward to seeing everyone in person next week in Tokyo! #nlg #nlp #inlg2024

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Ehud Reiter

@ehudreiter

4 months ago

LinkedIn is using content to train AI models, but not in Europe! (linkedin.com/help/linkedin/… , click on Can I Opt Out). Shows benefits of strong data protection laws.

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

Ehud Reiter

@ehudreiter

4 months ago

New blog: How AI can help reform UK NHS ehudreiter.com/2024/09/23/how… The UK government wants to reform the UK health system by digitisation, shifting care to communities, and focusing on prevention. I think there is a lot of potential for AI to help with this...

thumb_up_off_alt1

chat_bubble_outline1

repeat1

shareShare

Ehud Reiter

@ehudreiter

4 months ago

UK govt wants to reform the National Health Service to shift care to GPs and community care, and to focus on prevention of illness. AI can help with this if AI/Med researchers focus on these topics, instead of being fixated on improving diagnoses in hospitals.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Ehud Reiter

Gate.io

Seraphina Goldfarb-Tarrant

Ehud Reiter

Ehud Reiter

Dr. Jochen L. Leidner (AI professor/advisor)

Ehud Reiter

Ehud Reiter

Ehud Reiter

Wei Zhao

𝞍 Shin Megami Boson 𝞍

Ehud Reiter

Simone Balloccu

Ehud Reiter

Ehud Reiter

Ehud Reiter

Ehud Reiter

Daniel Feldman

Saad Mahamood

Ehud Reiter

Ehud Reiter

Ehud Reiter