Leshem Choshen @LREC 🤖🤗 (@LChoshen) Twitter Tweets • TwiCopy

3 weeks ago

LLMs for education have the power to improve us (cf. replace us et al.)
First, they need to adapt to be good teachers

thumb_up_off_alt3

repeat1

account_circle

Reading a paper on Arxiv\scholar?
Want to know (or give credit) to the authors?

Created a small script you can add to tapermonkey to search for the authors
greasyfork.org/es/scripts/494…

P.S. I have no javascript skills, and LMs have very little...

thumb_up_off_alt5

repeat0

account_circle

Alex Warstadt

@a_stadt

3 weeks ago

How would you test your BabyLM?

We have a new and improved eval pipeline for round 2 of the BabyLM competition, but we're NOT done adding to it.

If you have an idea for a new eval, reach out, open a pull request, or even submit a writeup to our new 'paper track'!

thumb_up_off_alt23

repeat3

account_circle

Tom Kocmi

@KocmiTom

2 months ago

Some metrics are completely useless to evaluate unrelated systems (like LLM vs. NMT). For example, +2 BLEU gain for unrelated systems is about as good as a coin toss (~55%). While the same gain for related systems (e.g. baseline vs. improved model) is about 90% accurate as humans

thumb_up_off_alt50

repeat8

account_circle

Leshem Choshen @LREC 🤖🤗

3 weeks ago

Some people asked me about babyLM evaluation:

thumb_up_off_alt2

repeat0

account_circle

Leshem Choshen @LREC 🤖🤗

4 weeks ago

NeurIPS Conference Anna Rogers 🇺🇦🇪🇺 is looking for postdocs! but also I don't want to compete

thumb_up_off_alt1

repeat0

account_circle

Leshem Choshen @LREC 🤖🤗

4 weeks ago

Are we seeing the second wave?
Linguistic and cultural diversity of LLMs?

Technologies in NLP are always years ahead in English, and then slowly catching up elsewhere recent events might hint we start a second phase?

From today:
x.com/LChoshen/statu…
x.com/LChoshen/statu…

thumb_up_off_alt4

repeat1

account_circle

Leshem Choshen @LREC 🤖🤗

4 weeks ago

GPT3.5 utterly fails on ~300 Polish medical exams,
But GPT4 passes 75%!
arxiv.org/abs/2405.01589

I wonder, how culturally different Polish exams are from US ones
Sadly, a deep read will tell you the exams are in English...

thumb_up_off_alt13

repeat1