Kyle Lo @ ICLR 2024 (@kylelostat)'s Twitter Profile
Kyle Lo @ ICLR 2024

@kylelostat

#nlproc #hci Leading Data Research for OLMo @allen_ai, he/him, https://t.co/5Hm9cx3mC1


Website: http://kyleclo.com · Joined: 03-01-2019 01:38:36

411 Tweets

2.1K Followers

1.1K Following

Shayne Longpre (@ShayneRedford)'s Twitter Profile Photo

🌟Several dataset releases deserve a mention for their incredible data measurement work 🌟

➡️ The Pile (arxiv.org/abs/2101.00027) Leo Gao Stella Biderman

➡️ ROOTS (arxiv.org/abs/2303.03915) Hugo Laurençon++

➡️ Dolma (arxiv.org/abs/2402.00159) Luca Soldaini 🎀 Kyle Lo

14/

Kyle Lo @ ICLR 2024 (@kylelostat)'s Twitter Profile Photo

follow-up to our work on BooookScore:

🐋prev, we evaluated summary coherence,
🦉now, we're evaluating faithfulness, omissions, etc., which is hard cuz it requires localizing summary content within the original source (>100k tokens)

come chat w us at ICLR 2024 🐙
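
That localization step lends itself to a retrieval-style sketch. This is not the paper's actual method, just a minimal illustration of narrowing a long source down to a few candidate chunks per summary claim; the chunk size and TF-IDF scoring are assumptions of mine:

```python
# Minimal sketch: for each summary claim, retrieve the source chunks most
# likely to contain supporting evidence, so a judge (human or LM) only has
# to read a handful of passages instead of a >100k-token book.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def chunk_text(text: str, size: int = 2000) -> list[str]:
    """Split the source into fixed-size character chunks (a crude choice)."""
    return [text[i:i + size] for i in range(0, len(text), size)]

def localize(claim: str, source: str, top_k: int = 3) -> list[str]:
    """Return the top_k chunks most similar to the claim under TF-IDF cosine."""
    chunks = chunk_text(source)
    vectorizer = TfidfVectorizer().fit(chunks + [claim])
    scores = cosine_similarity(
        vectorizer.transform([claim]), vectorizer.transform(chunks)
    )[0]
    ranked = sorted(range(len(chunks)), key=lambda i: scores[i], reverse=True)
    return [chunks[i] for i in ranked[:top_k]]
```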

Luca Soldaini 🎀 (@soldni)'s Twitter Profile Photo

PS: if you are also attending GenLaw and are looking for opportunities to do research at the intersection of AI, Law, and Policy, let's chat 😊

Cody Blakeney (@code_star)'s Twitter Profile Photo

It’s finally here 🎉🥳

In case you missed us, MosaicML / Databricks is back at it with a new best-in-class open-weight LLM named DBRX: an MoE with 132B total parameters (32B active) and a 32k context length, trained on 12T tokens 🤯

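A quick back-of-envelope note on those numbers: in an MoE, per-token compute scales with the active parameters, not the total. A sketch using the common ~6·N·D FLOPs rule of thumb (an approximation, with figures taken straight from the tweet):

```python
# Training FLOPs ≈ 6 * N * D, where N is the parameters touched per token
# and D is the number of training tokens. For an MoE, N is the *active*
# parameter count, so DBRX trains roughly like a 32B dense model per token.
total_params = 132e9    # all experts combined
active_params = 32e9    # parameters active per token (figure from the tweet)
tokens = 12e12          # training tokens

train_flops = 6 * active_params * tokens
print(f"~{train_flops:.1e} training FLOPs")  # ~2.3e+24
```
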
Kyle Lo @ ICLR 2024 (@kylelostat)'s Twitter Profile Photo

one of my favorite aspects of this project is that it shows careful reuse of high-quality 'older' datasets is still effective today🦉

we may think 'instructions' are a relatively recent trend in NLP, but some of the datasets we repurpose date back to 2004! 🎂

Kyle Lo @ ICLR 2024 (@kylelostat)'s Twitter Profile Photo

LMs can generate plain-language summaries. For some audiences, automated simplification of complex text can improve the reading experience.
But what about users with more subject-matter expertise? Our paper studies the benefits & pitfalls of LMs for simplifying scientific texts.
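
As an illustration of the setup being studied (a hedged sketch, not the paper's protocol; the model name and prompt wording are my assumptions), plain-language simplification can be as simple as prompting an instruction-tuned LM:

```python
from transformers import pipeline

# Any instruction-tuned model would do; this repo id is just an example.
generator = pipeline("text-generation", model="HuggingFaceH4/zephyr-7b-beta")

def simplify(passage: str) -> str:
    """Ask the LM for a plain-language rewrite of a scientific passage."""
    prompt = (
        "Rewrite the following scientific text in plain language for a "
        "general audience, keeping the key findings intact:\n\n" + passage
    )
    out = generator(prompt, max_new_tokens=256, do_sample=False)
    # generated_text includes the prompt, so strip it off
    return out[0]["generated_text"][len(prompt):]
```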

Kyle Lo @ ICLR 2024 (@kylelostat)'s Twitter Profile Photo

can LMs help us write expository answers to scientific research questions?

excited to share our work led by Fangyuan Xu. we recruited NLP folks to work with an LM to answer research questions and logged successes/failures in sustained interaction traces🦉
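
For a sense of what such a trace might contain, here is a hypothetical schema (field names are my own illustration, not the paper's actual logging format):

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class Turn:
    role: str                       # "user" or "model"
    text: str                       # query or generated answer
    success: Optional[bool] = None  # annotator's success/failure judgment

@dataclass
class InteractionTrace:
    question: str                   # the scientific research question
    turns: list[Turn] = field(default_factory=list)

    def log(self, role: str, text: str, success: Optional[bool] = None) -> None:
        self.turns.append(Turn(role, text, success))
```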

Shayne Longpre (@ShayneRedford)'s Twitter Profile Photo

New Resource: Foundation Model Development Cheatsheet for best practices

We compiled 250+ resources & tools for:
🔭 sourcing data
🔍 documenting & auditing
🌴 environmental impact
☢️ risks & harms eval
🌍 release & monitoring

With experts from EleutherAI, Allen Institute for AI,…

Kyle Lo @ ICLR 2024 (@kylelostat)'s Twitter Profile Photo

DM me if you're interested in:
🐋creating high-quality pretraining datasets
🐊studying data's impact on LM capabilities
🦉tools for sensemaking over large corpora
🐡adapting LMs to specialized domains like science
🐈evaluation through human interaction

Kyle Lo @ ICLR 2024 (@kylelostat)'s Twitter Profile Photo

this work is fascinating 🤯
1. they successfully inflate citation counts by creating ChatGPT-generated articles. Google Scholar parses the fake articles' references & automatically increments citation counts
2. they successfully pay services to literally 'buy' citations

Mechanical Dirk (@mechanicaldirk)'s Twitter Profile Photo

We just uploaded detailed Weights & Biases training logs for the OLMo 7B run: wandb.ai/ai2-llm/OLMo-7…

This is a cleaned-up version from the actual run, so the wall clock times don't make sense, but all the other information is there!
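
Those logs can be pulled programmatically with the official W&B API. A hedged sketch (the run id is truncated in the link above, so it stays a placeholder, and the metric name is an assumption about how the run was logged):

```python
import wandb

api = wandb.Api()
# entity/project/run_id; RUN_ID is a placeholder for the truncated link above
run = api.run("ai2-llm/OLMo-7B/RUN_ID")

# history() returns a pandas DataFrame of the logged metrics
df = run.history(keys=["train/loss"], samples=500)
print(df.head())
```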

Kyle Lo @ ICLR 2024 (@kylelostat)'s Twitter Profile Photo

excited to share our contribution to open science of language models!

🐈‍⬛ all our data, weights, ckpts, code, etc
🐈 covers data curation, pretraining, adaptation, evaluation, etc

check out more deets in Luca Soldaini 🎀's thread; technical reports out on arXiv shortly 😆
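
A minimal sketch of loading the released 7B weights through Hugging Face transformers, assuming the allenai/OLMo-7B repo id (at release this also needed the ai2-olmo package, hence trust_remote_code):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("allenai/OLMo-7B", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("allenai/OLMo-7B", trust_remote_code=True)

inputs = tokenizer("Language modeling is", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```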
