Kyle Lo (@kylelostat) 's Twitter Profile
Kyle Lo

@kylelostat

#nlproc #hci research scientist @allen_ai, co-lead of data for OLMo w/ @soldni, he/him, find me on πŸ‘‰πŸ»kylelo.bsky.socialπŸ§‹

ID: 1080639531429183488

linkhttp://kyleclo.com calendar_today03-01-2019 01:38:36

603 Tweet

3,3K Followers

1,1K Following

Kyle Lo (@kylelostat) 's Twitter Profile Photo

starting now in Hall 3 poster 306! come chat w me Niklas Muennighoff & Luca Soldaini πŸš— ICML 2025 about our MoE w fully open data, weights, code, and more! complete w iOS app to run it completely on device πŸ˜† poster is so pretty 😭 and don’t miss oral session later too! #ICLR2025

starting now in Hall 3 poster 306!

come chat w me <a href="/Muennighoff/">Niklas Muennighoff</a> &amp;
<a href="/soldni/">Luca Soldaini πŸš— ICML 2025</a> about our MoE w fully open data, weights, code, and more! complete w iOS app to run it completely on device πŸ˜†

poster is so pretty 😭  and don’t miss oral session later too! #ICLR2025
Kyle Lo (@kylelostat) 's Twitter Profile Photo

outstanding paper award for our AI in Education work! 🐟 dataset of natural images of student solutions to K-12 math problems from online teaching platform 🐠 annotations (dense captions, VQA pairs) by teachers to eval VLMs chat w leads Sami Baral Lucy Li at #NAACL2025 🀩

Kyle Lo (@kylelostat) 's Twitter Profile Photo

we released OLMo 2 1B, showing again how well our OLMo 2 pretrain & post train recipe works! Our small 1B model is comparable or better than other top open weights-only alternatives while maintaining full open data, code & intermediate checkpoints!

Kyle Lo (@kylelostat) 's Twitter Profile Photo

excited to win πŸ† this award for our work on molmo & pixmo, showing the value of high-quality data curation for VLMs! recalling when we released same time as Llama 3.2 πŸ˜† huge kudos to Matt Deitke chris clark & Ani Kembhavi for their leadership on this project!

excited to win πŸ† this award for our work on molmo &amp; pixmo, showing the value of high-quality data curation for VLMs! 

recalling when we released same time as Llama 3.2 πŸ˜†

huge kudos to <a href="/mattdeitke/">Matt Deitke</a>  chris clark &amp; <a href="/anikembhavi/">Ani Kembhavi</a> for their leadership on this project!
Kyle Lo (@kylelostat) 's Twitter Profile Photo

will be at #icml2025, lemme kno if wanna chat about OLMo pretraining data curation, evaluation, data mixing, etc!πŸ‘‹ find us at poster sess on πŸ“…Wed 7/16 @ 11am⏲️ to learn about Web Organizer, distilling web data taxonomies into small models & using them for LM data mixing!

Alex Wettig (@_awettig) 's Twitter Profile Photo

Presenting two posters at ICML over the next two days: - Both at 11am - 1:30pm - Both about how to improve pre-training with domains - Both at stall # E-2600 in East Exhibition Hall A-B (!) Tomorrow: WebOrganizer w/ Luca Soldaini πŸš— ICML 2025 & Kyle Lo @ ICML2025 Thursday: MeCo by Tianyu Gao

Presenting two posters at ICML over the next two days:
- Both at 11am - 1:30pm
- Both about how to improve pre-training with domains
- Both at stall # E-2600 in East Exhibition Hall A-B (!)

Tomorrow: WebOrganizer w/ <a href="/soldni/">Luca Soldaini πŸš— ICML 2025</a> &amp; <a href="/kylelostat/">Kyle Lo @ ICML2025</a>
Thursday: MeCo by <a href="/gaotianyu1350/">Tianyu Gao</a>
Kyle Lo (@kylelostat) 's Twitter Profile Photo

presenting olmOCR at the poster session (2:15pm 211 West) for #codeml workshop at #icml2025! 🐟 fully open source OCR, comparable or better than frontier VLMs 🐠 all weights, data, code free & public 🐑 new benchmark of OCR "unit tests" on diverse PDFs & challenging OCR cases