Nicolas Chapados (@nicolaschapados) 's Twitter Profile
Nicolas Chapados

@nicolaschapados

Passionate about AI and its impact on society — VP, Research, ServiceNow; Co-Founder, Imagia & Element AI

ID: 498502964

21-02-2012 03:51:46

944 Tweets

3.3K Followers

610 Following

Tianyu Zhang (@tianyu_zh) 's Twitter Profile Photo

[1/n] We are happy to announce our new VLM task: Visual Caption Restoration along with datasets: arxiv.org/abs/2406.06462, tiny.cc/m06lyz Try yourself before diving in😀 Authors: T. Zhang, S. Wang, L. Li, G. Zhang, P. Taslakian, S. Rajeswar, J. Fu, B. Liu, Y. Bengio

Alexandre Lacoste (@alex_lacoste_) 's Twitter Profile Photo

We’re really excited to release this large collaborative work for unifying web agent benchmarks under the same roof. In this TMLR paper, we dive in-depth into #BrowserGym and #AgentLab. We also present some unexpected performances from Claude 3.5-Sonnet

Nicolas Chapados (@nicolaschapados) 's Twitter Profile Photo

Thrilled to be speaking at the first Workshop for Research on Agent Language Models, at ACL 2025 this summer! Congrats to the organizers for putting together a strong program on a timely topic. Consider submitting your work (March 1st deadline).

Gaurav Sahu (@dem_fier) 's Twitter Profile Photo

A little late to the party, but really happy to share that our work (arxiv.org/abs/2407.07341) from ServiceNow Research got accepted to #NAACL2025 (Findings), where we propose two sample-efficient methods for effective short and long document summarization! NAACL HLT 2025 1/3

Léo Boisvert (@leoboisvert) 's Twitter Profile Photo

📊 Fresh WorkArena benchmark results just dropped! Plot twist: o1-mini (51.8%) > o3-mini (48.2%) Either o1-mini had its coffee this morning ☕️ or we've stumbled upon something interesting 🧐 Replication studies welcome!

Krishnamurthy (Dj) Dvijotham (@djdvij) 's Twitter Profile Photo

You drop a model, we drop our eval, boom! Plot twist on o1 vs o3 on our challenging workarena++ benchmark of enterprise knowledge worker tasks

Ravid Shwartz Ziv (@ziv_ravid) 's Twitter Profile Photo

🧵 I forgot to update, but our paper "SEQ-VCR: Preventing Collapse in Intermediate Transformer Representations" has been accepted to ICLR! Let me tell you why this is a cool paper... Rifat Arefin, Gopeshh Subbaraj, Nicolas Gontier, Yann LeCun, Irina Rish, Chris Pal

Ahmed Masry (@ahmed_masry97) 's Twitter Profile Photo

Happy to announce AlignVLM📏: a novel approach to bridging vision and language latent spaces for multimodal understanding in VLMs! 🌍📄🖼️ 🔗 Read the paper: arxiv.org/abs/2502.01341 🧵👇 Thread

Dawn Song (@dawnsongtweets) 's Twitter Profile Photo

Really excited to announce our Advanced LLM Agents MOOC (Spring 2025)! Building on the success of our LLM Agents MOOC from Fall 2024 (15K+ registered learners, ~9K Discord members, 200K+ lecture views on YouTube), we are excited to extend the MOOC this semester to cover some more

Dawn Song (@dawnsongtweets) 's Twitter Profile Photo

🚀 Really excited to launch #AgentX competition hosted by UC Berkeley RDI UC Berkeley alongside our LLM Agents MOOC series (a global community of 22k+ learners & growing fast). Whether you're building the next disruptive AI startup or pushing the research frontier, AgentX is your

METR (@metr_evals) 's Twitter Profile Photo

When will AI systems be able to carry out long projects independently? In new research, we find a kind of “Moore’s Law for AI agents”: the length of tasks that AIs can do is doubling about every 7 months.

P Shravan Nayak (@pshravannayak) 's Twitter Profile Photo

🚀 Super excited to announce UI-Vision: the largest and most diverse desktop GUI benchmark for evaluating agents in real-world desktop GUIs in offline settings. 📄 Paper: arxiv.org/abs/2503.15661 🌐 Website: uivision.github.io 🧵 Key takeaways 👇

P Shravan Nayak (@pshravannayak) 's Twitter Profile Photo

🚀 Excited to share that UI-Vision has been accepted at ICML 2025! 🎉 We have also released the UI-Vision grounding datasets. Test your agents on it now! 🚀 🤗 Dataset: huggingface.co/datasets/Servi… #ICML2025 #AI #DatasetRelease #Agents

Juan A. Rodríguez 💫 (@joanrod_ai) 's Twitter Profile Photo

Thanks AK for sharing our work! Excited to present our next generation of SVG models, now using Reinforcement Learning from Rendering Feedback (RLRF). 🧠 We think we cracked SVG generalization with this one. Go read the paper! arxiv.org/abs/2505.20793 More details on
