NotJustKama (@karrsen0713)'s Twitter Profile
NotJustKama

@karrsen0713

NLP & Vision-language @ Brown University

ID: 1885301189937664000

31-01-2025 12:17:00

9 Tweets

4 Followers

54 Following

Zhaochen Su (@suzhaochen0110)

Excited to share our new survey on the reasoning paradigm shift from "Think with Text" to "Think with Image"! 🧠🖼️
Our work offers a roadmap for more powerful & aligned AI. 🚀
📜 Paper: arxiv.org/pdf/2506.23918
⭐ GitHub (400+🌟): github.com/zhaochen0110/A…
May Fung (@may_f1_)

🧠 How can AI evolve from statically thinking about images → dynamically thinking with images as cognitive workspaces, similar to the human mental sketchpad?
🔍 What’s the research roadmap from tool-use → programmatic
NotJustKama (@karrsen0713)

I was saddened to find that out of the six papers I submitted, the reviewer response rate was 2/20, and out of the 12 papers I reviewed, the reviewer response rate was 16/43 (12 of which were my own) 😢 #ARR #emnlp ACLRollingReview

Etha Tianze Hua (@ethahua)

Check out our new paper: “How Do Vision-Language Models Process Conflicting Information Across Modalities?”! Vision-language models often struggle with conflicting inputs - we show how their internal representations and key attention heads reveal when and how this happens, and

Conference on Language Modeling (@colm_conf)

COLM 2025 accepted submissions are now public: openreview.net/group?id=colmw… Congratulations to all the authors, and see you all in Montreal!

Chi Jin (@chijinml)

Our department (ECE) at Princeton is hiring in AI this year! 📢 Please consider applying and joining us: princeton.edu/acad-positions…

Unsloth AI (@unslothai)

You can now train Vision LLMs with Reinforcement Learning in our free notebook! Unsloth VLM RL via GRPO: 1.5× faster, 90% less VRAM, 15× longer context & no accuracy loss. Guide: docs.unsloth.ai/new/vision-rei… GitHub: github.com/unslothai/unsl… Qwen2.5-VL Colab: colab.research.google.com/github/unsloth…

Isabel Papadimitriou (@isabelpapad)

Are there conceptual directions in VLMs that transcend modality? Check out our COLM spotlight 🔦 paper! We analyze how linear concepts interact with multimodality in VLM embeddings using SAEs

with Chloe H. Su, @napoolar, Sham Kakade and Stephanie Gil
arxiv.org/abs/2504.11695
DeepSeek (@deepseek_ai)

🚀 Introducing DeepSeek-V3.2-Exp, our latest experimental model! ✨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention (DSA) for faster, more efficient training & inference on long context. 👉 Now live on App, Web, and API. 💰 API prices cut by 50%+! 1/n