Xinyu Crystina Zhang | on job market (@crystina_z) 's Twitter Profile
Xinyu Crystina Zhang | on job market

@crystina_z

PhD @UWaterloo ugrad @HKUST | prev. Google DM @cohere #CLOVA, MPI | Multilingual | IR | author of MIRACL.ai Mr. TyDi 🦋crystinaz.bsky.social | on job market!

ID: 1233368927264288769

Link: https://crystina-z.github.io · Joined: 28-02-2020 12:30:52

126 Tweets

573 Followers

573 Following

Catherine Arnett (@linguist_cat) 's Twitter Profile Photo

✨New pre-print✨ Crosslingual transfer allows models to leverage their representations for one language to improve performance on another language. We characterize the acquisition of shared representations in order to better understand how and when crosslingual transfer happens.

Freda Shi (@fredahshi) 's Twitter Profile Photo

📢I just made the slides public for this talk. TL; DR: how we computer scientists adapt insights from linguistics to analyze and improve our models. Comments & discussion are welcomed; the recording from Vector is forthcoming. docs.google.com/presentation/d…

Freda Shi (@fredahshi) 's Twitter Profile Photo

On my way to NAACL✈️! If you're also there and interested in grounding, don't miss our tutorial on "Learning Language through Grounding"! Mark your calendar: May 3rd, 14:00-17:30, Ballroom A. Another exciting collaboration with Martin Ziqiao Ma Jiayuan Mao Parisa Kordjamshidi Michigan SLED Lab!

Xinyu Crystina Zhang | on job market (@crystina_z) 's Twitter Profile Photo

On my way to #NAACL2025 ✈️ I'll present the paper on Friday (May 2) 9-10:30am at poster session 7. Happy to chat about any aspect of multilingualism and culture! I'm also open to postdoc and visiting positions in the US. Definitely reach out if you have any opportunities.

Akari Asai (@akariasai) 's Twitter Profile Photo

Excited to be at the Foundation Models for Science Conference in NYC and NAACL in Albuquerque this week! I’ll be presenting OpenScholar (arxiv.org/abs/2411.14199), CodeRAG-Bench (arxiv.org/abs/2406.14497) and others, & organizing a workshop! Come say hi 🧵

Xueguang Ma (@xueguang_ma) 's Twitter Profile Photo

Sharing some updates on the Tevatron-2.0 toolkit (accepted as a SIGIR 2025 demo), together with OmniEmbed-v0.1. Tevatron-2.0 aims to better support the training of unified embedding models across tasks, languages, and modalities, facilitating future research in better information

Shengyao Zhuang (@shengyaozhuang) 's Twitter Profile Photo

One embedding model for all modalities and across different languages! We will demo the model training pipeline at #SIGIR2025. Our OmniEmbed-v0.1 also demonstrates very strong performance on the MAGMaR multimodal retrieval shared task eval.ai/web/challenges…

Benjamin Minixhofer (@bminixhofer) 's Twitter Profile Photo

We achieved the first instance of successful subword-to-byte distillation in our (just updated) paper. This enables creating byte-level models at a fraction of the cost of what was needed previously. As a proof-of-concept, we created byte-level Gemma2 and Llama3 models. 🧵

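As background to the announcement above (this is only an illustrative toy, not the paper's distillation method): byte-level models are attractive because the token inventory shrinks to 256 raw byte values, so any text in any language is representable with no out-of-vocabulary tokens, at the cost of longer sequences.

```python
# Toy byte-level "tokenizer": IDs are just UTF-8 byte values (0..255),
# versus a subword vocabulary of tens or hundreds of thousands of entries.
def byte_tokenize(text: str) -> list[int]:
    return list(text.encode("utf-8"))

def byte_detokenize(ids: list[int]) -> str:
    return bytes(ids).decode("utf-8")

ids = byte_tokenize("héllo")          # 'é' expands to two bytes
assert all(0 <= i < 256 for i in ids)  # every id fits in one byte
assert byte_detokenize(ids) == "héllo"
```

The trade-off the paper's distillation addresses is exactly the expensive part: training a model to consume these much longer byte sequences, which the subword-to-byte transfer makes far cheaper than training from scratch.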
Jiaang Li (@jiaangli) 's Twitter Profile Photo

🚀New Preprint Alert 🚀 Can Multimodal Retrieval Enhance Cultural Awareness in Vision-Language Models? Excited to introduce RAVENEA, a new benchmark aimed at evaluating cultural understanding in VLMs through RAG.

Nandan Thakur (@beirmug) 's Twitter Profile Photo

Did you know that fine-tuning retrievers & re-rankers on large but unclean training datasets can harm their performance? 😡 In our new preprint, we re-examine popular IR training data quality by pruning datasets and identifying and relabeling 𝐟𝐚𝐥𝐬𝐞-𝐧𝐞𝐠𝐚𝐭𝐢𝐯𝐞𝐬! 🏷️

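For context on the tweet above, a common recipe for catching false negatives (a hedged sketch, not necessarily the preprint's pipeline; the `score` callable stands in for a strong cross-encoder or LLM judge) is to rescore each labeled negative against the query and flag those that look too relevant:

```python
# Sketch: split "negatives" into true negatives vs. suspected false
# negatives, using an external relevance scorer. Suspected items can
# then be dropped or relabeled as positives before fine-tuning.
def prune_false_negatives(query, negatives, score, threshold=0.9):
    kept, suspected = [], []
    for doc in negatives:
        (suspected if score(query, doc) >= threshold else kept).append(doc)
    return kept, suspected
```

The design point is that the scorer only has to be run once, offline, over the training set; the cleaned data then benefits every retriever or reranker trained on it.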
Eugene Yang (@eyangtw) 's Twitter Profile Photo

🚨Wouldn’t it be nice if your agentic search system could reason over all your docs? ✨Introducing Rank-K, a listwise reranker that benefits from test-time compute and long-context! Rank-K sets a new SoTA for reasoning-based reranking, without reasoning chains from other models.

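To unpack the term in the tweet above: a listwise reranker sees the query and the whole candidate list at once and emits a full ordering, rather than scoring each passage independently (pointwise). The toy below illustrates only that interface; Rank-K uses a reasoning LLM where the stub overlap scorer sits here.

```python
# Toy listwise interface: input a query plus all candidate passages,
# output a permutation of their indices, best first.
# The term-overlap "relevance" is a stand-in stub for the LLM.
def toy_listwise_rank(query: str, passages: list[str]) -> list[int]:
    q = set(query.lower().split())
    overlap = [len(q & set(p.lower().split())) for p in passages]
    return sorted(range(len(passages)), key=lambda i: -overlap[i])
```

Operating on the full list is what lets long-context, test-time-compute models compare candidates against each other instead of judging each in isolation.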
Xinyu Crystina Zhang | on job market (@crystina_z) 's Twitter Profile Photo

The more data, the better? 🤔 Only if they are clean! Introducing our latest work on relabeling hard negatives in massive IR training sets! 📝 Cleaner data → stronger embeddings & rerankers. Read more here ⬇️

Jimmy Lin (@lintool) 's Twitter Profile Photo

💥 My awesome University of Waterloo ugrad student Sisi Li - with the help of Ronak Pradeep - slapped an MCP server in front of Pyserini to create MCPyserini and connected it to Claude to create DeepResearcherini! 🤪 Here, an example of RAG using the MS MARCO v1 passage collection.

Wenyan Li (@wenyan62) 's Twitter Profile Photo

Excited to share that our multimodal temporal culture benchmark is released 🚀🚀🚀 The dataset is public on 🤗 huggingface. Check it out!! arxiv.org/abs/2506.01565 huggingface.co/datasets/lizho…

Xueguang Ma (@xueguang_ma) 's Twitter Profile Photo

Very strong embedding model!!! If anyone is interested in further fine-tuning Qwen3-embed with custom data, here is the command with Tevatron. github.com/texttron/tevat…

Xueguang Ma (@xueguang_ma) 's Twitter Profile Photo

Sharing our recent efforts on applying OmniEmbed to large-scale video retrieval MultiVENT2.0! tl;dr, we achieve SoTA on the MAGMAR shared task leaderboard. More importantly, we provide in-depth analysis on the effectiveness of different input modalities for video retrieval.

Jimmy Lin (@lintool) 's Twitter Profile Photo

In December 2024 Pankaj Gupta Gilad Mishne Will Horn and I put out a rather cryptic arXiv paper musing about the future of search: arxiv.org/abs/2412.18956. I’m now able to share what I’ve been up to! 🧵(1/9)

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation "we introduce WM-ABench, a large-scale benchmark comprising 23 fine-grained evaluation dimensions across 6 diverse simulated environments with controlled counterfactual simulations. Through 660
