Gustavo Penha (@_guz_) Twitter Tweets • TwiCopy

Gustavo Penha

@_guz_

+ Follow

Research Scientist @Spotify · Working with IR, RecSys, NLP · PhD from @tudelft · ex @AmazonScience · guzpenha.github.io/guzblog/

ID: 19626509

linkhttps://linktr.ee/guzpenha calendar_today28-01-2009 00:15:44

671 Tweet

813 Followers

560 Following

Gustavo Penha

@_guz_

8 months ago

We have an open research scientist position in our lab at Spotify, Personalization ! The areas of expertise are: Information Retrieval, Recommendation System, Language Technologies, Foundational Models, Generative AI Technologies, and Machine Learning. lifeatspotify.com/jobs/research-…

thumb_up_off_alt10

chat_bubble_outline0

repeat2

shareShare

Gustavo Penha

@_guz_

8 months ago

I am attending #ECIR25 at Lucca 🇮🇹 if you are interested and want to discuss this position!

thumb_up_off_alt12

chat_bubble_outline0

repeat0

shareShare

Sumit

@_reachsumit

8 months ago

Contextualizing Spotify's Audiobook List Recommendations with Descriptive Shelves Spotify introduces a pipeline that generates personalized audiobook recommendations with descriptive shelves to help users explore content based on their interests. 📝arxiv.org/abs/2504.13572

thumb_up_off_alt15

chat_bubble_outline0

repeat4

shareShare

Sumit

@_reachsumit

5 months ago

Aligned Query Expansion: Efficient Query Expansion for Information Retrieval through LLM Alignment Adam Yang et al. leverage LLM alignment techniques to fine-tune models for generating query expansions that directly optimize retrieval effectiveness. 📝arxiv.org/abs/2507.11042

thumb_up_off_alt6

chat_bubble_outline0

repeat1

shareShare

Sumit

@_reachsumit

5 months ago

Adaptive Repetition for Mitigating Position Bias in LLM-Based Ranking Spotify introduces a dynamic early-stopping method that adaptively determines repetitions needed for each ranking instance, reducing LLM calls by 81% while preserving accuracy. 📝arxiv.org/abs/2507.17788

thumb_up_off_alt10

chat_bubble_outline0

repeat2

shareShare

Sumit

@_reachsumit

4 months ago

Evaluating Podcast Recommendations with Profile-Aware LLM-as-a-Judge Spotify introduces a profile-aware LLM framework for evaluating personalized podcast recommendations using natural-language user profiles distilled from listening history. 📝arxiv.org/abs/2508.08777

thumb_up_off_alt22

chat_bubble_outline0

repeat6

shareShare

Sumit

@_reachsumit

4 months ago

Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations Marco De Nadai et al. at Spotify use multimodal LLMs to generate natural-language descriptions of video content for better recommendations 📝arxiv.org/abs/2508.09789 👨🏽‍💻huggingface.co/datasets/marco…

thumb_up_off_alt8

chat_bubble_outline0

repeat4

shareShare

Marco De Nadai

@denadai2

4 months ago

What if we could use off-the-shelf Multimodal Large Language Model to enrich current video recommendation models? This is what we asked ourselves in our recent #recsys2025 paper arxiv.org/pdf/2508.09789 🧵

thumb_up_off_alt5

chat_bubble_outline1

repeat2

shareShare

Sumit

@_reachsumit

4 months ago

Semantic IDs for Joint Generative Search and Recommendation Gustavo Penha et al. at Spotify introduce a bi-encoder model fine-tuned on both search and recommendation tasks to obtain item embeddings, followed by construction of unified Semantic ID space. 📝arxiv.org/abs/2508.10478

thumb_up_off_alt10

chat_bubble_outline0

repeat2

shareShare

Gustavo Penha

@_guz_

4 months ago

Happy to share our #recsys25 paper: “Evaluating Podcast Recommendations with Profile-Aware LLM-as-a-Judge”. 🧠 90 days of listening → natural-language user profiles → LLM judges alignment 📊 Aligns with human eval. With amazing Spotify co-authors. 📄 arxiv.org/abs/2508.08777

thumb_up_off_alt12

chat_bubble_outline0

repeat4

shareShare

Aixin Sun 孙爱欣

@aixinsg

3 months ago

I doubt to what extent improvements on these datasets would translate to improvements in today's real-world recommendation settings. Reference: arxiv.org/abs/2508.19399…

thumb_up_off_alt10

chat_bubble_outline1

repeat4

shareShare

Kamil Ciosek

@mlciosek

3 months ago

For anyone worried their LLM might be making stuff up, we made a budget‐friendly truth serum (semantic entropy + Bayesian). See for yourself: youtube.com/watch?v=x_8ORG… Paper: arxiv.org/pdf/2504.03579

thumb_up_off_alt3

chat_bubble_outline0

repeat7

shareShare