Meetween (@meetweeneu) 's Twitter Profile
Meetween

@meetweeneu

ID: 1746903113578012672

calendar_today15-01-2024 14:32:27

36 Tweet

71 Followers

19 Following

Meetween (@meetweeneu) 's Twitter Profile Photo

🔖 Weekly pick from the #MeetweenScientificWatch: "Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research" - A massive, open dataset for advancing LM research. 🌍📚 #AI #ML arxiv.org/abs/2402.00159

Meetween (@meetweeneu) 's Twitter Profile Photo

🔖 Weekly pick from the #MeetweenScientificWatch: "ZeroST: Zero-Shot Speech Translation" - A novel framework leveraging multilingual models to perform speech-to-text translation without direct supervision. 🗣️🌐 #AI #NLP #SpeechTranslation merl.com/publications/d…

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "A Suite for Acoustic Language Model Evaluation" - SALMon: A novel benchmark suite for evaluating models on noise, emotion, speaker identity, and more. 🎶📊 #AI #SpeechProcessing arxiv.org/abs/2409.07437

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "SlowFast-LLaVA: A Strong Training-Free Baseline for Video LLMs" - A two-stream design captures spatial detail & temporal context, outperforming many fine-tuned models! 🎥📚 arxiv.org/abs/2407.15841

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "Vcoder: Versatile Vision Encoders for Multimodal LLMs" - A novel encoder boosts object perception in MLLMs, outperforming GPT-4V in visual reasoning! 🌆👀 arxiv.org/abs/2312.14233

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "SpeechAlign: Aligning Speech Generation to Human Preferences" - A novel approach to improving speech language models with human feedback! 🎤🔗 arxiv.org/abs/2404.05600

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "Learning Source Disentanglement in Neural Audio Codec" - Introducing SD-Codec for domain-aware audio compression & source separation! 🎵🔊 arxiv.org/abs/2409.11228

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "An image is worth 1/2 tokens after layer 2" - FastV slashes LVLM computational costs by up to 45% while maintaining top-notch performance! ⚡📷 arxiv.org/abs/2403.06764

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "ST-LLM: Large Language Models are effective temporal learners" – ST-LLM excels in video-based dialogue with spatial-temporal modeling, setting new SOTA! 🎥🤖 arxiv.org/abs/2404.00308

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: “LongVU: Spatiotemporal adaptive compression for long video-language understanding” – Efficiently processes hour-long videos with minimal detail loss. 🎥🤖 arxiv.org/abs/2410.17434

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: “Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition” – Boost ASR with synthetic multi-speaker data. 🎙️🤖 arxiv.org/abs/2408.09215

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: “Video-SALMONN: Speech-enhanced audio-visual large language models” – Redefining video comprehension with speech-aware AV-LLMs and groundbreaking QA accuracy. 🎥🎤🤖 arxiv.org/abs/2406.15704

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: “PaliGemma 2: A family of versatile VLMs for transfer” – An upgraded VLM family excelling in OCR, captioning, and radiography, setting new state-of-the-art benchmarks! 📖🖼️📊 arxiv.org/abs/2412.03555

Meetween (@meetweeneu) 's Twitter Profile Photo

Treasure Hunt at the Meetween project meeting in Shmoopy, Karlsruhe last week! 🏴‍☠️🔍 Any guesses on what's inside the treasure? 👀💡 #Meetween #TreasureHunt #Karlsruhe

Treasure Hunt at the Meetween project meeting in <a href="/KITKarlsruhe/">Shmoopy</a>, Karlsruhe last week! 🏴‍☠️🔍 

Any guesses on what's inside the treasure? 👀💡
 
#Meetween #TreasureHunt #Karlsruhe
Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: “The Ultra-Scale Playbook: Training LLMs on GPU Clusters” – A deep dive into scaling LLM training from 1 to 1000+ GPUs, demystifying parallelism techniques with practical examples. 🚀🖥️📖 huggingface.co/spaces/nanotro…

Sara Papi (@sarapapi) 's Twitter Profile Photo

📢 The evaluation period of the Instruction Following task at IWSLT 2025 just started! 🖥️ Consider submitting your speech-to-text system! The outputs can be easily uploaded on the SPEECHM platform developed in the Meetween project! ➡️ iwslt2025.speechm.cloud.cyfronet.pl

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: Overtrained language models are harder to fine-tune — This study reveals how longer pretraining can harm downstream adaptability, introducing "catastrophic overtraining" as a real risk. 🧠📉 arxiv.org/abs/2503.19206