Meetween (@meetweeneu) 's Twitter Profile
Meetween

@meetweeneu

ID: 1746903113578012672

calendar_today15-01-2024 14:32:27

36 Tweet

71 Followers

19 Following

Meetween (@meetweeneu) 's Twitter Profile Photo

๐Ÿ”– Weekly pick from the #MeetweenScientificWatch: "Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research" - A massive, open dataset for advancing LM research. ๐ŸŒ๐Ÿ“š #AI #ML arxiv.org/abs/2402.00159

Meetween (@meetweeneu) 's Twitter Profile Photo

๐Ÿ”– Weekly pick from the #MeetweenScientificWatch: "ZeroST: Zero-Shot Speech Translation" - A novel framework leveraging multilingual models to perform speech-to-text translation without direct supervision. ๐Ÿ—ฃ๏ธ๐ŸŒ #AI #NLP #SpeechTranslation merl.com/publications/dโ€ฆ

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "A Suite for Acoustic Language Model Evaluation" - SALMon: A novel benchmark suite for evaluating models on noise, emotion, speaker identity, and more. ๐ŸŽถ๐Ÿ“Š #AI #SpeechProcessing arxiv.org/abs/2409.07437

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "SlowFast-LLaVA: A Strong Training-Free Baseline for Video LLMs" - A two-stream design captures spatial detail & temporal context, outperforming many fine-tuned models! ๐ŸŽฅ๐Ÿ“š arxiv.org/abs/2407.15841

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "Vcoder: Versatile Vision Encoders for Multimodal LLMs" - A novel encoder boosts object perception in MLLMs, outperforming GPT-4V in visual reasoning! ๐ŸŒ†๐Ÿ‘€ arxiv.org/abs/2312.14233

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "SpeechAlign: Aligning Speech Generation to Human Preferences" - A novel approach to improving speech language models with human feedback! ๐ŸŽค๐Ÿ”— arxiv.org/abs/2404.05600

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "Learning Source Disentanglement in Neural Audio Codec" - Introducing SD-Codec for domain-aware audio compression & source separation! ๐ŸŽต๐Ÿ”Š arxiv.org/abs/2409.11228

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "An image is worth 1/2 tokens after layer 2" - FastV slashes LVLM computational costs by up to 45% while maintaining top-notch performance! โšก๐Ÿ“ท arxiv.org/abs/2403.06764

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: "ST-LLM: Large Language Models are effective temporal learners" โ€“ ST-LLM excels in video-based dialogue with spatial-temporal modeling, setting new SOTA! ๐ŸŽฅ๐Ÿค– arxiv.org/abs/2404.00308

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: โ€œLongVU: Spatiotemporal adaptive compression for long video-language understandingโ€ โ€“ Efficiently processes hour-long videos with minimal detail loss. ๐ŸŽฅ๐Ÿค– arxiv.org/abs/2410.17434

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: โ€œGenerating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognitionโ€ โ€“ Boost ASR with synthetic multi-speaker data. ๐ŸŽ™๏ธ๐Ÿค– arxiv.org/abs/2408.09215

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: โ€œVideo-SALMONN: Speech-enhanced audio-visual large language modelsโ€ โ€“ Redefining video comprehension with speech-aware AV-LLMs and groundbreaking QA accuracy. ๐ŸŽฅ๐ŸŽค๐Ÿค– arxiv.org/abs/2406.15704

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: โ€œPaliGemma 2: A family of versatile VLMs for transferโ€ โ€“ An upgraded VLM family excelling in OCR, captioning, and radiography, setting new state-of-the-art benchmarks! ๐Ÿ“–๐Ÿ–ผ๏ธ๐Ÿ“Š arxiv.org/abs/2412.03555

Meetween (@meetweeneu) 's Twitter Profile Photo

Treasure Hunt at the Meetween project meeting in Shmoopy, Karlsruhe last week! ๐Ÿดโ€โ˜ ๏ธ๐Ÿ” Any guesses on what's inside the treasure? ๐Ÿ‘€๐Ÿ’ก #Meetween #TreasureHunt #Karlsruhe

Treasure Hunt at the Meetween project meeting in <a href="/KITKarlsruhe/">Shmoopy</a>, Karlsruhe last week! ๐Ÿดโ€โ˜ ๏ธ๐Ÿ” 

Any guesses on what's inside the treasure? ๐Ÿ‘€๐Ÿ’ก
 
#Meetween #TreasureHunt #Karlsruhe
Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: โ€œThe Ultra-Scale Playbook: Training LLMs on GPU Clustersโ€ โ€“ A deep dive into scaling LLM training from 1 to 1000+ GPUs, demystifying parallelism techniques with practical examples. ๐Ÿš€๐Ÿ–ฅ๏ธ๐Ÿ“– huggingface.co/spaces/nanotroโ€ฆ

Sara Papi (@sarapapi) 's Twitter Profile Photo

๐Ÿ“ข The evaluation period of the Instruction Following task at IWSLT 2025 just started! ๐Ÿ–ฅ๏ธ Consider submitting your speech-to-text system! The outputs can be easily uploaded on the SPEECHM platform developed in the Meetween project! โžก๏ธ iwslt2025.speechm.cloud.cyfronet.pl

Meetween (@meetweeneu) 's Twitter Profile Photo

Weekly pick from the #MeetweenScientificWatch: Overtrained language models are harder to fine-tune โ€” This study reveals how longer pretraining can harm downstream adaptability, introducing "catastrophic overtraining" as a real risk. ๐Ÿง ๐Ÿ“‰ arxiv.org/abs/2503.19206