Meetween (@meetweeneu) Twitter Tweets • TwiCopy

Jerry Spanakis 🟥 🦋 gerasimoss.bsky.social

a year ago

✨In January we explored synergies between 2 🇪🇺 projects (UM Department of Advanced Computing Sciences VOXReality & AI4LT @KIT Meetween). 🌍 Galvanizing to 👀 all this #MT work w/ Jan Niehues Yusuf Can Semerci Abderrahmane Issam Tu Anh Dinh Paweł Mąka + more! 🤝 Looking forward to closer collab 🔗 voxreality.eu/fostering-inno…

thumb_up_off_alt10

chat_bubble_outline0

repeat4

shareShare

Meetween

@meetweeneu

a year ago

🔖 Weekly pick from the #MeetweenScientificWatch: "Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research" - A massive, open dataset for advancing LM research. 🌍📚 #AI #ML arxiv.org/abs/2402.00159

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Meetween

@meetweeneu

a year ago

🔖 Weekly pick from the #MeetweenScientificWatch: "ZeroST: Zero-Shot Speech Translation" - A novel framework leveraging multilingual models to perform speech-to-text translation without direct supervision. 🗣️🌐 #AI #NLP #SpeechTranslation merl.com/publications/d…

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: "A Suite for Acoustic Language Model Evaluation" - SALMon: A novel benchmark suite for evaluating models on noise, emotion, speaker identity, and more. 🎶📊 #AI #SpeechProcessing arxiv.org/abs/2409.07437

thumb_up_off_alt10

chat_bubble_outline0

repeat3

shareShare

fbk_stek

@fbk_stek

a year ago

Few days left for registration. The SpeechTek Lab will contribute with 3 papers, which are results of our efforts under ELOQUENCE AI , Meetween ,FondazioneFAIR

thumb_up_off_alt4

chat_bubble_outline0

repeat3

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: "SlowFast-LLaVA: A Strong Training-Free Baseline for Video LLMs" - A two-stream design captures spatial detail & temporal context, outperforming many fine-tuned models! 🎥📚 arxiv.org/abs/2407.15841

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: "Vcoder: Versatile Vision Encoders for Multimodal LLMs" - A novel encoder boosts object perception in MLLMs, outperforming GPT-4V in visual reasoning! 🌆👀 arxiv.org/abs/2312.14233

thumb_up_off_alt7

chat_bubble_outline0

repeat3

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: "SpeechAlign: Aligning Speech Generation to Human Preferences" - A novel approach to improving speech language models with human feedback! 🎤🔗 arxiv.org/abs/2404.05600

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: "Learning Source Disentanglement in Neural Audio Codec" - Introducing SD-Codec for domain-aware audio compression & source separation! 🎵🔊 arxiv.org/abs/2409.11228

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Meetween

@meetweeneu

a year ago

Great result from a paper funded by the #Meetween project!

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: "An image is worth 1/2 tokens after layer 2" - FastV slashes LVLM computational costs by up to 45% while maintaining top-notch performance! ⚡📷 arxiv.org/abs/2403.06764

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: "ST-LLM: Large Language Models are effective temporal learners" – ST-LLM excels in video-based dialogue with spatial-temporal modeling, setting new SOTA! 🎥🤖 arxiv.org/abs/2404.00308

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: “LongVU: Spatiotemporal adaptive compression for long video-language understanding” – Efficiently processes hour-long videos with minimal detail loss. 🎥🤖 arxiv.org/abs/2410.17434

thumb_up_off_alt5

chat_bubble_outline0

repeat2

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: “Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition” – Boost ASR with synthetic multi-speaker data. 🎙️🤖 arxiv.org/abs/2408.09215

thumb_up_off_alt7

chat_bubble_outline0

repeat2

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: “Video-SALMONN: Speech-enhanced audio-visual large language models” – Redefining video comprehension with speech-aware AV-LLMs and groundbreaking QA accuracy. 🎥🎤🤖 arxiv.org/abs/2406.15704

thumb_up_off_alt7

chat_bubble_outline0

repeat2

shareShare

Meetween

@meetweeneu

a year ago

Weekly pick from the #MeetweenScientificWatch: “PaliGemma 2: A family of versatile VLMs for transfer” – An upgraded VLM family excelling in OCR, captioning, and radiography, setting new state-of-the-art benchmarks! 📖🖼️📊 arxiv.org/abs/2412.03555

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare

Meetween

@meetweeneu

10 months ago

Treasure Hunt at the Meetween project meeting in Shmoopy, Karlsruhe last week! 🏴‍☠️🔍 Any guesses on what's inside the treasure? 👀💡 #Meetween #TreasureHunt #Karlsruhe

Treasure Hunt at the Meetween project meeting in <a href="/KITKarlsruhe/">Shmoopy</a>, Karlsruhe last week! 🏴‍☠️🔍

Any guesses on what's inside the treasure? 👀💡

#Meetween #TreasureHunt #Karlsruhe

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

Meetween

@meetweeneu

9 months ago

Weekly pick from the #MeetweenScientificWatch: “The Ultra-Scale Playbook: Training LLMs on GPU Clusters” – A deep dive into scaling LLM training from 1 to 1000+ GPUs, demystifying parallelism techniques with practical examples. 🚀🖥️📖 huggingface.co/spaces/nanotro…

thumb_up_off_alt2

chat_bubble_outline0

repeat2

shareShare

Sara Papi

@sarapapi

8 months ago

📢 The evaluation period of the Instruction Following task at IWSLT 2025 just started! 🖥️ Consider submitting your speech-to-text system! The outputs can be easily uploaded on the SPEECHM platform developed in the Meetween project! ➡️ iwslt2025.speechm.cloud.cyfronet.pl

thumb_up_off_alt15

chat_bubble_outline0

repeat9

shareShare

Meetween

@meetweeneu

6 months ago

Weekly pick from the #MeetweenScientificWatch: Overtrained language models are harder to fine-tune — This study reveals how longer pretraining can harm downstream adaptability, introducing "catastrophic overtraining" as a real risk. 🧠📉 arxiv.org/abs/2503.19206

thumb_up_off_alt0

chat_bubble_outline0

repeat1

shareShare