Soujanya Poria (@soujanyaporia) Twitter Tweets • TwiCopy

Rada Mihalcea

2 years ago

“What should I work on?” is a question we hear more & more often from NLP students, during a time when the media rhetoric is that “it’s been all solved” Turns out there are many NLP research areas rich for exploration—here is our answer from 20+ students arxiv.org/abs/2305.12544

thumb_up_off_alt635

chat_bubble_outline10

repeat169

shareShare

Soujanya Poria

@soujanyaporia

10 months ago

🧩 Introducing PuzzleVQA: A dataset consisting of puzzles based on abstract patterns. It evaluates large multimodal models using fundamental concepts like colors, numbers, sizes, and shapes. 🔍 Our experiments reveal these models struggle to generalize with simple abstract

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

Henry (Wei) Han

@cheqianghan

10 months ago

🚨 [NAACL 2024] Thrilled to share our latest research paper SEALING (SElf-Adaptive sampLING), a study on the frame sampling methods for video question answering task on image--text models (ITMs, or VLMs). 📖 paper: arxiv.org/abs/2307.04192 💻 code: github.com/declare-lab/Se…

thumb_up_off_alt4

chat_bubble_outline2

repeat2

shareShare

arXiv Sound

@arxivsound

9 months ago

``HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks,'' Yingting Li, Rishabh Bhardwaj, Ambuj Mehrish, Bo Cheng, Soujanya Poria, ift.tt/geBK4H2

thumb_up_off_alt17

chat_bubble_outline0

repeat6

shareShare

Gradio

@gradio

9 months ago

🎉𝐓𝐚𝐧𝐠𝐨 𝟐: A new text-to-audio model that aligns diffusion-based generations🎧 🔍Tango 2 from Declare-lab captures concepts and their temporal ordering in the generated audio, enhancing the audio-text alignment (one event followed by another event in generation)

thumb_up_off_alt72

chat_bubble_outline1

repeat20

shareShare

Sylvain Filoni

@fffiloni

9 months ago

📢 I added the new Tango-2 text-to-audio model to the Image-to-SFX space on Hugging Face, please try it and compare with alternatives —› huggingface.co/spaces/fffilon…

📢 I added the new Tango-2 text-to-audio model to the Image-to-SFX space on <a href="/huggingface/">Hugging Face</a>, please try it and compare with alternatives —› huggingface.co/spaces/fffilon…

thumb_up_off_alt48

chat_bubble_outline2

repeat9

shareShare

Soujanya Poria

@soujanyaporia

7 months ago

Proud moment as we find our research "Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense" has been recognized with the Social Impact Award at #NAACL2024. Kudos to all the co-authors!

thumb_up_off_alt15

chat_bubble_outline0

repeat0

shareShare

AI Bites | YouTube Channel

@ai_bites

7 months ago

DARWIN is a decode-time alignment technique that uses a reward-guided tree search framework to align the LLM and achieve comparable performance to preference optimization on 2 instruction following benchmarks. Paper: Reward Steering with Evolutionary Heuristics for Decoding-time

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

arXiv Sound

@arxivsound

7 months ago

``Improving Text-To-Audio Models with Synthetic Captions,'' Zhifeng Kong, Sang-gil Lee, Deepanway Ghosal, Navonil Majumder, Ambuj Mehrish, Rafael Valle, Soujanya Poria, Bryan Catanzaro, ift.tt/DyCxpJE

thumb_up_off_alt17

chat_bubble_outline0

repeat4

shareShare

Rafael Valle

@rafaelvalleart

6 months ago

Synthetic labels are amazing! Do you need an audio labelling machine? Audio Flamingo checkpoints are available on github.com/NVIDIA/audio-f… ...and pre-training with synthetic labels from Audio Flamingo gives large improvements in text-to-audio models arxiv.org/abs/2406.15487

thumb_up_off_alt64

chat_bubble_outline3

repeat16

shareShare

Animesh Mukherjee

@animesh43061078

6 months ago

#indoml2024 gearing up with a stellar set of speakers (indoml.in) Bhramar Mukherjee Ranjay Krishna Chandrajit Bajaj Soujanya Poria Munmun De Choudhury, PhD Arindam Banerjee ... stay tuned for more updates ...

#indoml2024 gearing up with a stellar set of speakers (indoml.in)
<a href="/BhramarBioStat/">Bhramar Mukherjee</a> <a href="/RanjayKrishna/">Ranjay Krishna</a> Chandrajit Bajaj <a href="/soujanyaporia/">Soujanya Poria</a> <a href="/munmun10/">Munmun De Choudhury, PhD</a> Arindam Banerjee ... stay tuned for more updates ...

thumb_up_off_alt18

chat_bubble_outline0

repeat4

shareShare

Zonglin Yang

@yang_zy223

5 months ago

Can LLM generate novel and valid research hypotheses, and therefore work as a copilot for scientists? For the first time, we find that it CAN!!! Accordingly we propose a new paradigm for research with a research copilot. Join us at poster session, Monday 12:45 pm! #ACL2024

thumb_up_off_alt55

chat_bubble_outline5

repeat12

shareShare

Soujanya Poria

@soujanyaporia

5 months ago

For me, it was the highlight of the trip. #ACL2024

thumb_up_off_alt14

chat_bubble_outline0

repeat0

shareShare